This repo is a modified version from the Coursera Guided Project Introduction to RAG by Alfredo Deza from Duke University.
The well-known wine dataset.
- Qdrant - in-memory vector database.
- Sentence Transformers - embeddings creation.
- Groq's Python API - connect to the LLM after retrieving the vectors response from Qdrant.
- Llamafile - connect to the LLM locally (alternative to GroqAPI compatible key and endpoint)
- Phi-2 model - using bc it is small (approx 2GB) so faster to play with. Download the model from the Llamafile repository and run it locally.
Create virtual environment:
python3 -m venv .venv
source .venv/bin/activate
Install dependencies:
.venv/bin/pip install -r requirements.txt
If this project is out of date or the req file install is acting up, here are the installs:
pip install --upgrade pip setuptools build wheel ipykernel ipywidgets jupyter pandas qdrant-client groq sentence-transformers
The groq key in a .env file will be required.
GROQ_API_KEY=<keeeeeyyyyy>
Machine Learning:
- MLOps Machine Learning Operations Specialization
- Open Source Platforms for MLOps
- Python Essentials for MLOps
Data Engineering: