rag_llama_demo

Retrieval-Augmented Generation (RAG) for Llama models, using the e5 vector embedding model, for private LLM usage. Original blog post on brandonharris.io.

This repo includes a notebook, a raw .py script, and a requirements.txt for reference. The code uses llama_index with ehartford/Wizard-Vicuna-13B-Uncensored for interactive retrieval and e5 for embeddings.

The book used as an example in this code and hosted in this repo is in the public domain and was sourced from Project Gutenberg.
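
For readers new to RAG, the retrieve-then-prompt flow can be illustrated with a minimal, dependency-free sketch. This is not the code from the notebook: the `embed` function below is a toy bag-of-words stand-in for the e5 embedding model, the sample chunks are invented for illustration, and the names `embed`, `cosine`, `retrieve`, and `build_prompt` are hypothetical. In the actual demo, llama_index handles chunking, embedding, and querying the LLM.

```python
import math
import re
from collections import Counter


def embed(text: str) -> Counter:
    """Toy stand-in for an embedding model (NOT e5): a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query: str, chunks: list, top_k: int = 2) -> list:
    """Return the top_k chunks most similar to the query."""
    q = embed(query)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    return ranked[:top_k]


def build_prompt(query: str, context: list) -> str:
    """Assemble a prompt that asks the LLM to answer from the retrieved context."""
    joined = "\n".join(f"- {c}" for c in context)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{joined}\n\n"
        f"Question: {query}\nAnswer:"
    )


# Invented sample chunks; in the demo these would come from the example book.
chunks = [
    "The rocket left the lunar base at dawn.",
    "The professor studied ancient insects in his laboratory.",
    "A storm flooded the river valley overnight.",
]
query = "When did the rocket leave the lunar base?"
context = retrieve(query, chunks, top_k=1)
prompt = build_prompt(query, context)
print(prompt)
```

The resulting prompt is what would be passed to the local LLM. Swapping the toy `embed` for a real embedding model changes the quality of the ranking but not the overall shape of the pipeline.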

Repository files:
README.md
Super-Science-December-1930.epub
rag_llama_demo.ipynb
rag_llama_demo.py
requirements.txt
