RAG Application with Phi3 Model and ChromaDB

Welcome to the RAG (Retrieval-Augmented Generation) application repository! This project leverages the Phi3 model and ChromaDB to read PDF documents, embed their content, store the embeddings in a database, and perform retrieval-augmented generation.

Introduction

This repository contains a RAG application that reads PDF files, generates embeddings using the Alibaba-NLP/gte-large-en-v1.5 model, stores these embeddings in ChromaDB, and performs retrieval-augmented generation to provide contextual answers based on the embedded content. The system is designed to enhance the capability of answering queries by leveraging the context from the embedded documents.

Features

PDF Reading: Extracts text content from PDF documents.
Embedding Generation: Utilizes the Alibaba-NLP/gte-large-en-v1.5 model to generate embeddings for the extracted text.
Database Storage: Stores the generated embeddings in ChromaDB.
Retrieval-Augmented Generation: Retrieves relevant embeddings from the database and generates contextually accurate responses.

Installation

Note: On first installation, this script will download the necessary NLTK stopwords, the NLP embedding model, and the large language model (LLM). As a result, the initial execution may take longer than subsequent runs.

To get started with the RAG application, follow these steps:

Download Ollama on to your desktop:

This is required to run LLM model locally. Download Ollama

Clone the repository:

git clone https://github.com/sankethsj/phi3-rag-application.git
cd phi3-rag-application

Create a virtual environment and activate it:

python -m venv venv
source venv/bin/activate  # On Windows, use `venv\Scripts\activate`

Install the required dependencies:
```
pip install -r requirements.txt
```

Usage

Run the script

python main.py

(Optional) Run the notebook

This is more interactive, you can see what's going in each step.

Open RAG-Workbook.ipynb and run all cells.

Contributing

We welcome contributions to enhance the capabilities of this RAG application. To contribute, please follow these steps:

Fork the repository.
Create a new branch for your feature or bugfix:
```
git checkout -b feature-name
```
Make your changes and commit them with descriptive messages.
Push your changes to your fork:
```
git push origin feature-name
```
Create a pull request to the main repository.

License

This project is licensed under the MIT License. See the LICENSE file for more details.

Thank you for using the RAG application with the Phi3 model and ChromaDB. If you encounter any issues or have any questions, please feel free to open an issue on this repository.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
docs		docs
rag		rag
.gitignore		.gitignore
LICENSE		LICENSE
RAG-Workbook.ipynb		RAG-Workbook.ipynb
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RAG Application with Phi3 Model and ChromaDB

Table of Contents

Introduction

Features

Installation

Usage

Run the script

(Optional) Run the notebook

Contributing

License

About

Releases

Packages

Languages

License

sankethsj/phi3-rag-application

Folders and files

Latest commit

History

Repository files navigation

RAG Application with Phi3 Model and ChromaDB

Table of Contents

Introduction

Features

Installation

Usage

Run the script

(Optional) Run the notebook

Contributing

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages