Chat With Docs (CWD): Local Document Question-Answering System

CWD is a powerful tool that allows you to ask questions about your documents without an internet connection, leveraging the capabilities of Large Language Models (LLMs).

Setup

Clone this repository.
Install the required dependencies:
```
pip install -r requirements.txt
```
Download the LLM model (default: ggml-gpt4all-j-v1.3-groovy.bin) and place it in a directory of your choice.
Update the .env file in the root directory with appropriate values for your setup.
- PERSIST_DIRECTORY is directory of stored embeddings.
- SOURCE_DIRECTORY is directory of documents.
- You can change the embedding model by modifying the EMBEDDINGS_MODEL_NAME in the .env file.
- To use a different LLM, update the MODEL_TYPE and MODEL_PATH in the .env file.
- MODEL_N_CTX is the model's context length to use for chunking purposes.

Usage

Place your documents in the source_documents directory.
Run the document ingestion script:
```
python load_docs.py
```
This script processes the documents, creates embeddings, and stores them in a local vector database.
Start the question-answering system:
```
python docGPT.py
```
Enter your questions when prompted. Type 'exit' to quit the program.

How it works

load_docs.py processes documents from the source_documents directory, splits them into chunks, creates embeddings using the specified model, and stores them in a Chroma vector database.
docGPT.py sets up the question-answering system using the LangChain library. It loads the stored embeddings and the specified LLM model.
When you ask a question, the system retrieves relevant document chunks from the vector database and uses the LLM to generate an answer based on the retrieved information.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
db_vector		db_vector
doc_gpt		doc_gpt
.gitignore		.gitignore
README.md		README.md
constants.py		constants.py
docGPT.py		docGPT.py
load.ipynb		load.ipynb
load_docs.py		load_docs.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Chat With Docs (CWD): Local Document Question-Answering System

Setup

Usage

How it works

About

Uh oh!

Releases

Packages

Uh oh!

Languages

ashishanand7/chat-with-docs

Folders and files

Latest commit

History

Repository files navigation

Chat With Docs (CWD): Local Document Question-Answering System

Setup

Usage

How it works

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages