Government Document Chatbot using langchain #3

AleenDhar · 2024-04-14T03:19:49Z

Hi @amit-s19, I am a 2nd year BTech student and here is my solution to the problem :-

I am using Langchain and Gemini-pro to achieve the following results:

Result

The steps are as follows:

Using PyPDF2 to read the pdf document
Using langchain.text_splitter to split the text written in the pdf
Using GoogleGenerativeAIEmbeddings for creating text embeddings
I have used FAISS as a vectorstore but we can easily replace that with ChromaDB or Pinecone.
I have used a basic prompt template and question_answering chain to talk to the PDF data
Using Streamlit to create a simple user interface.

The reason why I am using gemini-pro is because it has multi-language support.
later we can replace it with our own model fine-tuned in hindi or any other language.

we can use gemini-visison-pro to read the image data inside the pdf can convert it into text, which can later be used for question answering

AleenDhar and others added 4 commits April 14, 2024 07:59

Q/A pdf in both English and Hindi

60fe6b4

Update README.md

885394f

Update .env

3220e71

Update README.md

1ed85ce

AleenDhar changed the title ~~[DMP 2024]: Government Document Chatbot: Streamlining Access and Assistance~~ Government Document Chatbot using langchain Apr 14, 2024

AleenDhar added 9 commits April 14, 2024 08:58

Delete QA_PDF for GOV data by Aleen Dhar/myenv directory

ca14386

Delete QA_PDF for GOV data by Aleen Dhar/faiss_index directory

6ad887c

Update README.md

34c8cd0

Update README.md

4001343

Update README.md

1bdcda8

Update README.md

b7707f0

Update README.md

683185f

Update README.md

78da45c

Update README.md

aaf3230

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Government Document Chatbot using langchain #3

Government Document Chatbot using langchain #3

AleenDhar commented Apr 14, 2024 •

edited

Loading

Government Document Chatbot using langchain #3

Are you sure you want to change the base?

Government Document Chatbot using langchain #3

Conversation

AleenDhar commented Apr 14, 2024 • edited Loading

Result

AleenDhar commented Apr 14, 2024 •

edited

Loading