Load a PDF file and ask questions via llama_index, LangChain, and an LLM endpoint hosted on OctoAI
- Install the requirements:

```bash
pip install -r requirements.txt -U
```
To run our example app, follow these four simple steps:
- Clone the Llama-2-7b demo template to your OctoAI account by visiting https://octoai.cloud/models/llama-2-7b-chat-demo then clicking "Deploy Endpoint."
- If you want to use a different LLM, you can select another demo template. You can also containerize the model and create a custom OctoAI endpoint yourself by following the "Build a Container from Python" and "Create a Custom Endpoint from a Container" guides.
- Paste your endpoint URL in a file called `.env` in the root directory of the project:

```
ENDPOINT_URL=<your endpoint URL here>
```
- Get an API token from your OctoAI account page and paste it into the same `.env` file:

```
OCTOAI_API_TOKEN=<your token here>
```
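Both scripts read these two values at runtime. Here is a minimal sketch of how they might be loaded, assuming the `python-dotenv` package is used (the variable names match the `.env` entries above):

```python
import os

from dotenv import load_dotenv

# Read ENDPOINT_URL and OCTOAI_API_TOKEN from the .env file in the
# project root into the process environment.
load_dotenv()

endpoint_url = os.environ["ENDPOINT_URL"]
octoai_api_token = os.environ["OCTOAI_API_TOKEN"]
```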
- Run the `chat_main.py` script to chat with the hosted LLM endpoint:

```bash
python3 chat_main.py
```
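Under the hood, a chat turn is just an authenticated HTTP request to your endpoint. The exact request and response schema depends on the container you deployed, so the JSON fields in this sketch are purely illustrative:

```python
import os

import requests
from dotenv import load_dotenv

load_dotenv()

# Hypothetical request shape: adjust the route and JSON fields to
# match the schema of your deployed endpoint.
response = requests.post(
    os.environ["ENDPOINT_URL"],
    headers={"Authorization": f"Bearer {os.environ['OCTOAI_API_TOKEN']}"},
    json={"prompt": "What is OctoAI?"},
    timeout=60,
)
response.raise_for_status()
print(response.json())
```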
Or, to ask questions about a PDF instead:
- Select a file from the menu, or replace the default file `file.pdf` with the PDF you want to use.
- Run the `pdf_qa_main.py` script to ask questions about your PDF file via llama_index, LangChain, and the hosted endpoint (a sketch of this flow follows the list):

```bash
python3 pdf_qa_main.py
```
- Ask any questions about the content of the PDF.
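The PDF QA flow is roughly: load the PDF, index it with llama_index, and route generation through the OctoAI endpoint via LangChain. A minimal sketch under those assumptions, using LangChain's `OctoAIEndpoint` wrapper and a local Hugging Face embedding model; the exact classes and library versions used in this repo may differ (this sketch assumes a pre-0.10 llama_index API):

```python
import os

from dotenv import load_dotenv
from langchain.embeddings import HuggingFaceEmbeddings
from langchain.llms.octoai_endpoint import OctoAIEndpoint
from llama_index import (
    LangchainEmbedding,
    ServiceContext,
    SimpleDirectoryReader,
    VectorStoreIndex,
)
from llama_index.llms import LangChainLLM

load_dotenv()

# Wrap the hosted OctoAI endpoint as a LangChain LLM, then adapt it
# for llama_index. The model_kwargs values are illustrative.
llm = LangChainLLM(
    llm=OctoAIEndpoint(
        endpoint_url=os.environ["ENDPOINT_URL"],
        octoai_api_token=os.environ["OCTOAI_API_TOKEN"],
        model_kwargs={"max_new_tokens": 256, "temperature": 0.75},
    )
)

# Use a local embedding model so indexing does not require another API key.
embed_model = LangchainEmbedding(HuggingFaceEmbeddings())

service_context = ServiceContext.from_defaults(llm=llm, embed_model=embed_model)

# Load the PDF, build a vector index over it, and ask a question.
documents = SimpleDirectoryReader(input_files=["file.pdf"]).load_data()
index = VectorStoreIndex.from_documents(documents, service_context=service_context)
query_engine = index.as_query_engine()
print(query_engine.query("What is this document about?"))
```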
For detailed setup steps, please see https://docs.octoai.cloud/docs/setup-steps-for-the-qa-app