A Flask-based microservice that generates answers over a set of FAQs for a given input. It doesn't maintain session state and uses a 1:1 question-answer response mechanism for now.
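For orientation, here is a minimal sketch of what the service surface might look like, assuming Flask's standard JSON request handling; the `/ask` route and the `answer_question()` helper are illustrative names, not the service's actual API.

```python
from flask import Flask, jsonify, request

app = Flask(__name__)

def answer_question(question: str) -> str:
    # Placeholder for the retrieval + LLM pipeline sketched further below.
    return "..."

@app.route("/ask", methods=["POST"])
def ask():
    # Stateless 1:1 QA: one question in, one answer out; no session is kept.
    question = request.get_json()["question"]
    return jsonify({"answer": answer_question(question)})

if __name__ == "__main__":
    app.run()
```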
Key dimensions in building such a system using LLMs:
Loading the data for grounding
: Data is stored in a vector store as embeddings, enabling similarity search (cosine similarity for now). A retrieval sketch follows this list.

Generating the prompt
: Using the ChatML format from OpenAI to build a prompt that relies on the retrieved data alone, instructing the model not to hallucinate.

Using LLMs for forming sentences
: Sending the prompt to the model to generate a summarized answer; a prompt-and-dispatch sketch also follows this list.
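A minimal sketch of the grounding step, assuming OpenAI embeddings and a brute-force cosine-similarity search over NumPy arrays; the vector store, embedding model, and FAQ data shown here are assumptions, not details from this README.

```python
# Sketch only: embeds FAQ entries and retrieves the best matches by cosine
# similarity. Assumes the OpenAI Python SDK (>= 1.0) and NumPy; the real
# service's vector store and embedding model are not specified here.
import numpy as np
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

faqs = [
    {"q": "How do I reset my password?", "a": "Use the 'Forgot password' link."},
    {"q": "How do I contact support?", "a": "Email support@example.com."},
]

def embed(texts: list[str]) -> np.ndarray:
    resp = client.embeddings.create(model="text-embedding-3-small", input=texts)
    return np.array([d.embedding for d in resp.data])

# Build the "index" once; the README notes it is rebuilt daily.
faq_vectors = embed([f["q"] + " " + f["a"] for f in faqs])

def retrieve(question: str, k: int = 1) -> list[dict]:
    qv = embed([question])[0]
    # Cosine similarity = dot product scaled by the vector norms.
    sims = (faq_vectors @ qv) / (
        np.linalg.norm(faq_vectors, axis=1) * np.linalg.norm(qv)
    )
    return [faqs[i] for i in np.argsort(sims)[::-1][:k]]
```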
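And a sketch of the prompt-plus-dispatch step, expressing ChatML as the chat-messages format the OpenAI SDK accepts. It reuses the hypothetical `retrieve()` helper from the grounding sketch; the model name and the system-prompt wording are assumptions.

```python
# Sketch of prompt construction and dispatch. The system message pins the
# model to the retrieved FAQ excerpts to discourage hallucination.
from openai import OpenAI

client = OpenAI()

def answer_question(question: str) -> str:
    context = "\n\n".join(
        f"Q: {f['q']}\nA: {f['a']}" for f in retrieve(question, k=3)
    )
    messages = [
        {
            "role": "system",
            "content": (
                "Answer using ONLY the FAQ excerpts below. If the answer "
                "is not in them, say you don't know.\n\n" + context
            ),
        },
        {"role": "user", "content": question},
    ]
    resp = client.chat.completions.create(model="gpt-4o-mini", messages=messages)
    return resp.choices[0].message.content
```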
While the happy path works, the operational side effects are significant:
MLOps aspect
: Daily rebuilding of the data index can degrade search quality quite a bit. Many things can affect the LLM responses and need to be tracked with some metric; what that metric looks like still needs to be explored (a candidate check is sketched after this list).

Scale
: Both the size of the data set consumed for vector search and the number of tokens per LLM dispatch need to be considered (see the token-budget sketch below).
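The metric is explicitly an open question here; one plausible candidate is recall@k against a small hand-labelled golden set, re-run after every index rebuild. Everything below (the `golden_set` data, the 0.9 threshold, the reuse of the hypothetical `retrieve()` helper) is illustrative, not the project's chosen metric.

```python
# Hypothetical regression check run after each daily index rebuild:
# recall@k over a hand-labelled golden set of question -> expected-FAQ pairs.
golden_set = [
    {"question": "I forgot my password", "expected_q": "How do I reset my password?"},
    {"question": "Who do I email for help?", "expected_q": "How do I contact support?"},
]

def recall_at_k(k: int = 3) -> float:
    hits = sum(
        any(f["q"] == case["expected_q"] for f in retrieve(case["question"], k=k))
        for case in golden_set
    )
    return hits / len(golden_set)

# Alert (or block the rollout) if quality drops after a rebuild.
assert recall_at_k(k=3) >= 0.9, "index rebuild degraded retrieval quality"
```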
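On the token side, a sketch of keeping the dispatched prompt within a budget using `tiktoken`; the `cl100k_base` encoding and the 3,000-token budget are assumptions that depend on whichever model is actually deployed.

```python
import tiktoken

# cl100k_base is the encoding used by many OpenAI chat models; pick the
# encoding that matches the deployed model.
enc = tiktoken.get_encoding("cl100k_base")

def trim_to_budget(faq_snippets: list[str], budget: int = 3000) -> list[str]:
    # Keep adding retrieved snippets until the token budget is exhausted,
    # so the prompt never exceeds the model's context window.
    kept, used = [], 0
    for snippet in faq_snippets:
        n = len(enc.encode(snippet))
        if used + n > budget:
            break
        kept.append(snippet)
        used += n
    return kept
```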