Skip to content

In this project, we leverage Weaviate, a vector database, to power our retrieval-augmented generation (RAG) application. Weaviate enables efficient vector similarity search, which is crucial for building effective RAG systems. Additionally, we use local language model (LLM) and embedding models.

Notifications You must be signed in to change notification settings

AyaFergany/RAG-App-Using-a-LLM

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 

Repository files navigation

RAG-App-Using-a-LLM

In this project, we leverage Weaviate, a vector database, to power our retrieval-augmented generation (RAG) application. Weaviate enables efficient vector similarity search, which is crucial for building effective RAG systems. Additionally, we use local language model (LLM) and embedding models to ensure privacy and confidentiality of sensitive information.

Here are the key components of our project:

Weaviate Integration

  • Weaviate serves as our vector database. It allows us to ingest and retrieve documents efficiently.
  • To connect to Weaviate, you’ll need the Weaviate API key and endpoint. You can find these details in the Weaviate dashboard or configure them programmatically.

Local Embedding and LLM Models

  • I’ve chosen the sentence_transformers/all-mpnet-base-v2 embedding model and the zephyr-7b-alpha LLM from Hugging Face.
  • These models are open source and well-suited for our use case.
  • Intended uses for sentence_transformers/all-mpnet-base-v2
    • this model is intented to be used as a sentence and short paragraph encoder. Given an input text, it ouptuts a vector which captures the semantic information. The sentence vector may be used for information retrieval, clustering or sentence similarity tasks.

By default, input text longer than 384 word pieces is truncated.

Ingesting HubermanLabs Podcasts into Weaviate

Creating a Chatbot with Gradio

  • Gradio is an excellent library for building interactive interfaces for LLMs, including chatbots.
  • I’ve created a chatbot using Gradio, which allows users to interact with the model through a user-friendly interface.
  • The chatbot responds to user input and displays the conversation history.

Alt Text

About

In this project, we leverage Weaviate, a vector database, to power our retrieval-augmented generation (RAG) application. Weaviate enables efficient vector similarity search, which is crucial for building effective RAG systems. Additionally, we use local language model (LLM) and embedding models.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published