RAG and Grounding

This directory provides a curated list of notebooks that explore Retrieval Augmented Generation (RAG), grounding techniques, knowledge bases, grounded generation, and related topics like vector search and semantic search.

All of these links are notebooks or other examples in this repository, but are indexed here for your convenience.

What is RAG and Grounding?

Ungrounded generation relies on the LLM's training data alone and is prone to hallucinations when the model doesn't have all the right facts.
Grounding an LLM with relevant facts provides fresh and potentially private data to the model as part of its input or prompt.
RAG is a technique that retrieves relevant facts, often via search, and provides them to the LLM.
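
The retrieve-then-prompt flow described above can be sketched in a few lines. This is a minimal illustration only, not how any notebook in this directory implements it: the keyword-overlap retriever stands in for a real search system, the corpus is made-up data, and the function names (`tokenize`, `retrieve`, `build_grounded_prompt`) are hypothetical. No model is called; the sketch stops at the grounded prompt you would send to an LLM.

```python
import re


def tokenize(text: str) -> set[str]:
    """Lowercase the text and split it into a set of word tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def retrieve(query: str, corpus: list[str], top_k: int = 2) -> list[str]:
    """Rank facts by shared query words and keep the top_k that overlap at all."""
    query_tokens = tokenize(query)
    scored = [(len(query_tokens & tokenize(fact)), fact) for fact in corpus]
    # Python's sort is stable, so ties keep their original corpus order.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [fact for score, fact in scored[:top_k] if score > 0]


def build_grounded_prompt(query: str, facts: list[str]) -> str:
    """Place the retrieved facts in the prompt so the model answers from them."""
    context = "\n".join(f"- {fact}" for fact in facts)
    return (
        "Answer using only the facts below. If they are insufficient, say so.\n"
        f"Facts:\n{context}\n"
        f"Question: {query}"
    )


# Hypothetical private data that a model would not have seen during training.
corpus = [
    "The support desk is open from 9am to 5pm on weekdays.",
    "Refunds are processed within 14 days of a return.",
    "The office cafeteria serves lunch at noon.",
]
question = "How many days do refunds take?"
prompt = build_grounded_prompt(question, retrieve(question, corpus, top_k=1))
print(prompt)
```

In a real application the retriever would typically be a search engine or vector index, and the prompt would be passed to the model of your choice.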

Using RAG and Grounding to improve generations and reduce hallucinations is becoming commonplace. Doing so well, and generating extremely high-quality results that are entirely grounded in the most relevant facts, potentially from a very large corpus and at high scale, is an art. Vertex AI provides a platform of tools and APIs that help you build and maintain a great search engine and RAG application, along with the evaluations needed to hill climb on quality.

Measuring RAG/Grounding Quality

See this blog post: How to evaluate generated answers from RAG at scale on Vertex AI for a walkthrough.

evaluate_rag_gen_ai_evaluation_service_sdk.ipynb: Evaluates RAG systems using the Gen AI Evaluation Service SDK.
ragas_with_gemini.ipynb: Uses Ragas with Gemini to evaluate RAG systems.
deepeval_with_gemini.ipynb: Uses DeepEval with Gemini to evaluate RAG systems.

Out of the Box RAG/Grounding

Vertex AI Search - sample Web App: Take a look at this sample web app using Vertex AI Search, which is a flexible and easy to use "out of the box" solution for search & RAG/Grounding.
bulk_question_answering.ipynb: Answers multiple questions using a search system.
contract_analysis.ipynb, question_answering.ipynb, rag_google_documentation.ipynb: Showcase specific RAG use cases.
search_data_blending_with_gemini_summarization.ipynb: Demonstrates calling a search app that blends information from multiple stores (GCS, BQ, site) and summarizes search snippets and responses using the Gemini Pro model.
vertexai_search_options.ipynb: Shows how to use Vertex AI Search in conjunction with the Gemini Pro model to retrieve and summarize data across multiple data stores within Google Cloud Platform (GCP). It highlights how the Gemini Pro model is able to formulate a summary of user-specific prompts based on the retrieved snippets and citations from Vertex AI Search.

Build your own RAG/Grounding

We have several notebooks and examples for specific use cases or types of data that may require a custom RAG and Grounding solution. We also have many products that you can use to build a RAG/Grounding pipeline of your own, or add to an existing RAG and Grounding solution.

Vertex AI APIs for building search and RAG lists several APIs that you can use in isolation or in combination.
LlamaIndex on Vertex lets you assemble a RAG search using a popular OSS framework and components from Google or open source.
This end-to-end DIY RAG example is a notebook written in LangChain that uses some of these APIs.
The Google Cloud Architecture Center has reference architectures for building RAG infrastructure with GKE, or with AlloyDB and a few Vertex services.

Search

Vertex AI Search is an end-to-end search engine that delivers high-quality grounded generation and RAG at scale, built in.

Vertex AI Vector Search is an extremely performant vector database that powers Vertex AI Search. Other databases like AlloyDB and BigQuery also offer vector search, each with its own performance and retrieval characteristics.
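
To make the idea of vector search concrete, here is a minimal sketch of exact nearest-neighbor lookup by cosine similarity. It is an illustration under stated assumptions, not how Vertex AI Vector Search, AlloyDB, or BigQuery work internally: the three-dimensional vectors and document ids are invented, the helper names (`cosine_similarity`, `nearest`) are hypothetical, and the brute-force scan shown here is only practical for small collections.

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def nearest(
    query: list[float], index: dict[str, list[float]], top_k: int = 2
) -> list[tuple[str, float]]:
    """Score every stored vector against the query and return the top_k matches."""
    scored = [(cosine_similarity(query, vec), doc_id) for doc_id, vec in index.items()]
    scored.sort(reverse=True)  # highest similarity first
    return [(doc_id, score) for score, doc_id in scored[:top_k]]


# Hypothetical embeddings; real ones come from an embedding model
# and usually have hundreds of dimensions.
index = {
    "doc_a": [1.0, 0.0, 0.0],
    "doc_b": [0.0, 1.0, 0.0],
    "doc_c": [0.7, 0.7, 0.0],
}
results = nearest([1.0, 0.1, 0.0], index, top_k=2)
for doc_id, score in results:
    print(f"{doc_id}: {score:.3f}")
```

The scan compares the query with every stored vector, so its cost grows linearly with the collection size, which is one reason the performance characteristics of a vector search system matter at scale.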

Embeddings

intro_Vertex_AI_embeddings.ipynb: Introduces Vertex AI embeddings.
hybrid-search.ipynb: Explores combining different search techniques, potentially including vector search and keyword-based search.
intro-textemb-vectorsearch.ipynb: Introduces text embeddings and vector search.
vector-search-quickstart.ipynb: Quick start guide for implementing vector search.
bq-vector-search-log-outlier-detection.ipynb: Demonstrates using vector search with BigQuery logs to identify outliers.
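
As a rough illustration of combining search techniques, the sketch below merges a keyword ranking and a vector ranking with reciprocal rank fusion (RRF). This is one common fusion method, shown here as an assumption; it is not necessarily the approach taken in hybrid-search.ipynb. The document ids, the two input rankings, and the function name `reciprocal_rank_fusion` are hypothetical, and `k=60` is just a commonly used smoothing constant.

```python
def reciprocal_rank_fusion(rankings: list[list[str]], k: int = 60) -> list[str]:
    """Merge ranked lists: each document scores 1 / (k + rank) per list it appears in."""
    scores: dict[str, float] = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    # Highest fused score first; break ties alphabetically for determinism.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical results from two independent retrievers for the same query.
keyword_ranking = ["doc_b", "doc_a", "doc_d"]
vector_ranking = ["doc_a", "doc_c", "doc_b"]

fused = reciprocal_rank_fusion([keyword_ranking, vector_ranking])
print(fused)
```

Because RRF uses only rank positions, it does not need the keyword and vector scores to be on the same scale.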

Gemini

intro-grounding-gemini.ipynb: Introduces grounding in the context of Gemini.
building_DIY_multimodal_qa_system_with_mRAG.ipynb: Builds a custom multimodal question-answering system using mRAG.
code_retrieval_augmented_generation.ipynb: Demonstrates using code retrieval to improve code generation.
intro-grounding.ipynb: Introduces grounding in natural language processing.
langchain_bigquery_data_loader.ipynb: Uses LangChain to load data from BigQuery for RAG.
question_answering_documents.ipynb, question_answering_documents_langchain.ipynb, question_answering_documents_langchain_matching_engine.ipynb: Focus on question answering over documents.
summarization_large_documents.ipynb, summarization_large_documents_langchain.ipynb: Demonstrate summarizing large documents.

Open Models

cloud_run_ollama_gemma2_rag_qa.ipynb: Sets up a RAG-based question-answering system using Ollama and Gemma2 on Cloud Run

Agents on top of RAG

tutorial_vertex_ai_search_rag_agent.ipynb: Tutorial for building RAG agents using Vertex AI Search
tutorial_alloydb_rag_agent.ipynb: Tutorial for building RAG agents using AlloyDB
tutorial_cloud_sql_pg_rag_agent.ipynb: Tutorial for building RAG agents using Cloud SQL (PostgreSQL)

Use Cases

These notebooks offer a valuable resource to understand and implement RAG and grounding techniques in various applications. Feel free to dive into the notebooks that pique your interest and start building your own RAG-powered solutions.

Examples of RAG in different domains
- Document_QnA_using_gemini_and_vector_search.ipynb
- NLP2SQL_using_dynamic_RAG.ipynb
- RAG_Based_on_Sensitive_Data_Protection_using_Faker.ipynb
- code_rag.ipynb
- intra_knowledge_qna.ipynb
- intro_multimodal_rag.ipynb
- llamaindex_rag.ipynb
- multimodal_rag_langchain.ipynb
- small_to_big_rag.ipynb
Build RAG systems using BigQuery
- rag_qna_with_bq_and_featurestore.ipynb
- rag_vector_embedding_in_bigquery.ipynb
