Commit

Update scaling_rag_for_production.md
ClaireSuperlinked authored Feb 1, 2024
1 parent 7a5844d commit 5d0e812
Showing 1 changed file with 0 additions and 2 deletions.
2 changes: 0 additions & 2 deletions docs/use_cases/scaling_rag_for_production.md
@@ -2,8 +2,6 @@

# Scaling RAG for Production

![](assets/use_cases/recommender_systems/cover.jpg) <Placeholder>

Retrieval-augmented Generation (RAG) combines Large Language Models (LLMs) with external data to reduce the probability of machine hallucinations - AI-generated information that misrepresents underlying data or reality. When developing RAG systems, scalability is often an afterthought. This creates problems when moving from initial development to production. Having to manually adjust code while your application grows can get very costly and is prone to errors.

Our tutorial provides an example of **how you can develop a RAG pipeline with production workloads in mind from the start**, using the right tools - ones that are designed to scale your application.
