
Commit

doc update of the blog (#3280)
agunapal committed Aug 5, 2024
1 parent 3f40180 commit 04f1e6a
Showing 2 changed files with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion examples/usecases/RAG_based_LLM_serving/README.md
@@ -359,7 +359,7 @@ The system architecture for the end-to-end solution using RAG based LLM serving
![RAG + LLM Deployment](https://raw.githubusercontent.com/pytorch/serve/master/examples/usecases/RAG_based_LLM_serving/assets/rag_llm.png "RAG + LLM Deployment")


-The steps for full deployment are mentioned in [Deploy.md](https://github.com/pytorch/serve/blob/master/examples/usecases/RAG_based_LLM_serving/Deploy.md)
+The steps for full deployment are mentioned in [Deployment Guide](https://github.com/pytorch/serve/blob/master/examples/usecases/RAG_based_LLM_serving/Deploy.md#Deploy-Llama-&-RAG-using-TorchServe)

The code snippet which can chain the RAG endpoint with Llama endpoint is shown below

Binary file modified examples/usecases/RAG_based_LLM_serving/assets/rag_perf.png

0 comments on commit 04f1e6a
