Update README.md
vadimkantorov authored Dec 27, 2023
1 parent f3dd37c commit beebd99
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions README.md
@@ -2,6 +2,8 @@
> Triton Inference Server's "Generate" extension might be a better choice for string processing endpoints:
> - https://github.com/triton-inference-server/server/blob/main/docs/protocol/extension_generate.md
> - https://github.com/triton-inference-server/server/pull/6412
> - https://blog.marvik.ai/2023/10/16/deploying-llama2-with-nvidia-triton-inference-server/
> - https://docs.nvidia.com/deeplearning/triton-inference-server/user-guide/docs/protocol/extension_generate.html
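For orientation, here is a minimal sketch of how a client might call the "Generate" extension over HTTP. It assumes a Triton server listening on `localhost:8000`, a hypothetical model named `my_string_model`, and a hypothetical input tensor named `text_input`; the real input and output names depend on the model's configuration. Only the request is built here, and `generate` is sent only when a server is actually running.

```python
import json
import urllib.request


def build_generate_request(base_url, model_name, inputs, parameters=None, model_version=None):
    """Build a POST request for the Generate extension endpoint.

    `inputs` maps input tensor names to values. The names are model-specific;
    the ones used in the example below are assumptions, not fixed by Triton.
    """
    path = f"/v2/models/{model_name}"
    if model_version is not None:
        path += f"/versions/{model_version}"
    path += "/generate"

    body = dict(inputs)
    if parameters:
        # Optional request parameters travel under the "parameters" key.
        body["parameters"] = parameters

    return urllib.request.Request(
        base_url.rstrip("/") + path,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(request, timeout=10.0):
    """Send the request and decode the JSON response (needs a running server)."""
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Hypothetical model and input names, for illustration only.
req = build_generate_request(
    "http://localhost:8000", "my_string_model", {"text_input": "hello"}
)
print(req.get_method(), req.full_url)
```

With a server running, `generate(req)` would return the decoded JSON response, whose output fields are likewise named after the model's output tensors.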
## Primer of a string processing pipeline on Triton Inference Server on a CPU-only Docker-less system
