Triton Inference Server simplifies the deployment of AI models at scale in production. It lets teams deploy trained AI models from any framework (TensorFlow, NVIDIA® TensorRT, PyTorch, ONNX Runtime, or custom) and from local storage or a cloud platform onto any GPU- or CPU-based infrastructure (cloud, data center, or edge).
Check out the [Triton documentation](https://github.com/triton-inference-server/server) for more details.
We use Triton's Python backend, which allows you to serve Python "models" that can execute arbitrary Python (and therefore RAPIDS) code.
Here we showcase a simple example of using RAPIDS + PyTorch with Triton.
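Triton's Python backend loads a `model.py` that defines a `TritonPythonModel` class. The sketch below is a minimal, generic illustration of that interface rather than the code shipped in this repo; the tensor names `INPUT_TEXT` and `OUTPUT` are placeholders that would have to match the corresponding `config.pbtxt`.

```python
# Minimal sketch of a Triton Python backend model (model.py).
# Tensor names are placeholders; the real names must match config.pbtxt.
import triton_python_backend_utils as pb_utils


class TritonPythonModel:
    def initialize(self, args):
        # Called once when the model is loaded; load vocabularies,
        # model weights, or other state here.
        pass

    def execute(self, requests):
        # Called for every batch of inference requests.
        responses = []
        for request in requests:
            in_tensor = pb_utils.get_input_tensor_by_name(request, "INPUT_TEXT")
            data = in_tensor.as_numpy()

            # Arbitrary Python (and therefore RAPIDS) code can run here.
            out_tensor = pb_utils.Tensor("OUTPUT", data)
            responses.append(
                pb_utils.InferenceResponse(output_tensors=[out_tensor])
            )
        return responses

    def finalize(self):
        # Called once when the model is unloaded.
        pass
```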
`build.sh` builds a Triton + RAPIDS Docker container which you can use to deploy your RAPIDS code with Triton:

```bash
bash build.sh
```
The served example performs the following steps:

- Tokenization of strings into numerical vectors using cuDF's SubwordTokenizer (see the sketch after this list).
  - Tokenization model code is present in `models/rapids_tokenizer/1/model.py`.
  - Tokenization model configuration is defined in `models/rapids_tokenizer/config.pbtxt`.
- Sentiment prediction using a PyTorch model (see the sketch after this list).
  - Sentiment model code is present in `models/sentiment_analysis_model/1/model.py`.
  - Sentiment model configuration is defined in `models/sentiment_analysis_model/config.pbtxt`.
- Ensemble model configuration (chaining the tokenizer and sentiment models) is present in `models/end_to_end_model/config.pbtxt`.
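For the tokenization step, cuDF's `SubwordTokenizer` runs BERT-style subword tokenization entirely on the GPU. The snippet below is a rough standalone sketch, not the contents of `models/rapids_tokenizer/1/model.py`; the vocabulary hash file name `voc_hash.txt` and the sequence length are assumptions, and the exact API can vary slightly between cuDF versions.

```python
# Standalone sketch of GPU subword tokenization with cuDF.
import cudf
from cudf.core.subword_tokenizer import SubwordTokenizer

# "voc_hash.txt" is a placeholder for a pre-built vocabulary hash file.
tokenizer = SubwordTokenizer("voc_hash.txt", do_lower_case=True)

text = cudf.Series(["Triton makes serving easy", "RAPIDS keeps it on the GPU"])
tokens = tokenizer(
    text,
    max_length=64,
    max_num_rows=len(text),
    padding="max_length",
    return_tensors="cp",  # return CuPy arrays so the data stays on the GPU
    truncation=True,
)

input_ids = tokens["input_ids"]            # shape: (num_rows, max_length)
attention_mask = tokens["attention_mask"]
```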
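The sentiment stage then consumes those token ids with a PyTorch model. The toy classifier below only illustrates the general pattern of running a GPU PyTorch model on tokenized input; it is not the architecture defined in `models/sentiment_analysis_model/1/model.py`.

```python
# Illustrative PyTorch sentiment classifier over token ids (not the repo's model).
import torch
import torch.nn as nn


class ToySentimentModel(nn.Module):
    def __init__(self, vocab_size=30522, embed_dim=128, num_classes=2):
        super().__init__()
        self.embedding = nn.EmbeddingBag(vocab_size, embed_dim)
        self.classifier = nn.Linear(embed_dim, num_classes)

    def forward(self, input_ids):
        return self.classifier(self.embedding(input_ids))


model = ToySentimentModel().cuda().eval()
with torch.no_grad():
    # In the ensemble, input_ids come from the tokenizer stage;
    # random ids stand in for them here.
    input_ids = torch.randint(0, 30522, (2, 64), device="cuda")
    sentiment_logits = model(input_ids)
```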
The Triton Inference Server is started using `start_server.sh`:

```bash
bash start_server.sh
```
The client logic to interact with the served Triton model is present in `example_client.ipynb`.
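For reference, the typical client pattern with `tritonclient`'s HTTP API looks roughly like the sketch below; the tensor names `INPUT_TEXT` and `SENTIMENT` are placeholders and must match whatever `models/end_to_end_model/config.pbtxt` declares.

```python
# Minimal sketch of querying the served ensemble over HTTP.
import numpy as np
import tritonclient.http as httpclient

client = httpclient.InferenceServerClient(url="localhost:8000")

# One string per request row; BYTES is Triton's datatype for variable-length strings.
text = np.array([["I love RAPIDS on Triton"]], dtype=object)
inputs = [httpclient.InferInput("INPUT_TEXT", list(text.shape), "BYTES")]
inputs[0].set_data_from_numpy(text)

outputs = [httpclient.InferRequestedOutput("SENTIMENT")]

result = client.infer(model_name="end_to_end_model", inputs=inputs, outputs=outputs)
print(result.as_numpy("SENTIMENT"))
```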