Skip to content

Latest commit

 

History

History
 
 

linear_random_pytorch

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 

Linear model in PyTorch

Overview

The example presents a simple Linear model implemented in PyTorch

Example consists of following scripts:

  • server.py - start the model with Triton Inference Server
  • client.py - execute HTTP/gRPC requests to the deployed model

Requirements

The example requires the torch package. It can be installed in your current environment using pip:

pip install torch

Or you can use NVIDIA PyTorch container:

docker run -it --gpus 1 --shm-size 8gb -v {repository_path}:{repository_path} -w {repository_path} nvcr.io/nvidia/pytorch:24.08-py3 bash

If you select to use container we recommend to install NVIDIA Container Toolkit.

Quick Start

The step-by-step guide:

  1. Install PyTriton following the installation instruction
  2. In current terminal start the model on Triton using server.py
./server.py
  1. Open new terminal tab (ex. Ctrl + T on Ubuntu) or window
  2. Go to the example directory
  3. Run the client.py to perform queries on model:
./client.py