TorchServe quick start example #3040

agunapal · 2024-03-23T23:16:08Z

Description

The goal of this PR is to let TorchServe users quickly deploy a model for serving with a single script, without having to install anything manually.

This PR adds the following quick start example for TorchServe:

./examples/getting_started/build_image.sh vit  # optional arg --torch.compile

docker run --rm -it --env TORCH_COMPILE=false --env MODEL_NAME=vit --platform linux/amd64 -p 127.0.0.1:8080:8080 -v /home/ubuntu/serve/model_store:/home/model-server/model-store pytorch/torchserve:demo

# In another terminal, run the following command for inference
curl http://127.0.0.1:8080/predictions/vit -T ./examples/image_classifier/kitten.jpg
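
Because the container takes some time to load the model, the inference call can fail if it is sent too early. The sketch below is one possible way to wait for the server before sending the request. It relies on TorchServe's `/ping` health endpoint on the inference port; the helper names `predict_url` and `wait_until_healthy`, the retry count, and the one-second delay are assumptions made for illustration and are not part of this PR.

```shell
# Hypothetical helpers (not part of this PR).

# Build the prediction URL for a model. The defaults mirror the
# port mapping used in the docker run command above.
predict_url() (
  model="$1"
  host="${2:-127.0.0.1}"
  port="${3:-8080}"
  printf 'http://%s:%s/predictions/%s\n' "$host" "$port" "$model"
)

# Poll the /ping health endpoint once per second until it answers,
# or give up after the given number of tries (default 30, an assumption).
wait_until_healthy() (
  host="${1:-127.0.0.1}"
  port="${2:-8080}"
  tries="${3:-30}"
  i=0
  while [ "$i" -lt "$tries" ]; do
    if curl -fsS "http://$host:$port/ping" >/dev/null 2>&1; then
      return 0
    fi
    i=$((i + 1))
    sleep 1
  done
  return 1
)

# Usage (requires the container started above to be running):
#   wait_until_healthy && \
#     curl "$(predict_url vit)" -T ./examples/image_classifier/kitten.jpg
```

The functions use subshell bodies so that their variables do not leak into the calling shell.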

For BERT models, set the following environment variable:

export HUGGINGFACE_TOKEN=< Your token>

The example supports the following models:

resnet, densenet, vit, fasterrcnn, bertsc, berttc, bertqa, berttg
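
A wrapper script could reject unknown model names before starting a build. The following is a minimal sketch under that assumption; `SUPPORTED_MODELS` and `is_supported_model` are hypothetical names, and the PR's own `build_image.sh` may validate its argument differently or not at all.

```shell
# Hypothetical helper (not part of this PR): check a model name
# against the list of models named in the PR description.
SUPPORTED_MODELS="resnet densenet vit fasterrcnn bertsc berttc bertqa berttg"

is_supported_model() {
  # Reject empty names and anything that is not lowercase alphanumeric,
  # so a multi-word argument cannot match across list entries.
  case "$1" in
    "" | *[!a-z0-9]*) return 1 ;;
  esac
  # Match the name as a whole word in the space-separated list.
  case " $SUPPORTED_MODELS " in
    *" $1 "*) return 0 ;;
    *) return 1 ;;
  esac
}

# Usage:
#   is_supported_model "$1" || { echo "unsupported model: $1" >&2; exit 1; }
```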

If an NVIDIA GPU is present, the script automatically picks the GPU image and uses the GPU.
It also supports torch.compile.
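
The description does not show how the GPU is detected. One common approach, sketched here purely as an assumption, is to check whether `nvidia-smi` is available on the `PATH`. The function name `detect_flavor` and the `gpu`/`cpu` labels are hypothetical and may not match what the PR's script does.

```shell
# Hypothetical helper (not part of this PR): print "gpu" if the probe
# command is found on PATH, otherwise "cpu". The probe defaults to
# nvidia-smi; it is a parameter so the logic can be exercised on
# machines without a GPU.
detect_flavor() {
  probe="${1:-nvidia-smi}"
  if command -v "$probe" >/dev/null 2>&1; then
    echo gpu
  else
    echo cpu
  fi
}

# Usage:
#   flavor="$(detect_flavor)"
#   echo "Building the $flavor image"
```

Note that finding `nvidia-smi` only suggests that a driver is installed; it does not guarantee that Docker can reach the GPU.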

Fixes #(issue)

Type of change

Please delete options that are not relevant.

Bug fix (non-breaking change which fixes an issue)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
New feature (non-breaking change which adds functionality)
This change requires a documentation update

Feature/Issue validation/testing

Please describe the Unit or Integration tests that you ran to verify your changes and relevant result summary. Provide instructions so it can be reproduced.
Please also list any relevant details for your test configuration.

Inference

curl http://127.0.0.1:8080/predictions/vit -T ./examples/image_classifier/kitten.jpg
{
  "tabby": 0.5469660758972168,
  "tiger_cat": 0.23448117077350616,
  "Egyptian_cat": 0.1045909970998764,
  "lynx": 0.0010765091283246875,
  "Persian_cat": 0.00040055206045508385
}
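
To use this response in a script, the highest-scoring label can be extracted with `awk`. This is a rough sketch that assumes the flat layout shown above, with one `"label": score` pair per line; it is not a general JSON parser, and `top_label` is a hypothetical name.

```shell
# Hypothetical helper (not part of this PR): read a flat JSON object of
# {"label": score} pairs from stdin, one pair per line, and print the
# label with the highest score.
top_label() {
  awk -F'"' '
    /:/ {
      score = $3
      gsub(/[ ,:]/, "", score)   # strip the separator characters around the number
      if (best == "" || score + 0 > best + 0) {
        best = score
        label = $2
      }
    }
    END { print label }
  '
}

# Usage (requires the running container):
#   curl -s http://127.0.0.1:8080/predictions/vit \
#     -T ./examples/image_classifier/kitten.jpg | top_label
```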

Checklist:

Did you have fun?
Have you added tests that prove your fix is effective or that this feature works?
Has code been commented, particularly in hard-to-understand areas?
Have you made corresponding changes to the documentation?


lxning · 2024-03-25T19:30:31Z

This example contains a Dockerfile plus the config and handler for BERT, ResNet, and DenseNet. It is not clear how customers should follow this example, since TorchServe already has directories for the Dockerfile and for the model handlers of these models.

We need clear guidance and an implementation to support model cards.

msaroufim

Personally, I like the direction this is going in. Recommending Docker early on to users feels natural, since TorchServe will just work on any device.

I also like that the compiler flag is an env variable; it will make trying things out quite simple.

What I don't like is that the BERT examples still have these uninspiring labels, "accepted" vs. "not accepted". This is confusing because it might make people think it's a system-level error instead of the expected output of the model. I would also want to see the code duplication issues fixed. There's also still a link to an existing getting started guide on the main page; let's get rid of that and the tutorial.

Otherwise, once this is in, I do expect this to be the default way people first use TorchServe. Presumably you're doing llama7b and 70b in a future PR?

msaroufim · 2024-03-26T05:46:56Z

examples/getting_started/index_mapping/berttg_index_to_name.json

@@ -0,0 +1,4 @@
+{


These labels have been confusing forever. Can we do something else?

msaroufim · 2024-03-26T05:48:00Z

examples/getting_started/Transformer_handler_generalized.py

@@ -0,0 +1,538 @@
+import ast


Is this copy-pasted from the examples transformer handler? The duplication will hurt us; I would recommend having this in one place.

msaroufim · 2024-03-26T05:49:13Z

examples/getting_started/Download_Transformer_models.py

@@ -0,0 +1,188 @@
+import argparse


Same point on duplication here.

msaroufim · 2024-03-26T05:49:46Z

examples/getting_started/Dockerfile

+COPY $EXAMPLE_DIR/config.properties /home/model-server/config.properties
+
+WORKDIR /home/model-server/getting_started
+RUN chmod +x /usr/local/bin/dockerd-entrypoint.sh \


agunapal and others added 4 commits March 23, 2024 23:15

TorchServe quickstart example

0dcb5ca

Merge branch 'master' into examples/getting_started_curl

7d303f5

TorchServe quickstart example

84e6d12

Merge branch 'examples/getting_started_curl' of https://github.com/py…

548874e

…torch/serve into examples/getting_started_curl

agunapal marked this pull request as ready for review March 23, 2024 23:37

agunapal requested a review from msaroufim March 23, 2024 23:37

msaroufim requested changes Mar 26, 2024
