UNAVAILABLE: Unsupported: Triton TRITONBACKEND API version: X does not support 'onnxruntime' TRITONBACKEND API version X #259

Open
pultarmi opened this issue Jun 28, 2024 · 0 comments

I followed the README build instructions and, when I started Triton Inference Server with the rebuilt backend swapped in, it failed with this error:

UNAVAILABLE: Unsupported: Triton TRITONBACKEND API version: 1.16 does not support 'onnxruntime' TRITONBACKEND API version: 1.19

I think the README is missing a required step: you have to pass a -DTRITON_CORE_REPO_TAG that matches the Triton release you are running. Without it, the backend fails the API version check in TRITONBACKEND_Initialize in onnxruntime.cc.


For example, the README says to run

cmake -DCMAKE_INSTALL_PREFIX:PATH=`pwd`/install -DTRITON_BUILD_ONNXRUNTIME_VERSION=1.14.1 -DTRITON_BUILD_CONTAINER_VERSION=23.04 ..

"to build the ONNX Runtime backend for Triton 23.04". That command leaves out the core tag, so the resulting backend did not load in 23.04 for me. The correct command would be

cmake -DCMAKE_INSTALL_PREFIX:PATH=`pwd`/install -DTRITON_BUILD_ONNXRUNTIME_VERSION=1.14.1 -DTRITON_BUILD_CONTAINER_VERSION=23.04 -DTRITON_CORE_REPO_TAG=r23.04 ..
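
To avoid the two versions drifting apart, the core tag can be derived from the container version in one place. This is only a sketch: it assumes the core tag is always "r" followed by the container version (as with 23.04 and r23.04 above), and the variable names are my own, not something the README defines. It prints the cmake command rather than running it.

```shell
#!/bin/sh
# Assumption: the core repo tag is "r" + container version (23.04 -> r23.04).
# TRITON_VERSION and CMAKE_ARGS are illustrative names, not from the README.
TRITON_VERSION="23.04"
TRITON_CORE_REPO_TAG="r${TRITON_VERSION}"

# Same flags as the corrected command above, built from a single version value.
CMAKE_ARGS="-DTRITON_BUILD_ONNXRUNTIME_VERSION=1.14.1 -DTRITON_BUILD_CONTAINER_VERSION=${TRITON_VERSION} -DTRITON_CORE_REPO_TAG=${TRITON_CORE_REPO_TAG}"

# Print the full command for review; run it from the build directory once it looks right.
echo "cmake -DCMAKE_INSTALL_PREFIX:PATH=$(pwd)/install ${CMAKE_ARGS} .."
```

Changing TRITON_VERSION then updates both -DTRITON_BUILD_CONTAINER_VERSION and -DTRITON_CORE_REPO_TAG together.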
