
✨[Feature] Is there a plan to support to convert quantized PT2 to trt ? #3471


Open
MaltoseFlower opened this issue Apr 14, 2025 · 3 comments
Labels: feature request (New feature or request)

@MaltoseFlower

When I use torch-tensorrt 2.4.0 to convert a quantized PT2 model to TRT, I get the error below.

[Screenshot of the error message]

I wonder whether this will be supported in the future. Or is there a way to do this with the current version (perhaps by converting the torch.ops.quantized_decomposed.dequantize_per_tensor.default operator to another quantization operator)?
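For context, a minimal sketch of the affine dequantization that quantized_decomposed.dequantize_per_tensor computes (shown here in plain Python on lists rather than tensors; the real op also takes quant_min/quant_max and a dtype, which are omitted for brevity):

```python
# Illustrative sketch: per-tensor affine dequantization,
# i.e. the math behind quantized_decomposed.dequantize_per_tensor.
# (Plain Python for clarity; the actual op operates on torch tensors.)
def dequantize_per_tensor(q_values, scale, zero_point):
    """Map integer quantized values back to floats: (q - zero_point) * scale."""
    return [(q - zero_point) * scale for q in q_values]

# Example: int8 values quantized with scale=0.5, zero_point=0
print(dequantize_per_tensor([-128, 0, 127], 0.5, 0))  # → [-64.0, 0.0, 63.5]
```

A backend that lacks a converter for this op will reject the graph at this node, which is what the error above reports.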

MaltoseFlower added the feature request label Apr 14, 2025
@narendasan (Collaborator)

@lanluo-nvidia Can you take a look at this post-FP4?

@lanluo-nvidia (Collaborator)

@MaltoseFlower do you have any example code so I can look into this further?

@lanluo-nvidia (Collaborator)

Meanwhile, here is an example of FP8/INT8 PTQ for your reference:
https://github.com/pytorch/TensorRT/blob/main/examples/dynamo/vgg16_ptq.py
