Maybe you can run it with the Python runtime, then debug it following https://nvidia.github.io/TensorRT-LLM/reference/troubleshooting.html#debug-on-e2e-models
System Info
GPU-A100
TensorRT-LLM: 0.15.0
Who can help?
@ncomly-nvidia @byshiue
Information
Tasks
An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
Reproduction
Run the steps mentioned under https://nvidia.github.io/TensorRT-LLM/reference/troubleshooting.html#debug-on-e2e-models
to debug the output from the following model - https://github.com/huggingface/transformers/blob/5d7739f15a6e50de416977fe2cc9cb516d67edda/src/transformers/models/mistral/modeling_mistral.py#L1015
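The troubleshooting guide's approach boils down to dumping intermediate tensors from both a trusted reference run (e.g. the HuggingFace model) and the TensorRT-LLM engine, then finding where they diverge. Below is a minimal, hedged sketch of that comparison step; it is not TensorRT-LLM API, and it assumes you have already saved the per-layer activations from both runs as numpy arrays keyed by a (hypothetical) layer name.

```python
import numpy as np

def compare_activations(ref, test, eps=1e-12):
    """Compare per-layer activations from two runs.

    ref / test: dicts mapping layer name -> np.ndarray
    (an assumed dump format, not a TensorRT-LLM structure).
    Returns a list of (name, max_abs_err, cosine_sim),
    sorted with the most divergent layers first.
    """
    report = []
    for name in ref:
        a = ref[name].astype(np.float32).ravel()
        b = test[name].astype(np.float32).ravel()
        # Max absolute error pinpoints hard numerical blow-ups.
        max_err = float(np.max(np.abs(a - b)))
        # Cosine similarity tolerates scale differences (e.g. fp16 rounding).
        cos = float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b) + eps))
        report.append((name, max_err, cos))
    report.sort(key=lambda r: r[1], reverse=True)
    return report
```

The first layer in the sorted report with a large error is usually the place to start looking (e.g. a mis-converted weight, a wrong rotary-embedding parameter, or a precision issue in that op).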
Expected behavior
easy debugging of new models
Actual behavior
no debugging setup available
Additional notes
I need to debug the cause of the accuracy drop of the Biomistral model (https://github.com/huggingface/transformers/blob/5d7739f15a6e50de416977fe2cc9cb516d67edda/src/transformers/models/mistral/modeling_mistral.py#L1015) when run on a TensorRT-LLM engine compared to vLLM.
Looking at the instructions in https://nvidia.github.io/TensorRT-LLM/reference/troubleshooting.html#debug-on-e2e-models, I was wondering if there is any model closely related to mine that I can use to troubleshoot. (I hope a nearly identical model exists, because I'm able to run inference with TensorRT-LLM; it's just that the accuracy seems low.)
Can you please help me?
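Since inference itself works and only accuracy differs between the two runtimes, one cheap first check (before full tensor dumping) is to greedy-decode the same prompt on both vLLM and TensorRT-LLM and locate the first token where they disagree. This is a generic sketch, assuming you have already collected the token-id sequences from each runtime:

```python
def first_divergence(tokens_a, tokens_b):
    """Return the index of the first differing token between two
    greedy-decoded sequences, or None if the common prefix matches."""
    for i, (a, b) in enumerate(zip(tokens_a, tokens_b)):
        if a != b:
            return i
    return None
```

An early divergence (within the first few tokens) usually points at a checkpoint-conversion or configuration problem, while a late divergence is more consistent with accumulated low-precision rounding.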