-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support for DeepseekV2ForCausalLM #2340
Comments
@fengyang95, little update as promised: CI pass and code review ongoing, will add some benchmark data and analysis in README, MR is pretty huge so took little longer time, will update the status on Monday, thanks for support TRT-LLM ! |
nice work!!! |
little update on Friday: we are in code rebase and possible merge in main branch in next week... finger crossed.. thanks! |
@fengyang95 , thanks for attention on Tensorrt-LLM, deepseek-v2 is live in main branch and will be officially announced in v0.15, will close this bug now .. welcome to give any suggestion or issue when using Tensorrt-LLM.. |
@dominicshanshan Nice work!!! Looking forward to support for fp8 and cc8.9 (e.g., L40) since our GPU resources are relatively limited. |
message received, working on FP8 and sm89, sm89 support should available soon, will update the status once it passed our internal tests.. |
@dominicshanshan really appreciate your work on deepseek. is the fp8 support getting closer? |
yes, working on it, should be enable soon.. |
Question
Can we get DeepseekV2 supported?
Code to reproduce
Error
The text was updated successfully, but these errors were encountered: