Update on the development branch #614
kaiyux announced in Announcements
Hi,
The TensorRT-LLM team is pleased to announce that we are pushing an update to the development branch (which also includes the Triton backend) on December 8th, 2023.
This update includes:
- trtllm-build command (already applied to blip2 and OPT)
- ModelRunnerCpp that wraps C++ gptSession
- StoppingCriteria and LogitsProcessor in Python generate API (thanks to the contribution from @zhang-ge-hao)
- "the value update is not the same shape as the original. updated: (2560, 3840), original (5120, 3840)" (#580)

Thanks,
The TensorRT-LLM Engineering Team