Update on the development branch #1170
kaiyux
announced in
Announcements
Replies: 1 comment
-
It's very good and helps to customize for product tasks, thanks! Please add the ability to configure the sampling pipeline (adding custom BaseSamplingLayer) |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi,
The TensorRT-LLM team is pleased to announce that we are pushing an update to the development branch (and the Triton backend) this February 27, 2024.
This update includes:
examples/multimodal
early_stopping=False
in beam search for C++ Runtimeend_id
issue for Qwen qwen end_id setting is wrong so cannot stop at right postition! #987Thanks,
The TensorRT-LLM Engineering Team
Beta Was this translation helpful? Give feedback.
All reactions