-
Notifications
You must be signed in to change notification settings - Fork 617
Support Qwen3-VL #4093
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support Qwen3-VL #4093
Conversation
This reverts commit c979730.
|
Improved the config check part, tested with internvl / intern-s1 / qwen3vl / qwen3 / qwen2.5vl / glm4.1v, seems good. |
|
May share the evalution test results |
|
LLM evaluation test failed by following #4094 |
Can reproduce the which is the same as the one mentioned in Therefore, I would conclude that this is a bug related to scheduling, rather than the current qwen3vl codes. |
Tested with VLMEvalKit, dataset: OCRBench, temperature: 0.7, max_new_tokens: 16384
Official ACC refers to: |
grimoire
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
|
After merging main, benchmark serving by |
|
I have also deployed two additional models: Qwen/Qwen3-8B and OpenGVLab/InternVL3_5-8B. After benchmarking both services, I confirmed that they functioned properly. Therefore, I suspect that this PR may contain potential issues. |
@lvhan028 Tested with Qwen/Qwen3-8B, with the following benchmark settings, appears that the main branch code still triggers the
Therefore, I think does not really resolve the scheduling bug. Additionally, for pure text inputs, the qwen3vl text parts work as a class inherited from qwen3, unlikely to cause scheduling errors. I will insist on the view that this is not a bug caused by the current qwen3vl code. |
cc @grimoire |


Related
TODO