[Model][LoRA]LoRA support added for Qwen #9622

jeejeelee · 2024-10-23T16:27:48Z

FILL IN THE PR DESCRIPTION HERE

Distinguish between Qwen LLM and VL to better support LoRA (similar treatment needed for ChatGLM as well). ~~Currently set as WIP, the main purpose is to discuss whether this solution(separate LLM and VL) is acceptable , if accepted, I will continue to complete it.~~
ping @ywang96 @DarkLight1337

Signed-off-by: Jee Jee Li <[email protected]>

github-actions · 2024-10-23T16:28:00Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can do one of these:

Add ready label to the PR
Enable auto-merge.

🚀

Signed-off-by: Jee Jee Li <[email protected]>

DarkLight1337

Looks good, sorry for making you wait!

Signed-off-by: Jee Jee Li <[email protected]> Signed-off-by: Randall Smith <[email protected]>

DarkLight1337 · 2024-10-31T05:21:46Z

Just realized that the Supported Models page hasn't been updated yet. @jeejeelee can you open a new PR to update that page with the new LoRA support? We should also explicitly inherit from SupportsLoRA in QWenLMHeadModel.

jeejeelee · 2024-10-31T05:24:16Z

Just realized that the Supported Models page hasn't been updated yet. @jeejeelee can you open a new PR to update that page with the new LoRA support? We should also explicitly inherit from SupportsLoRA in QWenLMHeadModel.

Okay, handling it now

jeejeelee · 2024-10-31T05:36:41Z

We should also explicitly inherit from SupportsLoRA in QWenLMHeadModel.

Why do we need to do it?

DarkLight1337 · 2024-10-31T05:37:36Z

We should also explicitly inherit from SupportsLoRA in QWenLMHeadModel.

Why do we need to do it?

Easier to find which models support LoRA.

jeejeelee · 2024-10-31T05:42:13Z

We should also explicitly inherit from SupportsLoRA in QWenLMHeadModel.

Why do we need to do it?

Easier to find which models support LoRA.

Get it!

Signed-off-by: Jee Jee Li <[email protected]> Signed-off-by: NickLucche <[email protected]>

Signed-off-by: Jee Jee Li <[email protected]>

Signed-off-by: Jee Jee Li <[email protected]> Signed-off-by: Linkun Chen <[email protected]>

Signed-off-by: Jee Jee Li <[email protected]> Signed-off-by: Loc Huynh <[email protected]>

Signed-off-by: Jee Jee Li <[email protected]> Signed-off-by: Sumit Dubey <[email protected]>

Signed-off-by: Jee Jee Li <[email protected]>

Signed-off-by: Jee Jee Li <[email protected]> Signed-off-by: Maxime Fournioux <[email protected]>

Signed-off-by: Jee Jee Li <[email protected]> Signed-off-by: Tyler Michael Smith <[email protected]>

Signed-off-by: Jee Jee Li <[email protected]>

Init

6493ee4

Signed-off-by: Jee Jee Li <[email protected]>

jeejeelee marked this pull request as draft October 23, 2024 16:27

Complete QWenVL support LoRA

6462961

Signed-off-by: Jee Jee Li <[email protected]>

jeejeelee force-pushed the qwen-support-lora branch from ccc2f34 to 6462961 Compare October 28, 2024 08:12

Merge branch 'vllm-project:main' into qwen-support-lora

9b126aa

jeejeelee marked this pull request as ready for review October 28, 2024 08:15

Delete redundant code

804a361

Signed-off-by: Jee Jee Li <[email protected]>

DarkLight1337 added the ready ONLY add when PR is ready to merge/full CI is needed label Oct 29, 2024

DarkLight1337 approved these changes Oct 29, 2024

View reviewed changes

DarkLight1337 enabled auto-merge (squash) October 29, 2024 02:33

DarkLight1337 merged commit 7a4df5f into vllm-project:main Oct 29, 2024
76 checks passed

rasmith pushed a commit to rasmith/vllm that referenced this pull request Oct 30, 2024

[Model][LoRA]LoRA support added for Qwen (vllm-project#9622)

91dfb3f

Signed-off-by: Jee Jee Li <[email protected]> Signed-off-by: Randall Smith <[email protected]>

jeejeelee deleted the qwen-support-lora branch October 31, 2024 05:44

NickLucche pushed a commit to NickLucche/vllm that referenced this pull request Oct 31, 2024

[Model][LoRA]LoRA support added for Qwen (vllm-project#9622)

a2827b0

Signed-off-by: Jee Jee Li <[email protected]> Signed-off-by: NickLucche <[email protected]>

NickLucche pushed a commit to NickLucche/vllm that referenced this pull request Oct 31, 2024

[Model][LoRA]LoRA support added for Qwen (vllm-project#9622)

bcede33

Signed-off-by: Jee Jee Li <[email protected]> Signed-off-by: NickLucche <[email protected]>

lk-chen pushed a commit to lk-chen/vllm that referenced this pull request Nov 4, 2024

[Model][LoRA]LoRA support added for Qwen (vllm-project#9622)

33fdcae

Signed-off-by: Jee Jee Li <[email protected]>

lk-chen pushed a commit to lk-chen/vllm that referenced this pull request Nov 4, 2024

[Model][LoRA]LoRA support added for Qwen (vllm-project#9622)

80937ef

Signed-off-by: Jee Jee Li <[email protected]> Signed-off-by: Linkun Chen <[email protected]>

JC1DA pushed a commit to JC1DA/vllm that referenced this pull request Nov 11, 2024

[Model][LoRA]LoRA support added for Qwen (vllm-project#9622)

35bbb1e

Signed-off-by: Jee Jee Li <[email protected]> Signed-off-by: Loc Huynh <[email protected]>

sumitd2 pushed a commit to sumitd2/vllm that referenced this pull request Nov 14, 2024

[Model][LoRA]LoRA support added for Qwen (vllm-project#9622)

8486a62

Signed-off-by: Jee Jee Li <[email protected]> Signed-off-by: Sumit Dubey <[email protected]>

KuntaiDu pushed a commit to KuntaiDu/vllm that referenced this pull request Nov 20, 2024

[Model][LoRA]LoRA support added for Qwen (vllm-project#9622)

7f4d6d9

Signed-off-by: Jee Jee Li <[email protected]>

mfournioux pushed a commit to mfournioux/vllm that referenced this pull request Nov 20, 2024

[Model][LoRA]LoRA support added for Qwen (vllm-project#9622)

21b58d5

Signed-off-by: Jee Jee Li <[email protected]> Signed-off-by: Maxime Fournioux <[email protected]>

tlrmchlsmth pushed a commit to neuralmagic/vllm that referenced this pull request Nov 23, 2024

[Model][LoRA]LoRA support added for Qwen (vllm-project#9622)

b1bae2a

Signed-off-by: Jee Jee Li <[email protected]> Signed-off-by: Tyler Michael Smith <[email protected]>

sleepwalker2017 pushed a commit to sleepwalker2017/vllm that referenced this pull request Dec 13, 2024

[Model][LoRA]LoRA support added for Qwen (vllm-project#9622)

8a46649

Signed-off-by: Jee Jee Li <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Model][LoRA]LoRA support added for Qwen #9622

[Model][LoRA]LoRA support added for Qwen #9622

jeejeelee commented Oct 23, 2024 •

edited by DarkLight1337

Loading

github-actions bot commented Oct 23, 2024

DarkLight1337 left a comment

DarkLight1337 commented Oct 31, 2024 •

edited

Loading

jeejeelee commented Oct 31, 2024

jeejeelee commented Oct 31, 2024

DarkLight1337 commented Oct 31, 2024

jeejeelee commented Oct 31, 2024

[Model][LoRA]LoRA support added for Qwen #9622

[Model][LoRA]LoRA support added for Qwen #9622

Conversation

jeejeelee commented Oct 23, 2024 • edited by DarkLight1337 Loading

github-actions bot commented Oct 23, 2024

DarkLight1337 left a comment

Choose a reason for hiding this comment

DarkLight1337 commented Oct 31, 2024 • edited Loading

jeejeelee commented Oct 31, 2024

jeejeelee commented Oct 31, 2024

DarkLight1337 commented Oct 31, 2024

jeejeelee commented Oct 31, 2024

jeejeelee commented Oct 23, 2024 •

edited by DarkLight1337

Loading

DarkLight1337 commented Oct 31, 2024 •

edited

Loading