[Model]Refactor MiniCPMV #7020

jeejeelee · 2024-08-01T07:05:21Z

I have completed the following modification:

Separate different versions of MiniCPMV to facilitate support for future features like LoRA and BNB.
Port Idefics2VisionTransformer

github-actions · 2024-08-01T07:05:32Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which consists a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of default ones by unblocking the steps in your fast-check build on Buildkite UI.

Once the PR is approved and ready to go, please make sure to run full CI as it is required to merge (or just use auto-merge).

To run full CI, you can do one of these:

Comment /ready on the PR
Add ready label to the PR
Enable auto-merge.

🚀

ywang96 · 2024-08-01T07:09:11Z

Hey @jeejeelee Thanks for the contribution! Were you able to verify if the model works on TP=1,2,4,8? (I will verify this myself later, but I was curious if this PR is ready for testing)

Also I'm curious if you have seen any significant speedup by sharding the ViT

jeejeelee · 2024-08-01T07:15:12Z

Hey @jeejeelee Thanks for the contribution! Were you able to verify if the model works on TP=1,2,4,8? (I will verify this myself later, but I was curious if this PR is ready for testing)

Also I'm curious if you have seen any significant speedup by sharding the ViT

Just verified on TP=1,2,4 . Currently, I do not have available 8-gpu resources and have not yet verified TP=8

DarkLight1337 · 2024-08-01T07:15:48Z

This will cause significant merge conflicts with #6995. Would it be better for you to incorporate my changes into your PR instead of releasing them separately?

ywang96 · 2024-08-01T07:17:27Z

This will cause significant merge conflicts with #6995. Would it be better for you to incorporate my changes into your PR instead of releasing them separately?

I agree with this too.

jeejeelee · 2024-08-01T07:18:55Z

This will cause significant merge conflicts with #6995. Would it be better for you to incorporate my changes into your PR instead of releasing them separately?

Ok, I will incorporate your changes ASAP

jeejeelee · 2024-08-01T07:28:43Z

Also I'm curious if you have seen any significant speedup by sharding the ViT

@ywang96 I have not yet tested the speedup effect of TP, but I will provide test results once available.

jeejeelee · 2024-08-01T09:30:50Z

@DarkLight1337 It seems #6995 has a bug. After incorporating your changes, the generated results have become poor.

DarkLight1337 · 2024-08-01T09:32:54Z

Try reverting the lines where get_2d_sincos_pos_embed is called.

jeejeelee · 2024-08-01T10:57:16Z

Try reverting the lines where get_2d_sincos_pos_embed is called.

I have figured it out, this snippet has a bug, and have fixed it

DarkLight1337 · 2024-08-01T11:01:45Z

Try reverting the lines where get_2d_sincos_pos_embed is called.

I have figured it out, this snippet has a bug, and have fixed it

My bad, I forgot to rename the intermediate variables. Thanks for fixing this!

To avoid confusion, I have closed the other PR.

jeejeelee · 2024-08-01T11:07:58Z

Try reverting the lines where get_2d_sincos_pos_embed is called.

I have figured it out, this snippet has a bug, and have fixed it

My bad, I forgot to rename the intermediate variables. Thanks for fixing this!

To avoid confusion, I have closed the other PR.

Thanks, could you help review my implementation? I want to complete this PR asap. My final goal is actually to make minicpmv2.5 support LoRA

HwwwwwwwH · 2024-08-01T11:13:55Z

Sry for late. Really appreciate your contribution! I'll check these modifications.

HwwwwwwwH · 2024-08-01T11:26:20Z

I think it's truly great to have MiniCPMV separated. I'm pulling the code of this PR and running some evaluations.

vllm/model_executor/models/minicpmv.py

jeejeelee · 2024-08-03T17:22:57Z

@DarkLight1337 @HwwwwwwwH It's getting late here, so I'll log off for now. Thank you for all your hard work

DarkLight1337 · 2024-08-03T17:26:41Z

@DarkLight1337 @HwwwwwwwH It's getting late here, so I'll log off for now. Thank you for all your hard work

Sure, sorry for interfering with your own testing...

DarkLight1337 · 2024-08-03T17:28:15Z

@DarkLight1337 After updating the latest changes, I'm still encountering errors

[rank0]:   File "/mypath/vllm/vllm/model_executor/models/minicpmv.py", line 588, in _parse_and_validate_inputs
[rank0]:     raise ValueError(f"Inconsistent flattened lengths, found: {lens}")
[rank0]: ValueError: Inconsistent flattened lengths, found: [0, 16, 16]

After some offline discussion with @HwwwwwwwH , apparently the dummy data doesn't contain image tokens while providing the image. I have updated the validation to allow this for now, we will revisit the dummy data generation in a later PR.

HwwwwwwwH · 2024-08-03T17:30:07Z

te here, so I'll log off for now. Thank you for all your hard work

Thank you too. Good night!

HwwwwwwwH · 2024-08-04T04:26:59Z

@jeejeelee Hi there. Could you run the model correctly now? I got some problems.

jeejeelee · 2024-08-04T04:36:12Z

@jeejeelee Hi there. Could you run the model correctly now? I got some problems.

Me too.

vllm/model_executor/models/minicpmv.py

HwwwwwwwH · 2024-08-04T07:18:53Z

@DarkLight1337 Thank you for your hard work. It works fine for me now.

ywang96 · 2024-08-06T17:23:50Z

Hello @jeejeelee! Just a follow-up question: are you interested in implementing Idefics3 eventually? (It's not available on transformers yet since PR for this model is still WIP)

jeejeelee · 2024-08-07T01:18:50Z

Hello @jeejeelee! Just a follow-up question: are you interested in implementing Idefics3 eventually? (It's not available on transformers yet since PR for this model is still WIP)

I'd be happy to implement this, but I might not be able to start working on it until next week.

ywang96 · 2024-08-07T06:27:40Z

Hello @jeejeelee! Just a follow-up question: are you interested in implementing Idefics3 eventually? (It's not available on transformers yet since PR for this model is still WIP)

I'd be happy to implement this, but I might not be able to start working on it until next week.

@jeejeelee No rush at all and thank you for the interest. Most likely we'll have to wait for the transformers PR to get into a better shape so we can verify model correctness anyways.

Co-authored-by: Cyrus Leung <[email protected]>

Co-authored-by: Cyrus Leung <[email protected]> Signed-off-by: Alvant <[email protected]>

Co-authored-by: Cyrus Leung <[email protected]>

jeejeelee added 4 commits July 31, 2024 13:10

init

4adafc2

fix interface bug

351a878

fix tp bug

005266d

done

9f2d168

cleanup code

6e12593

DarkLight1337 mentioned this pull request Aug 1, 2024

[Model] Further cleanup MiniCPM-V #6995

Closed

DarkLight1337 self-assigned this Aug 1, 2024

DarkLight1337 mentioned this pull request Aug 1, 2024

Adding idefics2 #4937

Open

Merge branch 'vllm-project:main' into refactor-minicpmv

db26cb6

DarkLight1337 reviewed Aug 2, 2024

View reviewed changes

vllm/model_executor/models/minicpmv.py Outdated Show resolved Hide resolved

HwwwwwwwH reviewed Aug 2, 2024

View reviewed changes

vllm/model_executor/models/minicpmv.py Outdated Show resolved Hide resolved

vllm/model_executor/models/minicpmv.py Show resolved Hide resolved

vllm/model_executor/models/minicpmv.py Outdated Show resolved Hide resolved

jeejeelee added 3 commits August 2, 2024 15:16

Merge branch 'vllm-project:main' into refactor-minicpmv

6ba5f19

refactor resampler

1bc5ae2

delete unused args

0ffa431

jeejeelee requested review from DarkLight1337 and HwwwwwwwH August 2, 2024 09:03

Relax validation

14e75a4

DarkLight1337 added 2 commits August 3, 2024 17:36

Fix wrong embedding for empty image

3fe1a33

Bugfix

ca67df6

HwwwwwwwH reviewed Aug 4, 2024

View reviewed changes

vllm/model_executor/models/minicpmv.py Outdated Show resolved Hide resolved

vllm/model_executor/models/minicpmv.py Show resolved Hide resolved

DarkLight1337 added 4 commits August 4, 2024 05:27

Fix dtype

c529fbe

Simplify

1d139b0

Merge branch 'main' into refactor-minicpmv

aed074a

Avoid unnecessary computation

d472dc2

DarkLight1337 enabled auto-merge (squash) August 4, 2024 07:24

DarkLight1337 approved these changes Aug 4, 2024

View reviewed changes

DarkLight1337 merged commit 179a6a3 into vllm-project:main Aug 4, 2024
66 of 67 checks passed

jeejeelee deleted the refactor-minicpmv branch August 4, 2024 23:35

ywang96 mentioned this pull request Aug 5, 2024

[Model] SiglipVisionModel ported from transformers #6942

Merged

dtrifiro mentioned this pull request Aug 5, 2024

Sync with [email protected] opendatahub-io/vllm#120

Closed

ywang96 mentioned this pull request Aug 5, 2024

[RFC]: Multi-modality Support on vLLM #4194

Open

51 tasks

sfc-gh-mkeralapura pushed a commit to sfc-gh-mkeralapura/vllm that referenced this pull request Aug 12, 2024

[Model]Refactor MiniCPMV (vllm-project#7020)

7c5bc0a

Co-authored-by: Cyrus Leung <[email protected]>

kylesayrs pushed a commit to neuralmagic/vllm that referenced this pull request Aug 17, 2024

[Model]Refactor MiniCPMV (vllm-project#7020)

7cb1c78

Co-authored-by: Cyrus Leung <[email protected]>

Alvant pushed a commit to compressa-ai/vllm that referenced this pull request Oct 26, 2024

[Model]Refactor MiniCPMV (vllm-project#7020)

23f3a63

Co-authored-by: Cyrus Leung <[email protected]> Signed-off-by: Alvant <[email protected]>

jeejeelee mentioned this pull request Oct 28, 2024

[Model] Add Idefics3 support #9767

Merged

2 tasks

KuntaiDu pushed a commit to KuntaiDu/vllm that referenced this pull request Nov 20, 2024

[Model]Refactor MiniCPMV (vllm-project#7020)

1221e13

Co-authored-by: Cyrus Leung <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Model]Refactor MiniCPMV #7020

[Model]Refactor MiniCPMV #7020

jeejeelee commented Aug 1, 2024

github-actions bot commented Aug 1, 2024

ywang96 commented Aug 1, 2024 •

edited

Loading

jeejeelee commented Aug 1, 2024 •

edited

Loading

DarkLight1337 commented Aug 1, 2024 •

edited

Loading

ywang96 commented Aug 1, 2024

jeejeelee commented Aug 1, 2024

jeejeelee commented Aug 1, 2024

jeejeelee commented Aug 1, 2024

DarkLight1337 commented Aug 1, 2024

jeejeelee commented Aug 1, 2024

DarkLight1337 commented Aug 1, 2024 •

edited

Loading

jeejeelee commented Aug 1, 2024

HwwwwwwwH commented Aug 1, 2024

HwwwwwwwH commented Aug 1, 2024

jeejeelee commented Aug 3, 2024

DarkLight1337 commented Aug 3, 2024

DarkLight1337 commented Aug 3, 2024

HwwwwwwwH commented Aug 3, 2024

HwwwwwwwH commented Aug 4, 2024

jeejeelee commented Aug 4, 2024

HwwwwwwwH commented Aug 4, 2024

ywang96 commented Aug 6, 2024

jeejeelee commented Aug 7, 2024

ywang96 commented Aug 7, 2024 •

edited

Loading

[Model]Refactor MiniCPMV #7020

[Model]Refactor MiniCPMV #7020

Conversation

jeejeelee commented Aug 1, 2024

github-actions bot commented Aug 1, 2024

ywang96 commented Aug 1, 2024 • edited Loading

jeejeelee commented Aug 1, 2024 • edited Loading

DarkLight1337 commented Aug 1, 2024 • edited Loading

ywang96 commented Aug 1, 2024

jeejeelee commented Aug 1, 2024

jeejeelee commented Aug 1, 2024

jeejeelee commented Aug 1, 2024

DarkLight1337 commented Aug 1, 2024

jeejeelee commented Aug 1, 2024

DarkLight1337 commented Aug 1, 2024 • edited Loading

jeejeelee commented Aug 1, 2024

HwwwwwwwH commented Aug 1, 2024

HwwwwwwwH commented Aug 1, 2024

jeejeelee commented Aug 3, 2024

DarkLight1337 commented Aug 3, 2024

DarkLight1337 commented Aug 3, 2024

HwwwwwwwH commented Aug 3, 2024

HwwwwwwwH commented Aug 4, 2024

jeejeelee commented Aug 4, 2024

HwwwwwwwH commented Aug 4, 2024

ywang96 commented Aug 6, 2024

jeejeelee commented Aug 7, 2024

ywang96 commented Aug 7, 2024 • edited Loading

ywang96 commented Aug 1, 2024 •

edited

Loading

jeejeelee commented Aug 1, 2024 •

edited

Loading

DarkLight1337 commented Aug 1, 2024 •

edited

Loading

DarkLight1337 commented Aug 1, 2024 •

edited

Loading

ywang96 commented Aug 7, 2024 •

edited

Loading