support llava1.5 lora finetuning. #1487

lkk12014402 · 2024-11-14T17:32:27Z

What does this PR do?

add llava1.5 finetuning and add an example

lkk12014402 · 2024-11-14T17:34:33Z

performance comparison

finetuning on gaudi

Before optimization

After optimization (this pr)

finetuning on a100

yao-matrix · 2024-11-27T02:05:18Z

@libinta , pls help review, validated pass in 1.19.0-410 build, thx.

github-actions · 2024-11-29T22:12:39Z

The code quality check failed, please run make style.

HuggingFaceDocBuilderDev · 2024-11-29T22:15:53Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

yafshar · 2024-12-02T17:47:54Z

@lkk12014402 thanks for your contribution.

Can you explain why you created a new script, run_llava_lora_finetune.py, which is very similar to run_image2text_lora_finetune.py, instead of modifying the original script? Introducing new code can increase the possibility of errors and make maintenance more challenging.

optimum/habana/transformers/models/llava/modeling_llava.py

yafshar · 2024-12-02T19:19:46Z

@lkk12014402 in your performance comparison above, what kind of optimization you did? I do not see any optimization. Can you elaborate?

lkk12014402 · 2024-12-04T07:49:44Z

@lkk12014402 thanks for your contribution.

Can you explain why you created a new script, run_llava_lora_finetune.py, which is very similar to run_image2text_lora_finetune.py, instead of modifying the original script? Introducing new code can increase the possibility of errors and make maintenance more challenging.

hi, llava's Processor, DataCollator, evaluation code, padding style are different from the run_image2text_lora_finetune.py. If we merge the llava's code into the file run_image2text_lora_finetune.py, some if-else code will be ingested into the file. And I think each model maybe have its own Processor and DataCollator, or suitable dataset, So it is clear to create a new script.

lkk12014402 · 2024-12-04T08:33:20Z

@lkk12014402 in your performance comparison above, what kind of optimization you did? I do not see any optimization. Can you elaborate?

padding inputs for static shape during training datacollator

yafshar · 2024-12-05T15:48:55Z

@lkk12014402 can you also add the tests results? Does your change breaks any unit tests (test_image_to_text_example)?

>>> RUN_SLOW=true GAUDI2_CI=1 python -m pytest tests/test_image_to_text_example.py -v -s -k llava

before and after your changes?

yafshar · 2024-12-05T20:05:45Z

I ran the test

main

>>> RUN_SLOW=true GAUDI2_CI=1 python -m pytest tests/test_image_to_text_example.py -v -s -k llava

10 passed, 4 deselected in 4419.67s (1:13:39)

this PR

>>> RUN_SLOW=true GAUDI2_CI=1 python -m pytest tests/test_image_to_text_example.py -v -s -k llava

10 passed, 4 deselected in 2320.45s (0:38:40)

yafshar

I am still not sure about adding a new script but other than that it LGTM!

@regisss would you please check this PR.

lkk12014402 · 2024-12-06T05:32:56Z

I ran the test

main

>>> RUN_SLOW=true GAUDI2_CI=1 python -m pytest tests/test_image_to_text_example.py -v -s -k llava

10 passed, 4 deselected in 4419.67s (1:13:39)

this PR

>>> RUN_SLOW=true GAUDI2_CI=1 python -m pytest tests/test_image_to_text_example.py -v -s -k llava

10 passed, 4 deselected in 2320.45s (0:38:40)

Thanks~ @yafshar I will merge llava run_llava_lora_finetune.py into the run_image2text_lora_finetune.py

lkk12014402 · 2024-12-06T10:09:08Z

I am still not sure about adding a new script but other than that it LGTM!

@regisss would you please check this PR.

hi, @yafshar I have merged the 2 scripts.

Thanks~

yafshar · 2024-12-06T13:38:27Z

@lkk12014402 thanks. Can you do the full test to make sure your changes did not break anything?

>>> RUN_SLOW=true GAUDI2_CI=1 python -m pytest tests/test_image_to_text_example.py -v -s

yafshar · 2024-12-06T23:25:27Z

OK, I ran the tests. They seem to be fine. Would you please try some other examples

main branch

>>> RUN_SLOW=true GAUDI2_CI=1 python -m pytest tests/test_image_to_text_example.py -v -s 
14 passed in 1957.63s (0:32:37)

this PR

>>> RUN_SLOW=true GAUDI2_CI=1 python -m pytest tests/test_image_to_text_example.py -v -s 
14 passed in 1428.22s (0:23:48)

lkk12014402 · 2024-12-09T03:35:03Z

hi, @regisss please review~

vidyasiv

based on regiss feedback: please add/update relevant test(s) for new script/model: https://github.com/huggingface/optimum-habana/blob/main/tests/test_image_to_text_example.py

support llava1.5 lora finetuning.

78d1ab6

lkk12014402 requested a review from regisss as a code owner November 14, 2024 17:32

lkk12014402 and others added 2 commits December 2, 2024 14:12

Merge branch 'main' into llava1.5

bd85520

make style

9e22eb4

yafshar reviewed Dec 2, 2024

View reviewed changes

optimum/habana/transformers/models/llava/modeling_llava.py Outdated Show resolved Hide resolved

yafshar reviewed Dec 2, 2024

View reviewed changes

optimum/habana/transformers/models/llava/modeling_llava.py Outdated Show resolved Hide resolved

yafshar reviewed Dec 2, 2024

View reviewed changes

optimum/habana/transformers/models/llava/modeling_llava.py Outdated Show resolved Hide resolved

yafshar reviewed Dec 2, 2024

View reviewed changes

optimum/habana/transformers/models/llava/modeling_llava.py Outdated Show resolved Hide resolved

lkk12014402 and others added 3 commits December 4, 2024 09:53

Merge branch 'huggingface:main' into llava1.5

2e3580c

for transformers==v4.45.2.

e0be37f

for transformers==v4.45.2.

1ff6602

yafshar approved these changes Dec 5, 2024

View reviewed changes

lkk12014402 closed this Dec 6, 2024

lkk12014402 reopened this Dec 6, 2024

Merge branch 'huggingface:main' into llava1.5

d83743b

merge two scripts.

9a547b5

vidyasiv suggested changes Dec 10, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

support llava1.5 lora finetuning. #1487

support llava1.5 lora finetuning. #1487

lkk12014402 commented Nov 14, 2024

lkk12014402 commented Nov 14, 2024 •

edited

Loading

yao-matrix commented Nov 27, 2024

github-actions bot commented Nov 29, 2024

HuggingFaceDocBuilderDev commented Nov 29, 2024

yafshar commented Dec 2, 2024

yafshar commented Dec 2, 2024

lkk12014402 commented Dec 4, 2024

lkk12014402 commented Dec 4, 2024 •

edited

Loading

yafshar commented Dec 5, 2024 •

edited

Loading

yafshar commented Dec 5, 2024

yafshar left a comment

lkk12014402 commented Dec 6, 2024

lkk12014402 commented Dec 6, 2024

yafshar commented Dec 6, 2024 •

edited

Loading

yafshar commented Dec 6, 2024

lkk12014402 commented Dec 9, 2024

vidyasiv left a comment •

edited

Loading

support llava1.5 lora finetuning. #1487

Are you sure you want to change the base?

support llava1.5 lora finetuning. #1487

Conversation

lkk12014402 commented Nov 14, 2024

What does this PR do?

lkk12014402 commented Nov 14, 2024 • edited Loading

performance comparison

finetuning on gaudi

Before optimization

After optimization (this pr)

finetuning on a100

yao-matrix commented Nov 27, 2024

github-actions bot commented Nov 29, 2024

HuggingFaceDocBuilderDev commented Nov 29, 2024

yafshar commented Dec 2, 2024

yafshar commented Dec 2, 2024

lkk12014402 commented Dec 4, 2024

lkk12014402 commented Dec 4, 2024 • edited Loading

yafshar commented Dec 5, 2024 • edited Loading

yafshar commented Dec 5, 2024

yafshar left a comment

Choose a reason for hiding this comment

lkk12014402 commented Dec 6, 2024

lkk12014402 commented Dec 6, 2024

yafshar commented Dec 6, 2024 • edited Loading

yafshar commented Dec 6, 2024

lkk12014402 commented Dec 9, 2024

vidyasiv left a comment • edited Loading

Choose a reason for hiding this comment

lkk12014402 commented Nov 14, 2024 •

edited

Loading

lkk12014402 commented Dec 4, 2024 •

edited

Loading

yafshar commented Dec 5, 2024 •

edited

Loading

yafshar commented Dec 6, 2024 •

edited

Loading

vidyasiv left a comment •

edited

Loading