-
Notifications
You must be signed in to change notification settings - Fork 217
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
support llava1.5 lora finetuning. #1487
base: main
Are you sure you want to change the base?
Conversation
@libinta , pls help review, validated pass in 1.19.0-410 build, thx. |
The code quality check failed, please run |
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update. |
@lkk12014402 thanks for your contribution. Can you explain why you created a new script, |
@lkk12014402 in your performance comparison above, what kind of optimization you did? I do not see any optimization. Can you elaborate? |
hi, llava's Processor, DataCollator, evaluation code, padding style are different from the |
padding inputs for static shape during training datacollator |
@lkk12014402 can you also add the tests results? Does your change breaks any unit tests (test_image_to_text_example)? >>> RUN_SLOW=true GAUDI2_CI=1 python -m pytest tests/test_image_to_text_example.py -v -s -k llava before and after your changes? |
I ran the test main >>> RUN_SLOW=true GAUDI2_CI=1 python -m pytest tests/test_image_to_text_example.py -v -s -k llava
10 passed, 4 deselected in 4419.67s (1:13:39) this PR >>> RUN_SLOW=true GAUDI2_CI=1 python -m pytest tests/test_image_to_text_example.py -v -s -k llava
10 passed, 4 deselected in 2320.45s (0:38:40) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I am still not sure about adding a new script but other than that it LGTM!
@regisss would you please check this PR.
Thanks~ @yafshar I will merge llava |
@lkk12014402 thanks. Can you do the full test to make sure your changes did not break anything? >>> RUN_SLOW=true GAUDI2_CI=1 python -m pytest tests/test_image_to_text_example.py -v -s |
OK, I ran the tests. They seem to be fine. Would you please try some other examples main branch >>> RUN_SLOW=true GAUDI2_CI=1 python -m pytest tests/test_image_to_text_example.py -v -s
14 passed in 1957.63s (0:32:37) this PR >>> RUN_SLOW=true GAUDI2_CI=1 python -m pytest tests/test_image_to_text_example.py -v -s
14 passed in 1428.22s (0:23:48) |
hi, @regisss please review~ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
based on regiss feedback: please add/update relevant test(s) for new script/model: https://github.com/huggingface/optimum-habana/blob/main/tests/test_image_to_text_example.py
What does this PR do?
add llava1.5 finetuning and add an example