Update multigpu readme and MllamaForConditionalGeneration import #681

AAndersn · 2024-09-26T03:49:50Z

Fixes an incorrect parameter in the multigpu documentation, which gives the quantization type as `int4` instead of `4bit`.

Fixes an incorrect import of `MllamaForConditionalGeneration`: it was imported from `transformers` instead of `transformers.models.mllama.modeling_mllama`.

Fixes #680

Before submitting

[X] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
[X] Did you read the contributor guideline, Pull Request section?
[ ] Was this discussed/approved via a GitHub issue? Please add a link to it if that's the case.
[X] Did you make sure to update the documentation with your changes?
[ ] Did you write any new necessary tests?

Thanks for contributing 🎉!

wukaixingxp · 2024-09-26T16:25:33Z

Hi! Can you check your transformers version? I think the latest transformers release supports `from transformers import MllamaForConditionalGeneration`.

AAndersn · 2024-09-26T16:59:32Z

@wukaixingxp You are right.

I think I might have had transformers 4.44.0 installed when I created that bug ticket and PR last night. Re-running with transformers 4.45.0 this morning, `from transformers import MllamaForConditionalGeneration` works fine.
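For readers on mixed transformers versions, the two import paths discussed in this thread can be tried in order. The following is only a minimal sketch: `import_first_available` and `load_mllama_class` are hypothetical helper names made up for illustration, not part of transformers or of this repository, and the order of the module paths is an assumption based on the versions reported above.

```python
import importlib


def import_first_available(attr_name, module_paths):
    """Return `attr_name` from the first module in `module_paths` that provides it.

    Hypothetical helper for illustration; not part of any library.
    """
    errors = []
    for path in module_paths:
        try:
            module = importlib.import_module(path)
            return getattr(module, attr_name)
        except (ImportError, AttributeError) as exc:
            # Remember why this path failed, then try the next one.
            errors.append(f"{path}: {exc}")
    raise ImportError(
        f"could not import {attr_name!r} from any of {module_paths}: "
        + "; ".join(errors)
    )


def load_mllama_class():
    # Try the top-level import first (reported working with transformers
    # 4.45.0 in this thread), then the submodule path this PR proposed.
    return import_first_available(
        "MllamaForConditionalGeneration",
        ["transformers", "transformers.models.mllama.modeling_mllama"],
    )
```

If neither path provides the class, `load_mllama_class()` raises an `ImportError` that lists each attempted path and its failure, which makes a version mismatch easier to spot than a bare traceback.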

wukaixingxp · 2024-09-26T17:30:12Z

Thanks for your help! We want to bump the transformers version to 4.45.0, but that release has a bug, as described in my PR. For now, people must pip install transformers from source to avoid the bug. We are waiting for a new release of the transformers pip package.

init27 · 2024-10-04T00:00:27Z

@AAndersn thanks again for the PR. Since this is fixed in the latest HF version, can you take another pass please?

AAndersn · 2024-10-10T05:19:06Z

Sorry for the delay. Yes, I will update my branch to resolve the conflict with the portion that has already been fixed.

init27 · 2024-10-10T17:01:00Z

Many Thanks!

update mutligpu readme

188b9d9

facebook-github-bot added the cla signed label Sep 26, 2024

AAndersn mentioned this pull request Sep 26, 2024

llama finetune.py throws pytorch tensor datatype error with 4 bit quantization #675

Closed

2 tasks
