
Added mamba model support and test CI script #1573

Open
zzhang37 wants to merge 25 commits into main
Conversation

@zzhang37 zzhang37 commented Dec 6, 2024

What does this PR do?

Added Mamba model support using a custom op, and added a test for Mamba. The test command is
GAUDI2_CI=1 RUN_SLOW=1 python -m pytest tests/test_text_generation_example.py -s -v -k "mamba"
It passes on the 1.18.0 release and on 1.19.0 (build 497) with the corresponding .so files.

Fixes # (issue)

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

@zzhang37 zzhang37 requested a review from regisss as a code owner December 6, 2024 21:45
@zzhang37
Author

zzhang37 commented Dec 6, 2024

Just run this command:
GAUDI2_CI=1 RUN_SLOW=1 python -m pytest tests/test_text_generation_example.py -s -v -k "mamba"
It will pass on versions 1.18 and 1.19.

setup.py (review thread resolved, outdated)
Comment on lines 213 to 217
```bash
wget https://huggingface.co/Habana/mamba/resolve/main/hpu_custom_pscan_all.cpython-310-x86_64-linux-gnu_119.so
wget https://huggingface.co/Habana/mamba/resolve/main/libcustom_tpc_perf_lib_119.so
export HABANA_CUSTOM_OP_DIR=/path/to/directory/containing/hpu_custom_pscan_all.cpython-310-x86_64-linux-gnu_119.so
export GC_KERNEL_PATH=/path/to/libcustom_tpc_perf_lib_119.so:$GC_KERNEL_PATH
```
Collaborator

This should be done automatically in the script if the model type is Mamba

Author

The test already downloads them automatically, but users who want to run Mamba can just download the .so files from the Habana/mamba model card.

Author

I also removed the wget calls and use hf_hub_download in the test script.

Collaborator

I think this will discourage many users from using this model. Can't we put the code you added to the test file to download the .so files in a new utils file in the Mamba modeling folder? Then we could call it both here and in the test.

Author

This is just the README file.

Collaborator

I'm not sure what you mean. I think automatically downloading these .so files in the script when the model type is Mamba would give a better user experience. Why not do it there?

Collaborator

@zzhang37 it seems the Mamba run code fails if we don't set this environment variable at all. Is this intended? Can customers run with or without the custom code? If the intention is to always run Mamba with the custom kernel, we should put all this download/setup code at the beginning of the modeling_mamba file, before the library is loaded.

Author

@regisss Removed all the wget calls and added a util to do the job.
@jiminha Just checked with the latest build 538; it works fine.
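The utility could look roughly like the following sketch. This is an assumption about its shape, not the PR's actual code: the function name, the tuple of file names (taken from the README snippet above), and the injectable `downloader` hook are all mine.

```python
# Hedged sketch of a download utility for the Mamba .so files.
# The function name and the `downloader` injection point are assumptions,
# not the PR's actual code; the repo and file names come from the README.
MAMBA_REPO = "Habana/mamba"
MAMBA_FILES = (
    "hpu_custom_pscan_all.cpython-310-x86_64-linux-gnu_119.so",
    "libcustom_tpc_perf_lib_119.so",
)

def download_mamba_libs(downloader=None):
    """Return local paths to the two Mamba custom-op libraries."""
    if downloader is None:
        # Imported lazily so the helper stays testable without network access.
        from huggingface_hub import hf_hub_download
        downloader = hf_hub_download
    return tuple(downloader(repo_id=MAMBA_REPO, filename=f) for f in MAMBA_FILES)
```

Injecting the downloader keeps the cache-resolution logic unit-testable; in production it defaults to `hf_hub_download`, which reuses the standard Hugging Face cache.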

README.md (review thread resolved)

github-actions bot commented Dec 9, 2024

The code quality check failed, please run make style.

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@zzhang37
Author

Fixed the make style issue.

@zzhang37 zzhang37 requested a review from mandy-li as a code owner December 11, 2024 19:35
@libinta libinta added the run-test Run CI for PRs from external contributors label Dec 12, 2024
@zzhang37 zzhang37 requested a review from libinta as a code owner December 12, 2024 17:43
@regisss
Collaborator

regisss commented Dec 12, 2024

@zzhang37 There are some new issues now, but they should be quite straightforward to fix:

  • Calling adapt_transformers_to_gaudi returns an error if GC_KERNEL_PATH has not been specified beforehand. I suggest running this piece of code only if GC_KERNEL_PATH is in os.environ. If I understood correctly, GC_KERNEL_PATH should be specified only if the user intends to use Mamba, right?
  • The test doesn't work because GC_KERNEL_PATH is not specified. You should specify it, similarly to how it is done here, if the tested model is Mamba.
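The first point could be handled with a guard along these lines (a minimal sketch under my own naming; `maybe_register_mamba_custom_op` and `register_fn` are hypothetical, not functions from this PR):

```python
import os

def maybe_register_mamba_custom_op(register_fn):
    """Run the Mamba custom-op registration only when the user has opted in
    by setting GC_KERNEL_PATH, so adapt_transformers_to_gaudi keeps working
    for non-Mamba models. Returns True if the registration ran."""
    if "GC_KERNEL_PATH" not in os.environ:
        return False  # user is not running Mamba with the custom kernels
    register_fn()
    return True
```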

@zzhang37
Author

zzhang37 commented Dec 12, 2024

@regisss GC_KERNEL_PATH is used by all models. There is a default path for GC_KERNEL_PATH, but Mamba requires an additional path to be added. The problem is that GC_KERNEL_PATH needs to be set before the script starts, since Synapse uses this path to load the TPC kernels at startup. If we append the additional path to GC_KERNEL_PATH in the middle of the script, the extra kernels in that path will not be loaded, and Mamba will not be able to use them. That is why, when running the Mamba model, we need to add the additional path to GC_KERNEL_PATH beforehand rather than mid-script. The command we should use is
GC_KERNEL_PATH=/root/.cache/huggingface/hub/models--Habana--mamba/blobs/libcustom_tpc_perf_lib.so:$GC_KERNEL_PATH python run_generation.py .....
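The prepend-before-launch rule can be illustrated with a small helper (purely illustrative, and my own naming: it only builds the colon-separated string value; setting it from inside an already-running process would be too late, as explained above):

```python
import os

def build_kernel_path(extra_so, current=None):
    """Return the GC_KERNEL_PATH value with extra_so prepended.
    The result must be exported before the Python process starts,
    because Synapse reads GC_KERNEL_PATH only once at startup."""
    if current is None:
        current = os.environ.get("GC_KERNEL_PATH", "")
    return f"{extra_so}:{current}" if current else extra_so
```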
