Add example for OPT model with distribution. #1727
base: master
Conversation
Thanks for the PR!
```python
support in the coming future.
"""

import os
```
Please group the imports at the top.
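For example, something along these lines (the `keras_nlp` import is my assumption based on the example's topic, not necessarily the exact set the script needs):

```python
# All imports grouped at the top of the example.
import os

import keras
import keras_nlp
```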
```python
print(keras.version())
print(keras.backend.backend())

keras.mixed_precision.set_global_policy("mixed_float16")
```
Please add a comment about using mixed precision.
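Something brief would do, e.g. (the exact wording is just a sketch):

```python
# Run computations in float16 while keeping variables in float32. This roughly
# halves activation memory and speeds up inference on recent GPUs.
keras.mixed_precision.set_global_policy("mixed_float16")
```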
```python
count other items like optimizer states, as well as the forward and backward
passes.
"""
# model_spec = 'opt_6.7b_en'
# language_model = create_opt_model(model_spec)
```
Should this be uncommented?
```python
# Create a 2D mesh for model parallelism; change the mesh shape to tune the
# ratio of data/model parallelism.
_BATCH_DIM_NAME = "batch"
```
No need for the leading underscores.
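For example, renamed and shown in context; this is a sketch assuming the Keras 3 `keras.distribution` API, and `MODEL_DIM_NAME` plus the `(1, 8)` shape are illustrative:

```python
BATCH_DIM_NAME = "batch"
MODEL_DIM_NAME = "model"

# Create a 2D mesh for model parallelism; change the mesh shape, e.g. to
# (2, 4), to tune the ratio of data/model parallelism.
device_mesh = keras.distribution.DeviceMesh(
    shape=(1, 8),
    axis_names=[BATCH_DIM_NAME, MODEL_DIM_NAME],
    devices=keras.distribution.list_devices(),
)
```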
```python
generate function with XLA. The follow-up runs will be much faster.
"""
prompt = "What is machine learning?"
print(large_model.generate(prompt))
```
Please add a second prompt, possibly with some `time()` calls, to demonstrate the regular (post-compilation) step time.
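Something like the following (the second prompt and the `time.time()` bookkeeping are illustrative):

```python
import time

prompt = "What is machine learning?"

# The first call compiles the generate function with XLA, so it is slow.
start = time.time()
print(large_model.generate(prompt))
print(f"First (compiling) call: {time.time() - start:.2f}s")

# Follow-up calls reuse the compiled function and show the regular step time.
start = time.time()
print(large_model.generate("What is deep learning?"))
print(f"Second (post-compilation) call: {time.time() - start:.2f}s")
```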
""" | ||
## Introduction to KerasNLP |
I suggest focusing the example purely on the distribution aspects, so we can replace the KerasNLP intro with ~1 sentence. Meanwhile maybe we could flesh out the distribution part, e.g. include fine-tuning or other inference performance considerations.
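To sketch what the fleshed-out distribution section could cover, assuming the Keras 3 `keras.distribution` API (the variable-path regexes and layout rules below are illustrative, not taken from this PR):

```python
# Shard the large weight matrices over the "model" mesh axis and replicate
# everything else. LayoutMap keys are regexes matched against variable paths.
layout_map = keras.distribution.LayoutMap(device_mesh)
layout_map["token_embedding/embeddings"] = (None, "model")
layout_map[".*(query|key|value)/kernel"] = (None, "model", None)

model_parallel = keras.distribution.ModelParallel(
    layout_map=layout_map, batch_dim_name="batch"
)
keras.distribution.set_distribution(model_parallel)
```

Once the distribution is set, both `generate()` and `fit()` run sharded, so the same setup could demonstrate fine-tuning as well as inference timing.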
@fchollet, this is the draft of the OPT model inference example with the Keras distribution API (I can add the fine-tuning parts later). Do you still have the instructions for converting the .py example to a Colab/MD file?
This one requires 8 V100 GPUs to run, and I think we would need A100s to properly simulate a fine-tuning workflow.