Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

是否支持Megatron-Deepspeed的模型并行? #3219

Open
RyanOvO opened this issue Feb 21, 2025 · 0 comments
Open

是否支持Megatron-Deepspeed的模型并行? #3219

RyanOvO opened this issue Feb 21, 2025 · 0 comments

Comments

@RyanOvO
Copy link

RyanOvO commented Feb 21, 2025

背景:
目前deepspeed推出了类似Metagtron的模型并行功能,即Deepspeed版的Megatron-LM的模型并行功能。但当前swift所推出的ds并行训练配置模板yaml中并没有相关的配置,现阶段swift提供的ds功能都是zero数据并行的配置。

期望:
swift能提供Megatron-Deepspeed的模型并行的配置模板,且支持到昇腾体系嘛?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant