We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
您好,请问下为啥Hybrid Parallel Plugin下TP显存比同配置下deepspeed要高???
The text was updated successfully, but these errors were encountered:
Bot detected the issue body's language is not English, translate it automatically. 👯👭🏻🧑🤝🧑👫🧑🏿🤝🧑🏻👩🏾🤝👨🏿👬🏿
Title: Is the TP memory under Hybrid Parallel Plugin higher than the deepspeed under the same configuration? ? ?
Hello, may I ask why the TP memory under Hybrid Parallel Plugin is higher than the deepspeed under the same configuration? ? ?
Sorry, something went wrong.
好像是显存碎片造成?而且很严重,请问下有对应的优化改进措施吗?
It seems to be caused by memory fragmentation? And it’s very serious. Are there any corresponding optimization and improvement measures?
No branches or pull requests
您好,请问下为啥Hybrid Parallel Plugin下TP显存比同配置下deepspeed要高???
The text was updated successfully, but these errors were encountered: