[QUESTION] how to profile bubble time in pipeline parallelism? #1190
Replies: 2 comments
-
O paralelismo de pipeline é uma técnica crucial para treinamento distribuído em larga escala, mas pode sofrer com "bolhas" no pipeline. Essas bolhas são ineficiências que surgem devido a atrasos de sincronização entre diferentes estágios do pipeline. Vamos explorar algumas estratégias para lidar com isso:
|
Beta Was this translation helpful? Give feedback.
-
Marking as stale. No activity in 60 days. |
Beta Was this translation helpful? Give feedback.
-
Your question
Ask a clear and concise question about Megatron-LM.
How can I profile bubble time and p2p comm time in pipeline parallelism?
Beta Was this translation helpful? Give feedback.
All reactions