[Question] Pipeline Parallel for Mamba Megatron #11008
Unanswered
zixianwang2022
asked this question in
Q&A
Replies: 1 comment
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi,
I am wondering if the scripts implemented in Mamba Megatron pipeline parallel will be possibly used inside Nemo? Specifically Mamba Megatron supports converting Mamba both to Tensor Parallel & Pipeline Parallel at here. The pipeline parallel code has been using Megatron's without interleaved implementation. Will we be able to utilize any of that Megatron implementation for SFT on Mamba in Nemo?
Thank you!
Beta Was this translation helpful? Give feedback.
All reactions