Replies: 2 comments
Marking as stale. No activity in 60 days.
Hi,
I'm curious whether it's possible to set `--rotary-seq-len-interpolation-factor` in Megatron-LM to match Hugging Face's `rope_scaling` setting (`{"type": "dynamic", "factor": 2.0}`).

Is there any information you can share on how the `--rotary-seq-len-interpolation-factor` option is used?
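
For context, here is a minimal pure-Python sketch of why the two settings may not be interchangeable. It assumes (this is not confirmed anywhere in the thread) that Megatron-LM treats the factor as linear position interpolation, dividing position indices by the factor, and that Hugging Face's `"dynamic"` type is dynamic NTK scaling, which enlarges the rotary base only when the sequence exceeds the trained length. The function names and the `max_pos` parameter are hypothetical and exist only for illustration.

```python
import math


def inv_freq(dim, base=10000.0):
    # Standard RoPE inverse frequencies, one per pair of channels.
    return [1.0 / (base ** (i / dim)) for i in range(0, dim, 2)]


def linear_interpolation_angles(seq_len, dim, factor, base=10000.0):
    # ASSUMPTION: linear position interpolation, i.e. every position
    # index is divided by `factor` before computing rotary angles.
    freqs = inv_freq(dim, base)
    return [[(pos / factor) * f for f in freqs] for pos in range(seq_len)]


def dynamic_ntk_angles(seq_len, dim, factor, max_pos, base=10000.0):
    # ASSUMPTION: dynamic NTK scaling. Positions are left untouched;
    # the base grows once seq_len exceeds the trained length max_pos.
    if seq_len > max_pos:
        scale = (factor * seq_len / max_pos) - (factor - 1)
        base = base * scale ** (dim / (dim - 2))
    freqs = inv_freq(dim, base)
    return [[pos * f for f in freqs] for pos in range(seq_len)]


if __name__ == "__main__":
    dim, max_pos, factor = 8, 16, 2.0
    lin = linear_interpolation_angles(32, dim, factor)
    dyn = dynamic_ntk_angles(32, dim, factor, max_pos)
    # Compare the highest-frequency channel at the same position.
    print("linear :", lin[10][0])
    print("dynamic:", dyn[10][0])
```

Under these assumptions, the linear scheme compresses every frequency by the same constant at all sequence lengths, while the dynamic scheme leaves the highest-frequency channel unchanged and only alters the lower frequencies once the input is longer than `max_pos`. If that reading is right, a single static factor cannot reproduce the dynamic behaviour exactly.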