You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi,
I used the code to continue pretrain the model and used zero3 for model training. But I found my checkpoint file zero_pp_rank_*_mp_rank_00_model_states.pt is empty, the file only has model parameters name and shape, don't have the weights. Have you ever met this problem and how to fix?
Thanks!
Checklist
I have provided all relevant and necessary information above.
I have chosen a suitable title for this issue.
The text was updated successfully, but these errors were encountered:
My solution is to save checkpoints by myself or you can use zero_to_fp32
@mynewstart I found my converted ckpt global_step_xxx only contains meaningful *optim_states.pt but only empty *model_states.pt. Any clues on this? Thanks.
Required prerequisites
Questions
Hi,
I used the code to continue pretrain the model and used zero3 for model training. But I found my checkpoint file zero_pp_rank_*_mp_rank_00_model_states.pt is empty, the file only has model parameters name and shape, don't have the weights. Have you ever met this problem and how to fix?
Thanks!
Checklist
The text was updated successfully, but these errors were encountered: