Skip to content

[Nemo1] Generate sharded optimizer state dicts only if needed for saving #27159

[Nemo1] Generate sharded optimizer state dicts only if needed for saving

[Nemo1] Generate sharded optimizer state dicts only if needed for saving #27159