
FSDP support #142

Open
ssmmnn11 opened this issue Feb 18, 2025 · 0 comments
Labels
enhancement New feature or request

Is your feature request related to a problem? Please describe.

Add support for Fully Sharded Data Parallel (FSDP) to enable training models with large parameter counts.

Describe the solution you'd like

We need to adapt the PyTorch Lightning FSDP strategy to implement our model and reader process groups:

https://github.com/Lightning-AI/pytorch-lightning/blob/master/src/lightning/fabric/strategies/fsdp.py

Potentially, we could also make use of the model-parallel strategy:

https://github.com/Lightning-AI/pytorch-lightning/blob/master/src/lightning/fabric/strategies/model_parallel.py
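As a rough illustration of the idea (not an implementation proposal), the ranks of the job would be partitioned into disjoint model-communication groups, so that FSDP shards parameters only within each group while reader groups feed data per group. The helper name `partition_ranks` below is hypothetical; in a real strategy subclass, each rank list would be passed to `torch.distributed.new_group()` during environment setup.

```python
# Illustrative sketch only: how ranks could be split into model
# communication groups before being handed to an adapted FSDP strategy.
# `partition_ranks` is a hypothetical helper, not an existing API.

def partition_ranks(world_size: int, model_group_size: int) -> list[list[int]]:
    """Split ranks [0, world_size) into consecutive model-comm groups."""
    assert world_size % model_group_size == 0, "world size must divide evenly"
    return [
        list(range(start, start + model_group_size))
        for start in range(0, world_size, model_group_size)
    ]

# Example: 8 GPUs with model groups of size 4 gives two groups; FSDP would
# shard parameters within each group of 4, mirroring how the existing
# model/reader group logic partitions the communicator.
groups = partition_ranks(8, 4)
print(groups)  # [[0, 1, 2, 3], [4, 5, 6, 7]]
```

The actual work would be overriding the process-group setup in Lightning's `FSDPStrategy` (linked above) so these groups are used in place of the default world group.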

Describe alternatives you've considered

No response

Additional context

No response

Organisation

ECMWF

@ssmmnn11 ssmmnn11 added the enhancement New feature or request label Feb 18, 2025