Release AxoNN 0.1.0 · axonn-ai/axonn

AxoNN is a parallel framework for training deep neural networks.

Offers a hybrid of inter-layer parallelism with pipelining and data parallelism.
Supports both 16-bit mixed precision and 32-bit full precision training.
A highly efficient and scalable implementation of inter-layer parallelism with pipelining using asynchronous MPI-based communication and message-driven scheduling that achieves significant overlap of computation and communication.
Memory optimizations that can reduce the model state memory consumption by 5x for mixed precision training using the Adam optimizer and indirectly also increase hardware efficiency.

Provide feedback