All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
- Nystrom causal attention [#75]
- More robust blocksparse [#24]
- Rotary embeddings [#32]
- More flexible layernorm [#50]