Kernel Float is a header-only library for CUDA that simplifies working with vector types and reduced-precision floating-point arithmetic in GPU code.
This release renames several of the operations in the library and refactors many of its internals. Most notably, any operation that cannot be performed in 16-bit precision is now performed in single precision, and the result is downcast back to the lower precision.
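The upcast-compute-downcast pattern can be sketched in plain CUDA as follows. This is only an illustration and does not use Kernel Float's own API: `pow_via_float` and `pow_kernel` are hypothetical names, and `powf` is picked as an example of an operation assumed to have no native half-precision version.

```cuda
#include <cuda_fp16.h>
#include <cmath>

// Hypothetical helper, not part of Kernel Float: computes x^y for half inputs.
// Assumes no half-precision pow is available, so the work is done in float.
__host__ __device__ inline __half pow_via_float(__half x, __half y) {
    float xf = __half2float(x);  // upcast both operands to single precision
    float yf = __half2float(y);
    float r = powf(xf, yf);      // perform the operation in single precision
    return __float2half(r);      // downcast the result back to half precision
}

// Hypothetical kernel applying the helper elementwise to n values.
__global__ void pow_kernel(const __half* x, const __half* y, __half* out, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        out[i] = pow_via_float(x[i], y[i]);
    }
}
```

The result is rounded only once, on the final downcast, so the intermediate computation keeps single-precision accuracy.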
Full Changelog: v0.1...v0.2