rocSOLVER-3.9.0 for ROCm 4.1.0
New Features
Added
- Eigensolver routines for symmetric/hermitian matrices:
STERF, STEQR - Linear solvers for general non-square systems:
GELS (API added with batched and strided_batched versions. Only the overdetermined non-transpose case is implemented in this release. Other cases will return rocblas_status_not_implemented status for now.) - Extended test coverage for functions returning info
- Changelog file
- Tridiagonalization routines for symmetric and hermitian matrices:
LATRD
SYTD2, SYTRD (with batched and strided_batched versions)
HETD2, HETRD (with batched and strided_batched versions) - Sample code and unit test for unified memory model/Heterogeneous Memory Management (HMM)
Optimizations
- Improved performance of LU factorization of small and mid-size matrices (n > 2048)
Changed
- Raised minimum requirement for building rocSOLVER from source to CMake 3.8
- Switched to use semantic versioning for the library
- Enabled automatic reallocation of memory workspace in rocsolver clients
Removed
- Removed -DOPTIMAL from the roc::rocsolver CMake usage requirements. This is an internal rocSOLVER definition, and does not need to be defined by library users
Fixed
- Fixed runtime errors in debug mode caused by incorrect kernel launch bounds
- Fixed complex unit test bug caused by incorrect zaxpy function signature
- Eliminated a small memory transfer that was being done on the default stream
- Fixed GESVD right singular vectors for 1x1 matrices
Known Issues
- None