rocBLAS-2.36.0 for ROCm 4.1.0

saadrahim released this 23 Mar 01:18
93c8293

Added

  • Added a numerical checking helper function to detect zero/NaN/Inf in the input and output vectors of rocBLAS level 1 and 2 functions.
  • Added a numerical checking helper function to detect zero/NaN/Inf in the input and output general matrices of rocBLAS level 2 and 3 functions.
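
To illustrate what such a check looks for, here is a minimal host-side sketch in C++. It is not the rocBLAS helper itself: the names `NumericalCheck` and `check_vector` are hypothetical, and the real helper operates on device memory. The sketch only shows the idea of scanning a strided vector and flagging zero, NaN, and Inf entries.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>

// Hypothetical result type for this sketch; not a rocBLAS structure.
struct NumericalCheck {
    bool has_zero = false;
    bool has_nan  = false;
    bool has_inf  = false;
};

// Scan n elements of x, spaced incx apart, and record which special
// values occur. Host-side illustration only.
template <typename T>
NumericalCheck check_vector(const T* x, std::size_t n, std::size_t incx = 1)
{
    assert(incx > 0);
    NumericalCheck result;
    for (std::size_t i = 0; i < n; ++i) {
        const T v = x[i * incx];
        if (v == T(0))     result.has_zero = true; // matches +0 and -0
        if (std::isnan(v)) result.has_nan  = true;
        if (std::isinf(v)) result.has_inf  = true; // matches +Inf and -Inf
    }
    return result;
}
```

A matrix variant would apply the same test to each column, stepping by the leading dimension between columns.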

Fixed

  • Fixed complex unit test bug caused by incorrect caxpy and zaxpy function signatures.
  • Made functions compliant with legacy BLAS for the special values alpha == 0, k == 0, beta == 1, and beta == 0.
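
As an illustration of what these special values mean in legacy BLAS, the sketch below follows the reference (netlib) GEMM conventions for the non-transposed, column-major case. It is a host-side sketch, not rocBLAS code, and the name `ref_sgemm_nn` is hypothetical. The key points are that beta == 0 assigns zero to C instead of multiplying (so NaN/Inf already in C do not propagate), that alpha == 0 or k == 0 means A and B are never read, and that C is left untouched when beta == 1 and there is nothing to add.

```cpp
#include <cassert>
#include <cstddef>

// C := alpha * A * B + beta * C, column-major, no transposes.
// A is m x k (lda), B is k x n (ldb), C is m x n (ldc).
void ref_sgemm_nn(std::size_t m, std::size_t n, std::size_t k, float alpha,
                  const float* A, std::size_t lda,
                  const float* B, std::size_t ldb,
                  float beta, float* C, std::size_t ldc)
{
    assert(ldc >= m);
    // Quick return: the product contributes nothing and C is not scaled.
    if (m == 0 || n == 0 || ((alpha == 0.0f || k == 0) && beta == 1.0f))
        return;

    for (std::size_t j = 0; j < n; ++j) {
        float* c = C + j * ldc;
        if (beta == 0.0f) {
            // Assign, do not multiply: NaN/Inf in C must not survive.
            for (std::size_t i = 0; i < m; ++i) c[i] = 0.0f;
        } else if (beta != 1.0f) {
            for (std::size_t i = 0; i < m; ++i) c[i] *= beta;
        }
        if (alpha == 0.0f) continue; // A and B are never read
        for (std::size_t l = 0; l < k; ++l) { // no iterations when k == 0
            const float t = alpha * B[l + j * ldb];
            for (std::size_t i = 0; i < m; ++i) c[i] += t * A[i + l * lda];
        }
    }
}
```

With alpha == 0 or k == 0, passing null pointers for A and B is safe in this sketch because neither is dereferenced.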

Optimizations

  • Improved performance of single-precision axpy_batched and axpy_strided_batched when batch_count >= 8192.
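
For readers unfamiliar with the strided-batched form, the following host-side sketch shows what the operation computes: `batch_count` independent axpy updates, where batch b starts at offset `b * stridex` in x and `b * stridey` in y. The function name `ref_saxpy_strided_batched` is hypothetical, the sketch assumes positive increments, and it says nothing about how rocBLAS implements or optimizes the GPU kernels.

```cpp
#include <cassert>
#include <cstddef>

// For each batch b: y_b := alpha * x_b + y_b, with
// x_b = x + b * stridex and y_b = y + b * stridey.
void ref_saxpy_strided_batched(std::size_t n, float alpha,
                               const float* x, std::size_t incx, std::size_t stridex,
                               float* y, std::size_t incy, std::size_t stridey,
                               std::size_t batch_count)
{
    assert(incx > 0 && incy > 0); // this sketch does not handle negative increments
    // As in reference axpy, alpha == 0 leaves y untouched.
    if (n == 0 || batch_count == 0 || alpha == 0.0f) return;

    for (std::size_t b = 0; b < batch_count; ++b) {
        const float* xb = x + b * stridex;
        float*       yb = y + b * stridey;
        for (std::size_t i = 0; i < n; ++i)
            yb[i * incy] += alpha * xb[i * incx];
    }
}
```

The non-strided axpy_batched variant is the same computation with an array of per-batch pointers in place of a base pointer plus stride.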