Releases: JuliaGPU/AMDGPU.jl

v0.8.4

14 Jan 12:37

AMDGPU v0.8.4

Diff since v0.8.3

Merged pull requests:

v0.8.3

27 Dec 13:19

AMDGPU v0.8.3

Diff since v0.8.2

Merged pull requests:

Closed issues:

  • AMDGPU fails test and crashes when initialized (#570)
  • Update rocSPARSE to ROCm 6.0 (#571)

v0.8.2

05 Dec 11:26

AMDGPU v0.8.2

Diff since v0.8.1

Merged pull requests:

Closed issues:

  • has_rocm_gpu() fails (#564)

v0.8.1

02 Dec 20:29

AMDGPU v0.8.1

Diff since v0.8.0

Merged pull requests:

Closed issues:

  • Implement sparse BLAS routines (#15)
  • Implement iterative solvers (#13)
  • Create a Docker image for AMDGPU.jl (#33)
  • Implement batched off-thread HSA signal waiting (#128)
  • HSA_STATUS_ERROR_INVALID_CODE_OBJECT on gfx803 (#192)
  • hsa_executable_freeze can hang during high GPU load (#208)
  • Implement copy!() (#218)
  • ROCM/Hip not downloading (?) when ]added (#230)
  • mapreducedim! is not implemented for AnyROCArray Types (#234)
  • Test of AMDGPU fails on 5900HX - hipErrorNoBinaryForGpu (#244)
  • Don't disable ROCm external library type definitions when non-functional (#350)
  • AMDGPU.jl doesn't seem to work with 7900 series GPUs (#371)
  • Support for rand from Julia Base on device code (#378)
  • Detect hardware queue limit and use to limit queue pool size (#403)
  • AMDGPU on windows (#465)
  • Rely on Atomix.jl for atomics (#547)

v0.8.0

23 Nov 13:41
22b78e6

AMDGPU v0.8.0

This release brings initial support for Windows (see requirements).
"Mixed-mode" has been removed; everything is now done automatically under the hood.

Diff since v0.7.4

Merged pull requests:

  • ROCm discovery for Windows (#542) (@pxl-th)
  • Fix kernel compilation on Windows (#543) (@pxl-th)
  • [Windows] Fix D2H memcopy & don't test unsupported functionality (#544) (@pxl-th)

Closed issues:

  • Fails to load on AMD Ryzen 9 7950X integrated graphics (#401)
  • Support for ROCm 5.7.1 (#522)
  • Mixed Device Libs Not Detected if Not in Project (#534)

v0.7.4

19 Nov 19:32

AMDGPU v0.7.4

Diff since v0.7.3

Merged pull requests:

Closed issues:

  • Investigate GPUArrays tests suite error (#515)
  • Multiple workers hang test suite on Julia 1.10 (#521)
  • [rocSPARSE] ILU(0) and IC(0) preconditioners are not working (#532)
  • Hostcall tests hang (#537)

v0.7.3

25 Oct 11:34

AMDGPU v0.7.3

Diff since v0.7.2

Merged pull requests:

Closed issues:

  • AMDGPU 0.7.x target error on Frontier (#530)

v0.7.2

23 Oct 15:26

AMDGPU v0.7.2

Diff since v0.7.1

Merged pull requests:

v0.7.1

23 Oct 14:05

AMDGPU v0.7.1

Diff since v0.7.0

Merged pull requests:

Closed issues:

  • Support for multi-GPU nodes broken in 0.7 (#527)

v0.7.0

23 Oct 11:11

AMDGPU v0.7.0

Diff since v0.6.1

Merged pull requests:

Closed issues:

  • Runtime Locking (#64)
  • 2x slower AMDGPU.jl kernel compared to HIP (#331)
  • sincos() x3.5 slower than separate sin()/cos() calls (#341)
  • HSA memory fault using AMDGPU.rand() on device ≠ 1 (#386)
  • WARNING: could not import AMDGPU.device_libs_path into Compiler (#434)
  • sincos intrinsic is broken with GPUCompiler 0.24 (#502)
  • Navi 3 causes malloc(): unsorted double linked list corrupted (#518)