Skip to content

Latest commit

 

History

History
146 lines (131 loc) · 9.22 KB

ReleaseNotes.md

File metadata and controls

146 lines (131 loc) · 9.22 KB

GPU Performance API Release Notes


Version 3.0 (3/19/17)

  • Add support for additional GPUs and APUs.
  • Support for collecting hardware counters for Vulkan and DirectX 12 applications.
  • Redesigned API to support modern graphics APIs.
  • The documentation has been rewritten and is now available in HTML format.
  • New counters added:
    • Cycle and count-based counters in addition to existing percentage-based counters.
    • New Depth Buffer memory read/write counters.
    • Additional Color Buffer memory counters.
    • For graphics, several global memory counters which were previously available only in the Compute Shader stage are now available generically.
  • Support for setting stable GPU clocks.
  • Counter Group Names can now be queried separately from Counter Descriptions.
  • Counters now have a UUID which can be used to uniquely identify a counter.
  • New entry point (GPA_GetFuncTable) to retrieve a table of function pointers for all GPA entry points.
  • New C++ GPAInterfaceLoader.h header file provides an easy way to load and use GPA entry points.
  • Bugs Fixed:
    • Fixed an issue with TesselatorBusy counter on many GFX8 GPUs.
    • Fixed an issue with FlatVMemInsts and CSFlatVMemInsts counters on many GFX8 GPUs.
    • Fixed an issue with LDSInsts counter on Vega GPUs.
    • Fixed some issues with Compute Shader counters on Vega GPUs.
    • Some counter combinations could lead to incorrect counter results.
    • Enabling counters in a certain order can lead to incorrect counter scheduling across multiple passes.
    • ROCm/HSA: GPA_OpenContext crashes if libhsa-runtime64.so.1 can't be found.
    • ROCm/HSA: GPA does not coexist nicely with an application that also sets the HSA_TOOLS_LIB environment variable.
    • OpenGL: Fixed a crash that can occur with an incorrectly-configured OpenGL driver.
    • OpenGL: Fixed some issues with OpenGL device-detection.

Version 2.23 (6/27/17)

  • Add support for additional GPUs, including Vega series GPUs
  • Allow unit tests to be built and run on Linux

Version 2.22 (4/26/17)

  • Add support for additional GPUs and APUs, including RX 500 series GPUs

Version 2.21 (12/21/16)

  • Add support for additional GPUs and APUs.
  • ROCm/HSA: Support for ROCm 1.3 and ROCm 1.4
  • OpenGL: Fixed an issue with GPU/APU detection with newer Radeon Crimson drivers

Version 2.20 (5/12/16)

  • Add support for additional GPUs and APUs.
  • Initial Public release of ROCm/HSA support
  • Developer Preview for DirectX12 (no hardware-based performance counter support)
  • Removed support for pre-GCN-based devices
  • Improve accuracy of the various *VALUBusy and *SALUBusy counters (all APIs)
  • OpenGL: Fixed possibly wrong GPUTime values for some applications
  • OpenGL: Add support for open source OpenGL driver
  • OpenGL: Fix value of TexUnitBusy counter for 3rd generation GCN hardware
  • DirectX 11: Fix incorrect type returned for D3DGPUTime counter
  • DirectX 11: Fix PSVALUBusy counter on 2nd and 3rd generation GCN hardware
  • DirectX 11: A separate DLL (GPUPerfAPIDXGetAMDDeviceInfo*.dll) is required in order to support certain multi-GPU configurations. This DLL will need to be deployed alongside GPUPerfAPIDX11*.dll on multi-GPU systems. See the User Guide for more information.

Version 2.17 (8/12/15)

  • Add support for additional GPUs and APUs.
  • Add OpenGL ES support for both Windows and Linux.
  • DirectX 11/OpenGL: Add CSFlatVMInsts and CSFlatLDSInsts counters to measure flat instructions used for compute shaders on 2nd Generation GCN hardware or newer.
  • DirectX11/OpenGL: Fix CSLDSInsts, CSVWriteInsts, CSVFetchInsts counter on 2nd Generation GCN hardware or newer.
  • DirectX 11: Fixed a crash that could occur on Multi-GPU systems.
  • OpenGL: rework the GPUTime implementation so that GPA no longer needs to stop/start any existing queries that the application may be using.
  • OpenCL™: Add FlatVMInsts and FlatLDSInsts counters to measure flat instructions used for OpenCL kernels on 2nd Generation GCN hardware or newer.
  • OpenCL™: Fix LDSInsts, VWriteInsts, VFetchInsts counter on 2nd Generation GCN hardware or newer.
  • OpenCL™: Fix MemUnitBusy counter on 2nd Generation GCN hardware or newer.
  • Fix a potential crash in GPA_SelectContext.
  • Fix a bug in GPA_DisableCounter that could result in the counter not actually getting disabled.

Version 2.15 (1/9/15)

  • Add support for additional GPUs and APUs.
  • Improved error handling in the various GPA_GetSample functions
  • Improved algorithm which splits counters into multiple passes.
  • DirectX 11/OpenGL: Added new counters to the Timing group to report total time taken for a particular shader type.
  • DirectX 11: Provided access to various D3D Queries as counters (see the D3D11 counter group).
  • DirectX 11: Fixed some memory leaks when running on pre-GCN hardware.
  • DirectX 11: Fixed some incorrect Compute Shader counter values.
  • DirectX 11: Fixed a crash that could occur when querying some counters on most recent hardware.

Version 2.14 (9/5/14)

  • Add support for additional GPUs and APUs.
  • DirectX 11: Fixed some memory leaks caused by incorrect reference counting of D3D11 devices and device contexts.
  • OpenGL: Improved memory consumption and performance during profiling.
  • OpenGL: Fixed crash in 32-bit Linux version.
  • OpenGL: Fixed CSMemUnitBusy, CSMemUnitStalled, and TexTriFilteringPct counters on Graphics IP v8 family of GPUs.
  • OpenGL: Fixed CSALUStalledByLDS counter on Graphics IP v7 and v8 families of GPUs.
  • Removed Support for DirectX 10.

Version 2.13 (4/17/14, updated 6/20/14)

  • 6/20/14: Add support for additional GPUs and APUs.
  • 6/20/14: OpenGL: Fixed some device support.
  • Add support for Volcanic Islands GPUs.
  • Add support for AMD Radeon R9 200 Series GPUs.
  • Add support for Kaveri-based APUs.
  • Add support for more FirePro, Mobility, and APU devices.
  • DirectX 10/DirectX 11: Fix an issue with collecting performance counters using the latest Catalyst Beta Drivers (14.1, 14.2 and 14.3).
  • OpenGL: Add support for Compute Shader counters.
  • OpenGL: New shader-stage-specific ALUBUsy and ALUInstCount counters (both vector and scalar) for Sea Islands GPUs and newer.
  • OpenGL: Improved Linux support.
  • OpenCL™: FastPath, CompletePath, and PathUtilization now report correct value on AMD Radeon HD 6000 series hardware and on Trinity/Richland-based APUs.
  • OpenCL™: Improved results when performing multi-pass profiling.

Version 2.11 (6/5/13)

  • Add support for AMD Radeon HD 8000 Series hardware.
  • Add support for more AMD Radeon HD 7000 Series devices.
  • Add support for more FirePro, Mobility, and APU devices.
  • Improved performance and counter results when large numbers of counters are enabled on AMD Radeon HD 7000 Series hardware and newer.
  • Improved counter results on systems with both an APU and a discrete GPU.
  • DirectX 11: CSFetchInsts, CSTexBusy, and CSALUFetchRatio now report correct value on AMD Radeon HD 5000-6000 series hardware.
  • OpenGL/DirectX 10/DirectX 11: TexUnitBusy counter now reports correct value on AMD Radeon HD 7000 Series hardware.
  • OpenCL™: WriteSize counter now reports correct value on AMD Radeon HD 7000 Series hardware.

Version 2.9 (1/18/12)

  • Add support for AMD Radeon 7000 Series hardware.
  • Add support for more FirePro, Mobility, and APU devices.
  • Improved memory consumption and performance during profiling.
  • Reduce memory footprint of the GPUPerfAPI DLLs.
  • Correct counters are now exposed on systems with dual GPUs of different hardware generations.
  • DirectX 10:Fixed shader related counters on AMD Radeon HD 2000-5000 Series hardware.
  • Fixed PrimitivesIn counter on AMD Radeon HD 2000 Series hardware.

Version 2.8 (4/22/11)

  • DirectX 10: Fixed GPUTime support on AMD Radeon HD 2000 and 3000 Series hardware.
  • DirectX 11: Fixed counter accessibility on AMD Radeon HD 2000 and 3000 Series hardware.

Version 2.7 (4/6/11)

  • New entrypoint for registering logging callback function for improved troubleshooting.
  • Adds support for AMD Radeon HD 6000 Series hardware.
  • New DepthAndStencil counters give more detailed understanding of HiZ behavior.
  • OpenGL: Improved accuracy of depth, texture, busy, and stalled counters.
  • OpenCL™: FetchSize counter now reports the correct value.
  • DirectX 10/11: Fixed support for AMD Radeon HD 4000 Series hardware; Fixed support for Catalyst 11.2 drivers on AMD Radeon HD 5000 Series hardware.

Version 2.5 (11/17/10)

  • Adds Linux support for OpenGL and OpenCL.
  • More consistent naming between OpenCL and DX Compute Shader counters.
  • Improved methods for identifying existing hardware.
  • DirectX 10/11: 1) Improved accuracy of DepthAndStencil counters, 2) Improved accuracy of ColorBuffer counters.
  • OpenCL™: New counters: FetchSize, CacheHit, LDSFetchInsts, LDSWriteInsts, FastPath, CompletePath, PathUtilization.
  • OpenGL: Fixed an issue that caused counters to not be available if the application is using queries; Adds tessellation related counters.

Version 2.3 (6/4/10)

  • Supports DirectX10, DirectX11, OpenGL on ATI Radeon 2000, 3000, 4000, and 5000 series.
  • Supports OpenCL™ on ATI Radeon 4000 and 5000 series.
  • Provides derived counters based on raw HW performance counters.
  • Manages memory automatically – no allocations required.
  • Requires ATI Catalyst driver 10.1 or later.