Skip to content

Commit

Permalink
Update MD in README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
Andi Kleen committed Dec 6, 2023
1 parent 8d767ee commit 538f31e
Showing 1 changed file with 16 additions and 16 deletions.
32 changes: 16 additions & 16 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -128,27 +128,27 @@ on newer Linux kernels.
* toplev updated to TMA 4.7:
* New --hbm-only for sprmax in HBM Only mode. toplev currently cannot auto detect this condition.
* New Models
** SPR-HBM: model for Intel Xeon Max (server) processor covering HBM-only mode (on top of cache mode introduced in 4.6 release)
* SPR-HBM: model for Intel Xeon Max (server) processor covering HBM-only mode (on top of cache mode introduced in 4.6 release)
* New Features
** Releasing the Bottlenecks View - a rather complete version [SKL onwards]
*** Bottlenecks View is An abstraction or summarization of the 100+ TMA tree nodes into a 12-entry vector of familiar performance issues, presented under the Info.Bottlenecks section.
** This release introduces Core_Bound_Est metric: An estimation of total pipeline cost when the execution is compute-bound.
** Besides, balanced distrubtion among Branching Retired, Irregular_Overhead, Mispredictions and Instruction_Fetch_BW as well as
** enhanced Cache_Memory_Latency to account for Stores info better accuracy.
* Releasing the Bottlenecks View - a rather complete version [SKL onwards]
* Bottlenecks View is An abstraction or summarization of the 100+ TMA tree nodes into a 12-entry vector of familiar performance issues, presented under the Info.Bottlenecks section.
* This release introduces Core_Bound_Est metric: An estimation of total pipeline cost when the execution is compute-bound.
* Besides, balanced distrubtion among Branching Retired, Irregular_Overhead, Mispredictions and Instruction_Fetch_BW as well as
* enhanced Cache_Memory_Latency to account for Stores info better accuracy.
* New Tree Metrics (nodes)
** HBM_Bound: stalls due to High Bandwidth Memory (HBM) accesses by loads.
* Informative Metrics (see spreadsheet for descriptions)
* HBM_Bound: stalls due to High Bandwidth Memory (HBM) accesses by loads.
* Informative Metrics
* New: Uncore_Frequency in server models
* New: IpPause [CFL onwards]
* Key Enhancements & fixes
** Hoisted Serializing_Operation and AMX_Busy to level 3; directly under Core Bound [SKL onwards]
** Swapped semantics of ILP (becomes per-thread) and Execute (per physical core) info metrics
** Moved Nop_Instructions to Level 4 under Other_Light_Op [SKL onwards]
** Moved Shuffles_256b to Level 4 under Other_Light_Op [ADL onwards]
** Renamed Local/Remote_DRAM to Local/Remote_MEM to account for HBM too
** Reduced # events when SMT is off [all]
** Reduced # events for HBM metrics; fixed MEM_Bandwidth/Latency descriptions [SPR-HBM]
** Tuned Threshold for: Branching_Overhead; Fetch_Bandwidth, Ports_Utilized_3m
* Hoisted Serializing_Operation and AMX_Busy to level 3; directly under Core Bound [SKL onwards]
* Swapped semantics of ILP (becomes per-thread) and Execute (per physical core) info metrics
* Moved Nop_Instructions to Level 4 under Other_Light_Op [SKL onwards]
* Moved Shuffles_256b to Level 4 under Other_Light_Op [ADL onwards]
* Renamed Local/Remote_DRAM to Local/Remote_MEM to account for HBM too
* Reduced # events when SMT is off [all]
* Reduced # events for HBM metrics; fixed MEM_Bandwidth/Latency descriptions [SPR-HBM]
* Tuned Threshold for: Branching_Overhead; Fetch_Bandwidth, Ports_Utilized_3m

* toplev has new options:
* --node-metrics or -N collects and shows metrics related to selected TMA nodes if their nodes
Expand Down

0 comments on commit 538f31e

Please sign in to comment.