You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When doing ib_write_bw (a standard IB bandwidth testing tool without memory copy) and watch the corresponding memory bandwidth by pcm-memory ( of latest build 202502) as follows:
nodeA ip : 100.7.36.221, with ib port of mlx5_0 (400Gb) on numa0, and cpu is intel Xeon 6740E (BirthStream, Siera Forest)
nodeB ip : 100.7.36.223, with ib port of mlx5_0 (400Gb) on numa0, and cpu is intel Xeon 6740E (BirthStream, Siera Forest)
let nodeA do 'ib_write_bw' to nodeB:
nodeB(server): numactl -N 0 -m 0 ib_write_bw -d mlx5_0 -t 128 --run_infinitely
nodeA(client) : numactl -N 0 -m 0 ib_write_bw -d mlx5_0 -t 128 --run_infinitely 100.7.36.223
now ib_write_bw show the ib bandwidth is 44GB, and pcm-memory shows the memory bandwidth as 47GB, it's just ok.
let nodeB do 'ib_write_bw' to nodeA:
nodeA(server): numactl -N 0 -m 0 ib_write_bw -d mlx5_0 -t 128 --run_infinitely
nodeB(client) : numactl -N 0 -m 0 ib_write_bw -d mlx5_0 -t 128 --run_infinitely 100.7.36.221
now ib_write_bw show the ib bandwidth is also 44GB, but pcm-memory shows the memory bandwidth less than 1G, only a few hundred megabytes.
It looks like the pcm-memory does not account the memory band in last scenario 2.
Why? As i thought, the pcm-memory should account any memory bandwidth from cpu and pcie to memory ?
The text was updated successfully, but these errors were encountered:
Yes, just as you thought, it is caused by DDIO. 'pcm-pcie' -e shows 99% hits of 'ItoM' when case 2. thanks.
By the way, i find the 'pcm-pcie -h' shows that write events includes PCIeWiLf, PCIeItoM, PCIeNSWr, PCIeNSWrF, ItoM, RFO, but when i run 'pcm-pcie -e', there are only columes of 'PCIRdCur', 'ItoM', 'ItoMCacheNear', 'UCRdF', 'WiL', 'WCiL', 'WCiLF', which not correspond with the 'pcm-pcie -h' shows. Why?
When doing ib_write_bw (a standard IB bandwidth testing tool without memory copy) and watch the corresponding memory bandwidth by pcm-memory ( of latest build 202502) as follows:
nodeA ip : 100.7.36.221, with ib port of mlx5_0 (400Gb) on numa0, and cpu is intel Xeon 6740E (BirthStream, Siera Forest)
nodeB ip : 100.7.36.223, with ib port of mlx5_0 (400Gb) on numa0, and cpu is intel Xeon 6740E (BirthStream, Siera Forest)
nodeB(server): numactl -N 0 -m 0 ib_write_bw -d mlx5_0 -t 128 --run_infinitely
nodeA(client) : numactl -N 0 -m 0 ib_write_bw -d mlx5_0 -t 128 --run_infinitely 100.7.36.223
now ib_write_bw show the ib bandwidth is 44GB, and pcm-memory shows the memory bandwidth as 47GB, it's just ok.
nodeA(server): numactl -N 0 -m 0 ib_write_bw -d mlx5_0 -t 128 --run_infinitely
nodeB(client) : numactl -N 0 -m 0 ib_write_bw -d mlx5_0 -t 128 --run_infinitely 100.7.36.221
now ib_write_bw show the ib bandwidth is also 44GB, but pcm-memory shows the memory bandwidth less than 1G, only a few hundred megabytes.
It looks like the pcm-memory does not account the memory band in last scenario 2.
Why? As i thought, the pcm-memory should account any memory bandwidth from cpu and pcie to memory ?
The text was updated successfully, but these errors were encountered: