Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
test/bench: osu-style bcast benchmark
The barrier often does not exit uniformly especially if node-topology is in play. This affects different collectie algorithms differently, thus using the combined latency doesn't hides too much details for algorithm comparisons. The osu microbenchmarks measures collective latency individually then reduce for min, max, and average. Why it is still suceptible to barrier behavior, it does provide more details for some insights comparing different algorithms.
- Loading branch information