Track library performance over time #163
Labels: documentation, help wanted, important
Benchmarks should serve three functions: (1) prevent performance regressions when changes are made to the library's core, (2) give potential users an idea of what to expect when they use the library on their problem, and (3) inform users about the trade-offs of each algorithm.
We should thus select a few problems that are notoriously hard to sample from and set up a few benchmarks. The selection should include problems where one (family of) algorithm(s) clearly outperforms all others to fulfill goal (3), as well as common problems to fulfill goal (2). We should measure raw speed, but also other important metrics that assess sampling quality, such as the number of effective samples per second (ESS/s).
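As a rough sketch of how an ESS/s metric could be computed (the `run_inference` callable is a hypothetical stand-in for an actual sampling run, and using ArviZ for the ESS estimate is an assumption, not something decided here):

```python
import time

import arviz as az
import numpy as np


def ess_per_second(run_inference, num_samples=1_000, num_chains=4):
    """Time a sampler and report effective samples per second.

    `run_inference` is a hypothetical callable returning posterior samples
    with shape (num_chains, num_samples); ArviZ is used here only as one
    possible way to estimate the effective sample size.
    """
    start = time.perf_counter()
    samples = run_inference(num_samples=num_samples, num_chains=num_chains)
    elapsed = time.perf_counter() - start

    # az.ess expects the leading dimensions to be (chain, draw).
    ess = float(az.ess(np.asarray(samples)).to_array().mean())
    return ess / elapsed
```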
The repo already uses `pytest-benchmark` to run a few benchmarks, but this is rather useless since the results are not stored. In order to keep track of the history and display the results we can use this GitHub action.
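For reference, a minimal sketch of the kind of `pytest-benchmark` test this would build on (the model name and the `sample_eight_schools` placeholder are illustrative, not taken from the existing benchmarks):

```python
# tests/test_benchmarks.py -- minimal pytest-benchmark sketch.
import numpy as np


def sample_eight_schools(num_samples: int) -> np.ndarray:
    # Placeholder standing in for an actual inference run on a benchmark model.
    rng = np.random.default_rng(0)
    return rng.normal(size=(4, num_samples))


def test_eight_schools_sampling(benchmark):
    # The `benchmark` fixture times the callable repeatedly and records
    # statistics that a CI action can archive over time.
    samples = benchmark(sample_eight_schools, 1_000)
    assert samples.shape == (4, 1_000)
```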
In the `benchmark` branch of this repo there is an example of model implementation and data fetching in the `/benchmark` directory. We will need to add a data structure that specifies which algorithms can be run on this example with which parameter values; a sketch is given below.

The results should be computed at every merge on `main` and should be displayed in the documentation (and not be erased by a new build): the documentation build should not erase previous benchmarks, and benchmark publishing should not erase the docs.