All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog and this project adheres to Semantic Versioning.
- Added
CRITERION_HOME
environment variable to set the directory for Criterion to store its results and charts in. - Added support for cargo-criterion. The long-term
goal here is to remove code from Criterion-rs itself to improve compile times, as well as to add
features to
cargo-criterion
that are difficult to implement in Criterion-rs. - Add sampling mode option for benchmarks. This allows the user to change how Criterion.rs chooses the iteration counts in each sample. By default, nothing will change for most benchmarks, but very slow benchmarks will now run fewer iterations to fit in the desired number of samples. This affects the statistics and plots generated.
- The serialization format for some of the files has changed. This may cause your first benchmark run after updating to produce errors, but they're harmless and will go away after running the benchmarks once.
- Fixed a bug where the current measurement was not shown on the relative regression plot.
- Fixed rare panic in the plotters backend.
- Panic with a clear error message (rather than panicking messily later on) when the user sets the group or function name to the empty string.
- Escape single quotes in benchmark names when generating Gnuplot scripts.
0.3.2 - 2020-04-26
- Added
?Sized
bound to benchmark parameter types, which allows dynamically sized types like&str
and&[T]
to be used as benchmark parameters. - Added the
--output-format <format>
command-line option. If--output-format bencher
is passed, Criterion.rs will print its measurements in a format similar to that used by thebencher
crate or unstablelibtest
benchmarks, and using similar statistical measurements as well. Though this provides less information than the default format, it may be useful for supporting tools which parse this output format. - Added
--nocapture
argument. This argument does nothing, but prevents Criterion.rs from exiting when running tests or benchmarks and allowing stdout output from other tests.
- Fixed panic when environment variables contains non-UTF8 characters.
- Fixed panic when
CRITERION_DEBUG
orCRITERION_TARGET_DIR
environment variables contain non-UTF8 characters.
0.3.1 - 2020-01-25
- Added new plotting backend using the
plotters
crate. Implementation generously provided by Hao Hou, author of theplotters
crate. - Added
--plotting-backend
command-line option to select the plotting backend. The existing gnuplot backend will be used by default when available, and the plotters backend will be used when gnuplot is not available or when requested. - Added
Criterion::plotting_backend()
function to configure the plotting backend in code. - Added
--load-baseline
command-line option to load a baseline for comparison rather than measuring the current code - Benchmark filters can now be regular expressions.
- Fixed
fibonacci
functions. - Fixed
#[criterion]
benchmarks ignoring the command-line options. - Fixed incorrect scaling of the violin plots.
- Don't print the recommended sample count if it's the same as the configured sample count.
- Fix potential panic when
nresamples
is set too low. Also added a warning against settingnresamples
too low. - Fixed issue where a slow outer closure would cause Criterion.rs to calculate the wrong estimated time and number of iterations in the warm-up phase.
0.3.0 - 2019-08-25
- Added support for plugging in custom measurements (eg. processor counters) into Criterion.rs' measurement and analysis.
- Added support for plugging in instrumentation for internal profilers such as
cpuprofiler
which must be explicitly started and stopped within the profiled process. - Added the
BenchmarkGroup
type, which supersedesParameterizedBenchmark
,Benchmark
,Criterion::bench_functions
,Criterion::bench_function_over_inputs
, andCriterion::bench
.BenchmarkGroup
performs the same function as all of the above, but is cleaner to use and more powerful and flexible. All of these types/functions are now soft-deprecated (meaning they're hidden from the documentation and should not be used in new code). They will be fully deprecated at some point in the 0.3.* series and removed in 0.4.0. iter_custom
- a "timing loop" that allows the caller to perform their own measurements. This is useful for complex measurements that don't fit into the usual mode of calling a lambda in a loop.- If the benchmark cannot be completed in approximately the requested measurement time, Criterion.rs will now print a suggested measurement time and sample size that would work.
- Two new fields,
throughput_num
andthroughput_type
have been added to theraw.csv
file. - Added command-line options to set the defaults for warm-up time, measurement-time, etc.
- The
raw.csv
file format has been changed slightly. Thesample_time_nanos
field has been split intosample_measured_value
andunit
fields to accommodate custom measurements. - Throughput has been expanded from u32 to u64 to accommodate very large input sizes.
- Fixed possible invalid file name error on Windows
- Fixed potential case where data for two different benchmarks would be stored in the same directory.
- Removed the
--measure-only
command-line argument; it was deprecated in favor of--profile-time
in 0.2.6. - External program benchmarks have been removed; they were deprecated in 0.2.6. The new
iter_custom
timing loop can be used as a substitute; seebenches/external_process.rs
for an example of this.
- The
--test
argument is now deprecated. To test benchmarks, usecargo test --benches
.
0.2.11 - 2019-04-08
- Enabled automatic text-coloring on Windows.
- Fixed panic caused by outdated files after benchmark names or types were changed.
- Reduced timing overhead of
Criterion::iter_batched/iter_batched_ref
.
0.2.10 - 2019-02-09
- Added
iter_batched/iter_batched_ref
timing loops, which allow for setup (likeiter_with_setup/iter_with_large_setup
) and exclude drop (likeiter_with_large_drop
) but measure the runtime more accurately, use less memory and are more flexible.
iter_with_setup/iter_with_large_setup
are now deprecated in favor ofiter_batched
.
0.2.9 - 2019-01-24
- Criterion.rs no longer depends on the default features of the
rand-core
crate. This fixes some downstream crates which userand
in ano_std
context.
0.2.8 - 2019-01-20
- Criterion.rs now uses
rayon
internally instead of manualunsafe
code built with thread-scoped. - Replaced handlebars templates with TinyTemplate
- Merged
criterion-stats
crate intocriterion
crate.criterion-stats
will no longer receive updates. - Replaced or removed various other dependencies to reduce the size of Criterion.rs' dependency tree.
0.2.7 - 2018-12-29
- Fixed version numbers to prevent incompatibilities between
criterion
andcriterion-stats
crates.
0.2.6 - 2018-12-27 - Yanked
- Added
--list
command line option, which lists the benchmarks but does not run them, to matchcargo test -- --list
. - Added README/CONTRIBUTING/LICENSE files to sub-crates.
- Displays change in throughput in the command-line and HTML output as well as change in iteration time.
- Benchmarks with multiple functions and multiple values will now generate a per-value summary report file in addition to the existing per-function one.
- Added a
--profile-time
command-line argument which disables reporting and analysis and instead simply iterates each benchmark for approximately the given number of seconds. This supersedes the (now-deprecated)--measure-only
argument.
- Functions passed to
Bencher::iter_with_large_setup
can now return output. This is necessary to prevent the compiler from optimizing away the benchmark. This is technically a breaking change - that function requires a new type parameter. It's so unlikely to break existing code that I decided not to delay this for a breaking-change release. - Reduced measurement overhead for the
iter_with_large_setup
anditer_with_drop
methods. criterion_group
andcriterion_main
macros no longer require theCriterion
struct to be explicitly imported.- Don't panic when
gnuplot --version
fails. - Criterion.rs macros no longer require user to
use criterion::Criterion;
- Criterion.rs no longer initializes a logger, meaning that it will no longer conflict with user code which does.
- Criterion.rs no longer fails to parse gnuplot version numbers like
gnuplot 5.2 patchlevel 5a (Gentoo revision r0)
- Criterion.rs no longer prints an error message that gnuplot couldn't be found when chart
generation is disabled (either by
Criterion::without_plots
,--noplot
or disabling the HTML reports feature) - Benchmark names are now automatically truncated to 100 characters and a number may be added to make them unique. This fixes a problem where gnuplot would crash if the title was extremely long, and also improves the general usability of Criterion.rs.
- Changed timing model of
iter_with_large_setup
to exclude time spent dropping values returned by the routine. Time measurements taken with 0.2.6 using these methods may differ from those taken with 0.2.5. - Benchmarks with multiple functions and multiple values will now appear as a table rather than a tree in the benchmark index. This is to accommodate the new per-value summary reports.
- Deprecated the
--measure-only
command-line-argument in favor of--profile-time
. This will be removed in 0.3.0. - External-program benchmarks are now deprecated. They will be removed in 0.3.0.
- The
html_reports
cargo feature is now deprecated. This feature will become non-optional in 0.3.0. - Sample sizes less than 10 are deprecated and will be disallowed in 0.3.0.
- This is not an exhaustive list - the full scope of changes in 0.3.0 is not yet determined. There may be breaking changes that are not listed here.
0.2.5 - 2018-08-27
- Fixed links from generated report files to documentation.
- Fixed formatting for very large percentage changes (>1000%)
- Sorted the benchmarks in the index report by name
- Fixed case where benchmark ID with special characters would cause Criterion.rs to open the wrong file and log an error message.
- Fixed case where running
cargo clean; cargo bench -- <filter>
would cause Criterion.rs to log an error message. - Fixed a GNUplot error message when sample size is very small.
- Fixed several cases where Criterion.rs would generate invalid path names.
- Fixed a bug where Criterion.rs would print an error if run with a filter that allowed no benchmarks and a clean target directory.
- Fixed bug where some benchmarks didn't appear in the benchmark index report.
- Criterion.rs now honors the
CARGO_TARGET_DIR
environment variable.
- Criterion.rs will generate a chart showing the effects of changes in input (or input size) for all benchmarks with numeric inputs or throughput, not just for those which compare multiple functions.
0.2.4 2018-07-08
- Added a pair of flags,
--save-baseline
and--baseline
, which change how benchmark results are stored and compared. This is useful for working against a fixed baseline(eg. comparing progress on an optimization feature branch to the commit it forked from). Default behavior of Criterion.rs is now--save-baseline base
which emulates the previous, user facing behavior.--save-baseline
saves the benchmark results under the provided name.--baseline
compares the results to a saved baseline. If the baseline does not exist for a benchmark, an error is given.
- Added user-guide documentation for baselines, throughput measurements and plot configuration.
- Added a flag,
--test
, which causes Criterion to execute the benchmarks once without measuring or reporting the results. This is useful for checking that the benchmarks run successfully in a CI setting. - Added a
raw.csv
file to the output which contains a stable, machine-readable representation of the measurements taken by benchmarks. This enables users to perform their own analysis or keep historical information without depending on private implementation details.
- The
sample_size
method on theCriterion
,Benchmark
andParameterizedBenchmark
structs has been changed to panic if the sample size is less than 2. Other parts of the code require this and will panic if the sample size is 1, so this is not considered to be a breaking change. - API documentation has been updated to show more-complete examples.
- Certain characters will now be replaced with underscores when creating benchmark directory paths, to avoid generating invalid or unexpected paths.
0.2.3 - 2018-04-14
- Criterion.rs will now panic with a clear error message if the user attempts to run
a benchmark which doesn't call the
Bencher::iter
function or a related function, rather than failing in an uncontrolled manner later. - Fixed broken links in some more summary reports.
- Added a
--measure-only
argument which causes the benchmark executable to run the warmup and measurement and then move on to the next benchmark without analyzing or saving data. This is useful to prevent Criterion.rs' analysis code from appearing in profile data when profiling benchmarks. - Added an index report file at "target/criterion/report/index.html" which links to the other reports for easy navigation.
0.2.2 - 2018-03-25
- Fixed broken links in some summary reports.
- Work around apparent rustc bug in >= 1.24.0.
0.2.1 - 2018-02-24
- HTML reports are now a default Cargo feature. If you wish to disable HTML reports, disable Criterion.rs' default features. Doing so will allow compatibility with older Rust versions such as 1.20. If you wish to continue using HTML reports, you don't need to do anything.
- Added a summary report for benchmarks that compare multiple functions or different inputs.
- The plots and HTML reports are now generated in a
report
folder.
- Underscores in benchmark names will no longer cause subscripted characters to appear in generated plots.
0.2.0 - 2018-02-05
- Added
Criterion.bench
function, which accepts either aBenchmark
orParameterizedBenchmark
. These new structures allow for custom per-benchmark configuration as well as more complex benchmark grouping (eg. comparing a Rust function against an external program over a range of inputs) which was not possible previously. - Criterion.rs can now report the throughput of the benchmarked code in units of
bytes or elements per second. See the
Benchmark.throughput
andParameterizedBenchmark.throughput
functions for further details. - Criterion.rs now generates a basic HTML report for each benchmark.
- Added
--noplot
command line option to disable plot generation.
- The builder methods on the Criterion struct now take and return self by value
for easier chaining. Functions which configure a Criterion structure will need
to be updated accordingly, or will need to be changed to work with the
Benchmark
orParameterizedBenchmark
types to do per-benchmark configuration instead. - The closures taken by
Criterion.bench_*
must now have a'static
lifetime. This means that you may need to change your closures from|bencher| {...}
tomove |bencher| {...}
. Criterion.bench_functions
now takesI
as an input parameter, not&I
.- Input values must now implement
Debug
rather thanDisplay
. - The generated plots are stored in
target/criterion
rather than.criterion
.
- The hidden
criterion::ConfidenceInterval
andcriterion::Estimate
types are no longer publicly accessible. - The
Criterion.summarize
function has been removed.
- Fixed the relative mean and median reports.
- Fixed panic while summarizing benchmarks.
0.1.2 - 2018-01-12
- Criterion.rs is now stable-compatible!
- Criterion.rs now includes its own stable-compatible
black_box
function. Some benchmarks may now be affected by dead-code-elimination where they previously weren't and may have to be updated. - Criterion.rs now uses
serde
to save results. Existing results files will be automatically removed when benchmarks are run. - Redesigned the command-line output to highlight the important information and reduce noise.
- Running benchmarks with the variable "CRITERION_DEBUG" in the environment will cause Criterion.rs to generate extra debug output and save the gnuplot scripts alongside the generated plots.
- Don't panic on IO errors or gnuplot failures
- Fix generation of invalid gnuplot scripts when benchmarking over inputs and inputs include values <= 0.
- Bug where benchmarks would run one sample fewer than was configured.
- Generated plots will no longer use log-scale.
0.1.1 - 2017-12-12
- A changelog file.
- Added a chapter to the book on how Criterion.rs collects and analyzes data.
- Added macro rules to generate a test harness for use with
cargo bench
. Benchmarks defined without these macros should continue to work. - New contribution guidelines
- Criterion.rs can selectively run benchmarks. See the Command-line page for more details
- Initial release on Crates.io.