
chore: Add basic benchmark suite to C library #393

Merged: 6 commits into apache:main on Mar 7, 2024

Conversation

paleolimbot (Member) commented Feb 28, 2024

This PR adds an initial set of benchmarks covering some realistic usage patterns. The general approach is to use doxygen comments to document the benchmarks, which will run against the released version and the previous version. I'm not sure exactly what the output format will be, but I'd like the benchmarks to be written in such a way that there's a path to programmatically generating a report (maybe using conbench, maybe just a Quarto document).

Work in progress!
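(Editor's sketch, not code from this PR: the approach described above pairs a doxygen comment with a Google Benchmark definition, and the doxygen text is what a report generator could harvest. The benchmark name and body below are hypothetical stand-ins.)

  #include <cstdint>
  #include <numeric>
  #include <vector>

  #include <benchmark/benchmark.h>

  /// \brief Sum one million int32 elements
  ///
  /// A hypothetical stand-in for the element-access loops the real
  /// ArrayView benchmarks time; this doxygen comment is the documentation
  /// a report could be generated from.
  static void ArrayViewGetIntInt32(benchmark::State& state) {
    std::vector<int32_t> values(1000000);
    std::iota(values.begin(), values.end(), 0);
    for (auto _ : state) {
      int64_t sum = 0;
      for (const int32_t value : values) sum += value;
      benchmark::DoNotOptimize(sum);
    }
    // Produces the items_per_second user counter visible in the CI output
    state.SetItemsProcessed(state.iterations() * values.size());
  }

  BENCHMARK(ArrayViewGetIntInt32);
  BENCHMARK_MAIN();

(Google Benchmark can also emit machine-readable results, e.g. via --benchmark_format=json, which is one plausible input for the report generation mentioned above.)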

pitrou (Member) commented Feb 28, 2024

Can you show what the results look like?

paleolimbot (Member Author) commented:

This is from the CI run (so timings are maybe meaningless), but the output looks like:

  ::group::array_benchmark
  2024-02-28T17:43:48+00:00
  Running ./array_benchmark
  Run on (4 X 3139.35 MHz CPU s)
  CPU Caches:
    L1 Data 32 KiB (x2)
    L1 Instruction 32 KiB (x2)
    L2 Unified 512 KiB (x2)
    L3 Unified 32768 KiB (x1)
  Load Average: 0.89, 0.34, 0.12
  -------------------------------------------------------------------------------------------------
  Benchmark                                       Time             CPU   Iterations UserCounters...
  -------------------------------------------------------------------------------------------------
  BM_ArrayViewGetIntUnsafeInt8              1576584 ns      1576545 ns          449 items_per_second=634.298M/s
  BM_ArrayViewGetIntUnsafeInt16              936609 ns       936540 ns          749 items_per_second=1.06776G/s
  BM_ArrayViewGetIntUnsafeInt32             1244619 ns      1244574 ns          562 items_per_second=803.488M/s
  BM_ArrayViewGetIntUnsafeInt64              945470 ns       945435 ns          745 items_per_second=1.05771G/s
  BM_ArrayViewGetIntUnsafeInt64CheckNull    1751277 ns      1751243 ns          396 items_per_second=571.023M/s

  ::group::schema_benchmark
  2024-02-28T17:43:52+00:00
  Running ./schema_benchmark
  Run on (4 X 3241.55 MHz CPU s)
  CPU Caches:
    L1 Data 32 KiB (x2)
    L1 Instruction 32 KiB (x2)
    L2 Unified 512 KiB (x2)
    L3 Unified 32768 KiB (x1)
  Load Average: 0.90, 0.35, 0.13
  --------------------------------------------------------------------------------------
  Benchmark                            Time             CPU   Iterations UserCounters...
  --------------------------------------------------------------------------------------
  BM_SchemaInitWideStruct         768760 ns       768689 ns          911 items_per_second=13.0092M/s
  BM_SchemaViewInitWideStruct     175154 ns       175138 ns         4202 items_per_second=57.0978M/s

A review comment on the benchmark registrations:

  BENCHMARK(BM_ArrayViewGetIntUnsafeInt16);
  BENCHMARK(BM_ArrayViewGetIntUnsafeInt32);
  BENCHMARK(BM_ArrayViewGetIntUnsafeInt64);
  BENCHMARK(BM_ArrayViewGetIntUnsafeInt64CheckNull);

Hmm, in Arrow C++ we don't start the benchmarks with BM_. Is that necessary?

paleolimbot (Member Author) replied:

Good catch! I think it was rather contagious copy/paste from the first example in the benchmark library README 😬
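(Editor's note, for context: Google Benchmark's BENCHMARK macro registers any function with the void(benchmark::State&) signature, so the BM_ prefix is only a convention from the library's README. A minimal hypothetical illustration:)

  #include <benchmark/benchmark.h>

  // Any function name is accepted; BM_ is a README convention, not a
  // requirement of the BENCHMARK registration macro.
  static void ArrayViewGetIntUnsafeInt32(benchmark::State& state) {
    for (auto _ : state) {
      // benchmark body elided in this sketch
    }
  }

  BENCHMARK(ArrayViewGetIntUnsafeInt32);
  BENCHMARK_MAIN();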

pitrou (Member) commented Feb 29, 2024

> This is from the CI run (so timings are maybe meaningless), but the output looks like:

Thank you! This looks fine to me.

paleolimbot marked this pull request as ready for review March 7, 2024 18:46.
paleolimbot merged commit 5756b76 into apache:main on Mar 7, 2024; 32 checks passed.
paleolimbot deleted the c-benchmarks branch March 7, 2024 19:19.
eddelbuettel pushed a commit to eddelbuettel/arrow-nanoarrow that referenced this pull request Apr 10, 2024.
paleolimbot added this to the nanoarrow 0.5.0 milestone May 22, 2024.