Improve implementation of RealSum #352

minnerbe · 2024-01-03T21:41:13Z

Problem

While profiling an application that uses ImgLib2, I learned that multiple threads concurrently summing up large arrays using net.imglib2.util.RealSum are responsible for a big chunk of the CPU time.

Proposed changes

Subsequently, I researched numerically stable ways of summing up floating-point numbers, and learned that the current implementation could be improved by using a variant of the Kahan summation algorithm (Neumaier's algorithm).

I implemented this algorithm in RealSum, keeping the API stable. Since the new implementation no longer has a capacity, I also deprecated the constructor that takes an int and deleted the corresponding test.
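For readers unfamiliar with the technique, here is a minimal sketch of Neumaier's variant of Kahan summation. The class name NeumaierSum and its methods are hypothetical and chosen for illustration only; this is not the RealSum source from this PR. The idea is to keep a separate compensation term that accumulates the low-order bits lost in each addition, and to pick the formula for the lost bits depending on which operand is larger in magnitude.

```java
/**
 * Illustrative sketch of Neumaier's compensated summation.
 * Hypothetical class, NOT the actual net.imglib2.util.RealSum implementation.
 */
final class NeumaierSum {

    /** Running sum as computed in plain double arithmetic. */
    private double sum = 0.0;

    /** Accumulated rounding error that was lost from the running sum. */
    private double compensation = 0.0;

    /** Adds a value and records the rounding error of that addition. */
    void add(final double value) {
        final double t = sum + value;
        if (Math.abs(sum) >= Math.abs(value)) {
            // Low-order bits of value were lost in t; recover them.
            compensation += (sum - t) + value;
        } else {
            // Low-order bits of sum were lost in t; recover them.
            compensation += (value - t) + sum;
        }
        sum = t;
    }

    /** Returns the running sum with the compensation applied once. */
    double getSum() {
        return sum + compensation;
    }
}
```

As a usage example, adding 1.0, 1e100, 1.0 and -1e100 in that order yields 2.0 with this sketch, whereas a plain double accumulator yields 0.0 because both 1.0 terms are absorbed by 1e100.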

Improvements

I added a simple test that fails for the current implementation, but passes for the proposed implementation using the improved Kahan algorithm. Furthermore, the new implementation is roughly 2.3 times faster than the old one in my benchmark. These are the results from a benchmark adding 10M random numbers (average time per operation, lower is better):

Benchmark                               Mode  Cnt   Score   Error  Units
CompensatedSummation.newImplementation  avgt    5  12.175 ± 0.413  ms/op
CompensatedSummation.oldImplementation  avgt    5  27.491 ± 2.794  ms/op

Further suggestions

Looking at RealSumTest, I'm not entirely sure what the point of the testDoubleSum() and testDoubleAdd() methods is. It looks to me as if they only assert that the reference result obtained from summing BigDecimals "does the correct thing". Am I missing something here? If my interpretation is correct, I suggest deleting those tests since they are not related to RealSum at all.
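To illustrate what a BigDecimal reference sum means in this context, here is a small sketch. The class ReferenceSumExample and its methods are hypothetical and are not taken from RealSumTest; the sketch only shows the general pattern of computing an exact reference with BigDecimal and comparing a double-based sum against it.

```java
import java.math.BigDecimal;

/**
 * Illustrative sketch of a BigDecimal reference sum.
 * Hypothetical helper, NOT code from RealSumTest.
 */
final class ReferenceSumExample {

    private ReferenceSumExample() {}

    /** Sums the values exactly and rounds to double only once at the end. */
    static double referenceSum(final double[] values) {
        BigDecimal total = BigDecimal.ZERO;
        for (final double v : values) {
            // new BigDecimal(double) converts the binary value exactly.
            total = total.add(new BigDecimal(v));
        }
        return total.doubleValue();
    }

    /** Plain left-to-right double summation, rounding after every addition. */
    static double naiveSum(final double[] values) {
        double total = 0.0;
        for (final double v : values) {
            total += v;
        }
        return total;
    }
}
```

A test of a summation class would compare its result against referenceSum(values). A test that only compares naiveSum(values) or a BigDecimal sum against another BigDecimal-based value exercises the reference itself rather than the class under test.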

minnerbe · 2024-02-26T14:23:28Z

Hi @tpietzsch, have you had a chance to look at this PR already? I'm happy to also work on the further suggestions (which are trivial, anyway).

tpietzsch · 2024-03-06T12:53:50Z

Looks great!

I agree: those tests can be removed. Also, I would remove the for ( int t = 0; t < 20; ++t ) loop from testAdd(). (Maybe the test was also used for benchmarking, which would also explain the presence of the testDoubleSum() and testDoubleAdd() methods.)

Could you add your JMH benchmark too? (just benchmarking RealSum, not old vs new implementation)

minnerbe · 2024-03-12T21:39:45Z

Thanks for the feedback, @tpietzsch! Following your suggestions, I cleaned up the test and added my benchmark.

The benchmark compares RealSum with naive double summation, but I'm happy to remove the latter if you don't want it there. The parameters are taken from the only other benchmark in that directory (FlatCollectionsBenchmark).

Let me know if I can do anything else.

tpietzsch · 2024-03-28T15:23:59Z

Awesome, thanks!

minnerbe added 3 commits January 3, 2024 16:01

Add test that fails for current implementation

27bf8ff

Swap implementation of RealSum to Neumaier's algorithm

c3cb8c6

Delete test for deprecated constructor

bd690ff

minnerbe added 4 commits March 12, 2024 17:06

Delete double tests methods

3568e4a

Remove unnecessary loop in summation test

41ddf0e

Add more description to and change name of hard example

193142a

Add benchmark for RealSum

a8bd278

tpietzsch merged commit 0879407 into imglib:master Mar 28, 2024
1 check passed
