bloom filter bulk inserts and queries #723


Open

dcoutts wants to merge 6 commits into main from dcoutts/bloomfilter-blocked

Conversation

dcoutts (Collaborator) commented May 15, 2025

Two changes, both for bulk operations on Bloom filters.

For Bloom filter inserts in run accumulation: instead of inserting keys into the bloom filter one by one as they are added to the run accumulator, save them up and add them all in one go when the page is being finalised. This lets us use a bloom filter bulk insert, which in turn lets us use memory prefetching.
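
In code, the shape of the change is roughly the following. This is a minimal sketch with placeholder types and made-up names (`MBloom`, `insertMany`, `prefetchInsert`, the record fields); it is not the library's actual API, only the accumulate-then-bulk-insert pattern:

```haskell
{-# LANGUAGE BangPatterns #-}
module RunAccSketch where

import Control.Monad (forM_, when)
import Control.Monad.ST (ST)
import Data.STRef (STRef, modifySTRef', readSTRef, writeSTRef)
import qualified Data.Vector as V

data MBloom s k = MBloom              -- stand-in for a mutable Bloom filter
data SerialisedKey = SerialisedKey    -- stand-in for the real key type

prefetchInsert :: MBloom s k -> k -> ST s ()  -- touch the words the key hashes to
prefetchInsert _ _ = pure ()

insertKey :: MBloom s k -> k -> ST s ()       -- ordinary single-key insert
insertKey _ _ = pure ()

-- Bulk insert: issue the prefetch for key (i + d) while inserting key i, so
-- the memory latency of later keys overlaps with the hashing of earlier ones.
insertMany :: MBloom s k -> V.Vector k -> ST s ()
insertMany b keys = do
    let !n = V.length keys
        !d = 4                        -- prefetch distance, a tuning parameter
    forM_ [0 .. n - 1] $ \i -> do
      when (i + d < n) $ prefetchInsert b (keys V.! (i + d))
      insertKey b (keys V.! i)

-- Accumulator side: keys are only buffered while a page is being built, and
-- the filter is updated with one bulk insert when the page is finalised.
data RunAcc s = RunAcc
  { raBloom       :: MBloom s SerialisedKey
  , raPendingKeys :: STRef s [SerialisedKey]
  }

addKey :: RunAcc s -> SerialisedKey -> ST s ()
addKey acc k = modifySTRef' (raPendingKeys acc) (k :)

finalisePage :: RunAcc s -> ST s ()
finalisePage acc = do
    ks <- readSTRef (raPendingKeys acc)
    insertMany (raBloom acc) (V.fromList (reverse ks))
    writeSTRef (raPendingKeys acc) []
```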

For Bloom filter queries in key lookups, update the existing bulk query code to properly take advantage of the new API and use prefetching. We can now also simplify and use a single high performance implementation, rather than needing two (a more compatible one and a faster one that relied on fancier features available in later GHC versions).

Results for the WP8 benchmark (100M elements, 10 bits per key, full caching, 10k batches of 256 keys):

  • baseline: 88,871 ops/sec
  • with bulk inserts: 97,152 ops/sec, ~9% improvement
  • with bulk queries: 103,005 ops/sec, ~6% additional, ~16% cumulative improvement

So overall about a 16% improvement in ops/sec on the primary WP8 benchmark, and as a bonus, getting over the magic 100k ops/sec threshold (on my laptop).

jorisdral (Collaborator) left a comment

Some initial comments. I have yet to look at the bulk insert code.

jorisdral (Collaborator) left a comment

LGTM! If we fix the test failure that I noted in my previous review, then we're good to go.

dcoutts (Collaborator, Author) commented May 15, 2025

CI failure due to flaky test fixed in #724.

dcoutts force-pushed the dcoutts/bloomfilter-blocked branch 2 times, most recently from 0c341bb to 09afe55 on May 16, 2025 00:04
dcoutts and others added 6 commits on May 16, 2025 01:05
Instead of inserting keys into the bloom filter one by one as they are
added to the run accumulator, save them up and add them all in one go
when the page is being finalised.

This then lets us use a bloom filter bulk insert, which lets us use
memory prefetching.

The result should be faster.

Fetch into the caches in the least intrusive way, "0" levels, rather
than "3" levels. This does not appear to slow down inserts, and should
evict fewer things from the caches.

And document what level "0" means and why we use it.
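
For reference, GHC's prefetch primops come in four locality levels (0–3), following the convention of C's `__builtin_prefetch`. A minimal illustration of the difference between levels 0 and 3, with made-up wrapper names (the actual code may call the primops or a C prefetch directly):

```haskell
{-# LANGUAGE MagicHash, UnboxedTuples #-}
module PrefetchSketch (prefetchLow, prefetchHigh) where

import GHC.Exts (ByteArray#, Int#, prefetchByteArray0#, prefetchByteArray3#)
import GHC.IO (IO (IO))

-- Locality level 0: fetch the cache line, but hint that it will not be reused
-- for long, so it can be evicted early and displaces less of what is already
-- cached.  This is the "least intrusive" level referred to above.
prefetchLow :: ByteArray# -> Int# -> IO ()
prefetchLow ba i = IO (\s -> (# prefetchByteArray0# ba i s, () #))

-- Locality level 3: fetch into all cache levels and try to keep the line
-- resident; more intrusive, better suited to data reused repeatedly.
prefetchHigh :: ByteArray# -> Int# -> IO ()
prefetchHigh ba i = IO (\s -> (# prefetchByteArray3# ba i s, () #))
```
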
`bloomQueriesModel` was not really a proper model, because the model itself was
using actual bloom filters. The model is now instead a `Set`, and
`prop_bloomQueriesModel` is updated because the model will now only return true
positives and negatives.
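
Schematically, the Set-based model and the relation the updated property can check look like this (the signature of `bloomQueriesModel` here is made up; the real definitions live in the test suite):

```haskell
module BloomModelSketch where

import qualified Data.Set as Set

-- The model: each run's filter is modelled by the plain Set of keys inserted
-- into it, so a model query yields only true positives and true negatives.
bloomQueriesModel :: Ord k => [Set.Set k] -> [k] -> Set.Set (Int, Int)
bloomQueriesModel filters keys =
  Set.fromList
    [ (ki, fi)
    | (ki, k) <- zip [0 ..] keys
    , (fi, f) <- zip [0 ..] filters
    , k `Set.member` f
    ]

-- Real Bloom filters may report false positives but never false negatives,
-- so the property can only require that the model's hits are contained in
-- the real query's hits, e.g.
--
--   bloomQueriesModel sets keys `Set.isSubsetOf` realHits
```
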
Previously, for the Classic Bloom filter implementation we had two
different implementations of bloomQueries: one that was relatively
simple and didn't rely on anything fancy, and one that went all out to
maximise performance.

The high performance one had to be disabled when we added the
block-structured bloom filter implementation, since it was tightly
coupled to that implementation.

With the new bloom filter API and implementation, we can now implement a
single high performance version of bulk query. We no longer need
separate higher and lower performance versions, since we no longer need
to rely on fancy features like unlifted boxed arrays.

So strip out the bloom-query-fast cabal flag and the BloomFilterQuery2
module.

The updated BloomFilterQuery1 does a simple nested loop, but also does
prefetching. The prefetch distance is the number of runs, which is
proportional to the number of levels, and is typically modest.
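
A schematic version of that loop, with placeholder names (`Bloom`, `elemQ`, `prefetchElem`) rather than the library's actual API: while the filters are probed for key i, the probe locations for key i+1 are prefetched, which is what makes the effective prefetch distance equal to the number of runs:

```haskell
module BulkQuerySketch where

import Control.Monad (forM, when)
import qualified Data.Vector as V

data Bloom k = Bloom                    -- stand-in for an immutable Bloom filter

elemQ :: Bloom k -> k -> Bool           -- probe the filter (placeholder)
elemQ _ _ = True

prefetchElem :: Bloom k -> k -> IO ()   -- prefetch the words the probe will read
prefetchElem _ _ = pure ()

-- Nested loop over keys and run filters.  The prefetches for key (i + 1) are
-- issued while key i is being probed, so the prefetch distance, measured in
-- inner-loop iterations, equals the number of runs.
bloomQueries :: V.Vector (Bloom k) -> V.Vector k -> IO [(Int, Int)]
bloomQueries filters keys = do
    let nk = V.length keys
    fmap concat $ forM [0 .. nk - 1] $ \i -> do
      when (i + 1 < nk) $
        V.forM_ filters (\f -> prefetchElem f (keys V.! (i + 1)))
      pure [ (i, j)
           | (j, f) <- zip [0 ..] (V.toList filters)
           , elemQ f (keys V.! i)
           ]
```
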
The query module used to be big, and there used to be two of them. Now
there's only one and it's a lot smaller. So it makes sense to keep it
all together in one module.

We were using a mix of importing the bloom filter module qualified as `BF` and as `Bloom`.
dcoutts force-pushed the dcoutts/bloomfilter-blocked branch from 09afe55 to 49f8071 on May 16, 2025 00:06
dcoutts changed the title from "Bulk insert for bloom filters in RunAcc" to "bloom filter bulk inserts and queries" on May 16, 2025