Store Gateway: Cache expanded postings for expensive matcher #8023

yeya24 · 2024-12-26T20:17:20Z

I added CHANGELOG entry for this change.
Change is not relevant to the end user.

The Store Gateway index cache caches each posting individually. For example, pod="prometheus-0" is one cache key and pod="prometheus-1" is another. This is problematic for high cardinality labels such as host, node, or pod.

When a query contains a matcher such as pod!="", Store Gateway looks up all values of the pod label and fetches the posting for each value from the remote cache. This is fine for a small setup. However, we have 100+ vertically sharded blocks per day, and each block has close to 1M values for the pod label. When a query fetches 30 days of data with the matcher pod!="", it ends up fetching about 3B cache keys from the remote cache. It is even worse on a postings cache miss: Store Gateway backfills the remote cache with the postings it fetched from object storage, which can mean another 3B SET requests to the cache.
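
The 3B figure follows from multiplying the numbers above. A minimal sketch of the arithmetic, where `estimateCacheKeys` is a hypothetical helper that is not part of this PR:

```go
package main

import "fmt"

// estimateCacheKeys is a hypothetical helper. It returns how many per-value
// posting keys one query touches when every label value in every block is
// fetched as its own cache key.
func estimateCacheKeys(blocksPerDay, days, valuesPerBlock int64) int64 {
	return blocksPerDay * days * valuesPerBlock
}

func main() {
	// Figures from the description: 100 blocks per day, 30 days of data,
	// roughly 1M pod values per block.
	fmt.Println(estimateCacheKeys(100, 30, 1_000_000)) // 3000000000
}
```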

Lazy posting is one way to handle high cardinality matchers such as pod!="", and there have been several attempts to improve it. However, the root cause of the problem is that the index cache stores each label value (posting) separately, which does not work well for high cardinality labels.

Changes

This PR makes several changes:

For each posting group (in most cases a single label matcher in the query), if it matches more than 1 key, Store Gateway first tries to fetch the expanded postings for the group from the cache. For example, pod!="" usually matches more than 1 pod, so Store Gateway tries to fetch the expanded postings matching pod!="".
On a cache miss, Store Gateway falls back to fetching the individual postings for pod from the cache and object storage.
When Store Gateway downloads a posting from object storage and the posting's group matches more than 1 key, it does not backfill the remote cache with that individual posting. Instead, it backfills the expanded postings for the whole posting group.
A posting group that fetches only a single key, such as __name__="metric_name", works the same as it does today.
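
The steps above can be sketched roughly as follows. This is a simplified model and not the code in pkg/store/bucket.go: `postingGroup`, `store`, `union`, and `fetch` are hypothetical names, the caches and object store are plain maps, and per-block and per-tenant handling is left out.

```go
package main

import (
	"fmt"
	"sort"
)

// postingGroup is a hypothetical, simplified stand-in for a posting group.
type postingGroup struct {
	matcher string   // for example `pod!=""`
	keys    []string // individual posting keys the matcher expands to
}

// store is a hypothetical model of the caches and the object store.
type store struct {
	expanded map[string][]uint64 // matcher -> cached expanded postings
	postings map[string][]uint64 // key -> cached individual posting
	object   map[string][]uint64 // key -> posting in object storage
}

// union merges posting lists into one sorted list without duplicates.
func union(lists [][]uint64) []uint64 {
	var all []uint64
	for _, l := range lists {
		all = append(all, l...)
	}
	sort.Slice(all, func(i, j int) bool { return all[i] < all[j] })
	out := make([]uint64, 0, len(all))
	for _, v := range all {
		if len(out) == 0 || out[len(out)-1] != v {
			out = append(out, v)
		}
	}
	return out
}

// fetch resolves one posting group following the four steps in the list.
func (s *store) fetch(pg postingGroup) []uint64 {
	multi := len(pg.keys) > 1
	if multi {
		if p, ok := s.expanded[pg.matcher]; ok {
			return p // one cache lookup instead of len(pg.keys) lookups
		}
	}
	lists := make([][]uint64, 0, len(pg.keys))
	for _, k := range pg.keys {
		p, ok := s.postings[k]
		if !ok {
			p = s.object[k]
			if !multi {
				s.postings[k] = p // single-key groups backfill as before
			}
		}
		lists = append(lists, p)
	}
	merged := union(lists)
	if multi {
		s.expanded[pg.matcher] = merged // backfill the whole group once
	}
	return merged
}

func main() {
	s := &store{
		expanded: map[string][]uint64{},
		postings: map[string][]uint64{},
		object: map[string][]uint64{
			"pod=A": {1, 2, 3, 4}, "pod=B": {5, 6, 7, 8}, "pod=C": {9, 10},
		},
	}
	pg := postingGroup{matcher: `pod!=""`, keys: []string{"pod=A", "pod=B", "pod=C"}}
	fmt.Println(s.fetch(pg))     // cache miss: reads object storage, backfills expanded postings
	fmt.Println(s.fetch(pg))     // cache hit on the expanded postings
	fmt.Println(len(s.postings)) // no individual postings were backfilled
}
```

In this model the second call is served by a single lookup, and the per-value cache stays empty for the multi-key group.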

Basically, we change from caching individual postings to caching the expanded postings of the whole matcher.

now:
pod=A => [1,2,3,4]
pod=B => [5,6,7,8]
pod=C => [9, 10]

new:
pod!="" => [1,2,3,4,5,6,7,8,9,10]
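
To illustrate how the three per-value lists become the single expanded list, here is a minimal sketch of a sorted merge. `mergeSorted` is a hypothetical helper, not the merge used by Store Gateway, and it assumes each input list is already sorted and free of duplicates.

```go
package main

import "fmt"

// mergeSorted unions two sorted posting lists into one sorted list,
// keeping a series ID only once if it appears in both inputs.
func mergeSorted(a, b []uint64) []uint64 {
	out := make([]uint64, 0, len(a)+len(b))
	i, j := 0, 0
	for i < len(a) && j < len(b) {
		switch {
		case a[i] < b[j]:
			out = append(out, a[i])
			i++
		case a[i] > b[j]:
			out = append(out, b[j])
			j++
		default: // same series ID in both lists
			out = append(out, a[i])
			i++
			j++
		}
	}
	out = append(out, a[i:]...)
	return append(out, b[j:]...)
}

func main() {
	podA := []uint64{1, 2, 3, 4}
	podB := []uint64{5, 6, 7, 8}
	podC := []uint64{9, 10}

	// The value that would be cached under the single key pod!="".
	expanded := mergeSorted(mergeSorted(podA, podB), podC)
	fmt.Println(expanded) // [1 2 3 4 5 6 7 8 9 10]
}
```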

The index cache interface method FetchExpandedPostings is updated to support multiple sets of matchers. The previous version could fetch only one set of matchers per call.

today: FetchExpandedPostings(ctx context.Context, blockID ulid.ULID, matchers []*labels.Matcher, tenant string) ([]byte, bool)

new: FetchExpandedPostings(ctx context.Context, blockID ulid.ULID, matchers [][]*labels.Matcher, tenant string) [][]byte

Verification

Tests updated.

group if it has more than 1 key
Signed-off-by: Ben Ye <[email protected]>

yeya24 · 2024-12-29T00:42:55Z

pkg/store/bucket.go

+		if !pg.lazy {
+			// If posting group has more than 1 key to fetch, fetch expanded postings first.
+			// This helps for matcher such as !="", =~".+" to avoid fetching too many keys from cache.
+			if len(pg.addKeys) > 1 || len(pg.removeKeys) > 1 {


Maybe we can make the condition "greater than a certain number" instead of "> 1".

yeya24 · 2024-12-29T00:45:07Z

pkg/store/bucket.go

+			continue
+		}
+		// Cache miss, have additional keys to fetch.
+		keysLength += len(postingGroups[pgIdx].addKeys) + len(postingGroups[pgIdx].removeKeys)


On an expanded postings cache miss, we still try to fetch each posting individually from the index cache and then from object storage.

I wonder if we should just skip the cache and fetch from object storage directly. It makes no sense to fetch these postings from the cache if we don't store them there. They would only be present if the same posting had been queried directly with an = matcher.

pull-request-size bot added the size/XL label Dec 26, 2024

yeya24 changed the title ~~Store Gateway: Cache expanded postings for each matcher~~ Store Gateway: Cache expanded postings for expensive matcher Dec 26, 2024

yeya24 force-pushed the expanded-posting-posting-group branch 3 times, most recently from c898baa to ff747fa Compare December 27, 2024 23:58

yeya24 force-pushed the expanded-posting-posting-group branch from ff747fa to 08d1160 Compare December 28, 2024 21:24


yeya24 mentioned this pull request Jan 12, 2025: skip fetching postings from index cache for group with many keys #8054 (Draft)
