add matcher cache for proxy store and tsdb store API #111
Conversation
Nice work!
}

func NewMatcherConverter(cacheCapacity int, reg prometheus.Registerer) (*MatcherConverter, error) {
	c, err := cache.New2Q[LabelMatcher, *labels.Matcher](cacheCapacity)
Nice findings. The TwoQueue cache fits our use case very well.
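For context, here is a minimal sketch of what such a converter could look like, assuming the hashicorp/golang-lru/v2 TwoQueue cache (the diff's cache package looks like a wrapper around something similar) and a MatchersToPromMatchers method mirroring the package-level function; the field names, metric names, and error handling below are illustrative, not the PR's exact code:

// Minimal sketch, not the PR's code: memoize compiled Prometheus matchers
// keyed by the comparable storepb.LabelMatcher value.
package storepb

import (
	lru "github.com/hashicorp/golang-lru/v2"
	"github.com/prometheus/client_golang/prometheus"
	"github.com/prometheus/prometheus/model/labels"
)

type MatcherConverter struct {
	cache        *lru.TwoQueueCache[LabelMatcher, *labels.Matcher]
	hits, misses prometheus.Counter
}

func NewMatcherConverter(cacheCapacity int, reg prometheus.Registerer) (*MatcherConverter, error) {
	c, err := lru.New2Q[LabelMatcher, *labels.Matcher](cacheCapacity)
	if err != nil {
		return nil, err
	}
	mc := &MatcherConverter{
		cache:  c,
		hits:   prometheus.NewCounter(prometheus.CounterOpts{Name: "matcher_converter_cache_hits_total"}),
		misses: prometheus.NewCounter(prometheus.CounterOpts{Name: "matcher_converter_cache_misses_total"}),
	}
	if reg != nil {
		reg.MustRegister(mc.hits, mc.misses)
	}
	return mc, nil
}

// MatchersToPromMatchers converts matchers, reusing cached results so that
// expensive regex compilation only happens on a cache miss.
func (mc *MatcherConverter) MatchersToPromMatchers(ms ...LabelMatcher) ([]*labels.Matcher, error) {
	res := make([]*labels.Matcher, 0, len(ms))
	for _, m := range ms {
		if pm, ok := mc.cache.Get(m); ok {
			mc.hits.Inc()
			res = append(res, pm)
			continue
		}
		mc.misses.Inc()
		pm, err := MatcherToPromMatcher(m) // the single-matcher helper added in this PR
		if err != nil {
			return nil, err
		}
		mc.cache.Add(m, pm)
		res = append(res, pm)
	}
	return res, nil
}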
	NoCache CacheAction_Type = 2
)

func TestMatcherConverter_MatchersToPromMatchers(t *testing.T) {
Awesome job benchmarking this, would encourage submitting a PR to OSS as well :)
pkg/store/storepb/custom.go (Outdated)
@@ -381,31 +384,112 @@ func PromMatchersToMatchers(ms ...*labels.Matcher) ([]LabelMatcher, error) {
	return res, nil
}

func MatcherToPromMatcher(m LabelMatcher) (*labels.Matcher, error) {
nit: lower-case this function if we don't expose it outside the package.
	}
})

b.Run("With Cache", func(b *testing.B) {
Could we possibly add some random cache-miss cases, e.g. query with regex values like randomkey[0-4] while the generated test data spans randomkey[0-9], so the benchmark runs at roughly a 50% hit/miss ratio?
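One way such a mixed-workload case could look (a sketch under assumptions: the benchmark name and label values are illustrative, and the ~50% miss rate is sustained by giving the cache room for only half of the distinct regex values, with the exact ratio depending on 2Q eviction):

package storepb

import (
	"fmt"
	"math/rand"
	"testing"

	"github.com/prometheus/client_golang/prometheus"
)

// Sketch: the converter caches at most 5 entries while the workload draws
// from 10 distinct regex values, so roughly half of the lookups miss.
func BenchmarkMatcherConverter_HalfHitRate(b *testing.B) {
	mc, err := NewMatcherConverter(5, prometheus.NewRegistry())
	if err != nil {
		b.Fatal(err)
	}
	r := rand.New(rand.NewSource(42))
	b.ResetTimer()
	for i := 0; i < b.N; i++ {
		m := LabelMatcher{
			Type:  LabelMatcher_RE,
			Name:  "pod",
			Value: fmt.Sprintf("randomkey%d.*", r.Intn(10)), // 10 distinct regex values
		}
		if _, err := mc.MatchersToPromMatchers(m); err != nil {
			b.Fatal(err)
		}
	}
}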
Awesome job @yuchen-db, I highly encourage proposing this optimization to OSS once we've tested it in our setup. A few comments on styling and code structure.
@@ -488,8 +488,14 @@ func (p *PrometheusStore) startPromRemoteRead(ctx context.Context, q *prompb.Que

 // matchesExternalLabels returns false if given matchers are not matching external labels.
 // If true, matchesExternalLabels also returns Prometheus matchers without those matching external labels.
-func matchesExternalLabels(ms []storepb.LabelMatcher, externalLabels labels.Labels) (bool, []*labels.Matcher, error) {
-	tms, err := storepb.MatchersToPromMatchers(ms...)
+func matchesExternalLabels(ms []storepb.LabelMatcher, externalLabels labels.Labels, mc *storepb.MatcherConverter) (bool, []*labels.Matcher, error) {
cool, this is much cleaner
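A sketch of how call sites like this might branch on the converter (the helper name convertMatchers and the converter's MatchersToPromMatchers method are assumptions, not confirmed by the diff); since the cache is opt-in, a nil converter falls back to the original uncached conversion:

package store

import (
	"github.com/prometheus/prometheus/model/labels"
	"github.com/thanos-io/thanos/pkg/store/storepb"
)

// convertMatchers is a hypothetical helper: use the cached converter when
// one is configured, otherwise keep the pre-PR behavior.
func convertMatchers(ms []storepb.LabelMatcher, mc *storepb.MatcherConverter) ([]*labels.Matcher, error) {
	if mc != nil {
		// Cached path: compiled regex matchers are reused across requests.
		return mc.MatchersToPromMatchers(ms...)
	}
	// Uncached fallback, identical to the original code path.
	return storepb.MatchersToPromMatchers(ms...)
}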
nice change!
@@ -1097,6 +1136,8 @@ func (rc *receiveConfig) registerFlag(cmd extkingpin.FlagClause) {
		Default("10000").Uint64Var(&rc.topMetricsMinimumCardinality)
	cmd.Flag("receive.top-metrics-update-interval", "The interval at which the top metrics are updated.").
		Default("5m").DurationVar(&rc.topMetricsUpdateInterval)
	cmd.Flag("receive.store-matcher-converter-cache-capacity", "The number of label matchers to cache in the matcher converter for the Store API. Set to 0 to disable the cache. Default is 0.").
Curious how we decided on this cache size. Roughly how many matchers does each region see?
We use a constant 30k cache size across all regions. I tried 2k, 30k, and 100k; 30k gives a great cache hit rate (99.8%) with a negligible memory footprint in oregon-dev.
The benchmark shows a ~100x speedup when the hit rate is 100%. More tests in https://github.com/databricks-eng/universe/pull/837859
goos: darwin
goarch: arm64
pkg: github.com/thanos-io/thanos/pkg/store/storepb
cpu: Apple M1 Max
BenchmarkMatcherConverter_REWithAndWithoutCache
BenchmarkMatcherConverter_REWithAndWithoutCache/Without_Cache
BenchmarkMatcherConverter_REWithAndWithoutCache/Without_Cache-10 29792 42061 ns/op
BenchmarkMatcherConverter_REWithAndWithoutCache/With_Cache
BenchmarkMatcherConverter_REWithAndWithoutCache/With_Cache-10 3147709 371.9 ns/op
PASS
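For reference, the per-op ratio in the output above is 42061 ns / 371.9 ns ≈ 113x, consistent with the roughly 100x speedup quoted for a 100% hit rate.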