spanprofiler: do less work on unsampled traces #528

bboreham · 2024-05-30T11:34:37Z

What this PR does:

Skip the work of examining and decorating the span when the whole trace is not sampled, since that data goes nowhere.
In one production Mimir installation I looked at, this work accounted for 4% of all garbage and maybe 2% of all CPU.

BenchmarkTracer/unsampled-28             2866868               405.6 ns/op           979 B/op         10 allocs/op
BenchmarkTracer/sampled-28               1000000              1024 ns/op            2346 B/op         25 allocs/op
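For illustration, the early exit described above might look roughly like the sketch below. The `span` type, the `decorate` function, and the tag key are hypothetical stand-ins, not the dskit or opentracing API; the only point is that the sampled check comes before any allocation or decoration.

```go
package main

import "fmt"

// span is a hypothetical, minimal stand-in for a tracing span; it is not
// the opentracing or dskit type.
type span struct {
	operationName string
	sampled       bool
	tags          map[string]string
}

// decorate attaches a made-up profile tag to the span, but returns early
// when the trace is unsampled, because that span will never be exported.
// It reports whether any decoration work was done.
func decorate(s *span) bool {
	if !s.sampled {
		return false // nothing to do: the data would go nowhere
	}
	if s.tags == nil {
		s.tags = make(map[string]string)
	}
	s.tags["profile.span_name"] = s.operationName // hypothetical tag key
	return true
}

func main() {
	sampled := &span{operationName: "query", sampled: true}
	unsampled := &span{operationName: "query", sampled: false}
	fmt.Println(decorate(sampled), len(sampled.tags))
	fmt.Println(decorate(unsampled), len(unsampled.tags))
}
```

In this sketch the unsampled path allocates nothing, which is the kind of saving the benchmark numbers above reflect.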

Checklist

Tests updated
CHANGELOG.md updated

Skip the work to examine the span and decorate it, if the whole trace is not sampled hence the data is going nowhere. Signed-off-by: Bryan Boreham <[email protected]>

kolesnikovae · 2024-06-03T02:43:11Z

I probably should have made this clearer, but there is a reason for still profiling unsampled traces: span_name attribution (aggregated span profiles):

dskit/spanprofiler/spanprofiler.go

Lines 66 to 71 in 5141842

    
    } else {
    	// Even if the trace has not been sampled, we still need to keep track
    	// of samples that belong to the span (all spans with the given name).
    	ctx = pprof.WithLabels(parentCtx, pprof.Labels(
    		spanNameLabelName, operationName))
    }

On the other hand, I suppose that probabilistic trace sampling might not distort the results of profile sampling (structurally and proportionally), so that even a fraction of span profiles forms a trustworthy picture.

I guess it might make sense to add an option that controls the behaviour.

bboreham · 2024-06-06T17:11:19Z

Sorry I missed that nuance.

It will affect the result: suppose a particular span name does 1x work in traces sampled at a rate of 0.01 and 100x work in traces sampled at a rate of 0.1.

But by this reasoning, the aggregate seems low-value. I need to know the cost in context.
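The sampling-rate example above can be made concrete with a little arithmetic. The sketch below assumes, purely for illustration, 1000 traces in each class; the `traceClass` type and `totals` function are hypothetical names invented for this example.

```go
package main

import "fmt"

// traceClass is a hypothetical group of traces that share a sampling rate.
type traceClass struct {
	traces       float64 // number of traces in the class (assumed)
	workPerTrace float64 // relative work done by the span in each trace
	sampleRate   float64 // probability that a trace is sampled
}

// totals returns the work actually done and the work expected to be
// observed when only sampled traces contribute to the aggregate.
func totals(classes []traceClass) (actual, observed float64) {
	for _, c := range classes {
		work := c.traces * c.workPerTrace
		actual += work
		observed += work * c.sampleRate
	}
	return actual, observed
}

func main() {
	classes := []traceClass{
		{traces: 1000, workPerTrace: 1, sampleRate: 0.01},
		{traces: 1000, workPerTrace: 100, sampleRate: 0.1},
	}
	actual, observed := totals(classes)
	fmt.Printf("actual=%.0f observed=%.0f effective rate=%.4f\n",
		actual, observed, observed/actual)
}
```

With these assumed numbers the aggregate sees 10010 of 101000 units of work, an effective rate of about 0.099 that matches neither configured rate, so the aggregate cannot be scaled back to the true cost with a single factor.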

kolesnikovae · 2024-06-07T09:49:30Z

I'm going to add an option that controls this behavior. By default, sampled traces will be accounted for in span aggregate profiles. Would you mind?

spanprofiler: do less work on unsampled traces

988afe7


bboreham force-pushed the skip-unsampled branch from cc972d1 to 988afe7 Compare May 30, 2024 11:35

bboreham requested a review from kolesnikovae May 30, 2024 11:35

colega approved these changes May 30, 2024


bboreham merged commit 5141842 into main May 31, 2024
6 checks passed

bboreham deleted the skip-unsampled branch May 31, 2024 09:58
