-
Notifications
You must be signed in to change notification settings - Fork 46
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Is possible to expose more Clickhouse kafka connector metrics like p99 latency of ingesting to Clickhouse Cloud, # of errors/retriable errors, etc. #441
Comments
@georgeli-roblox Hi! For Question 1 - do these details clear it up or did you have further questions about it? |
Ah, I didn't see this documentation before. Thanks. So the TaskProcessingTime could be used as the e2e (end-to-end) Also , if it needs to wait for the Ack from Clickhouse Cloud, then it could take longer. if this metric is real time for processing 1 record or a batch of records, the prometheus metric scraper might have a larger interval in-between. So maybe it might be better to convert this counter/gauge into histograms for p99/p95/p50 |
Hi @georgeli-roblox For Regarding latency, you want us to expose it as histogram, the same as Prometheus will expect. |
Having the
I think if the connector restarting, and the metrics (errors/retries) getting reset it should be fine. e.g. the COUNTER type, whose value will increase monolithically and need the
Yes. I think that would work to use the histogram_quantile() over the bucket. or directly with tags like p999, p99, p95, p75, p50. e.g. see this Kafka metric: Thanks @mzitnik @Paultagoras |
Is your feature request related to a problem? Please describe.
@Paultagoras Thanks for adding the Clickhouse connector metrics for #209
Also see below graph.
Some questions:
I
Thanks,
George
Describe the solution you'd like
A clear and concise description of what you want to happen.
TaskProcessingTime/RecordProcessingTime/ReceivedRecords
metricsDescribe alternatives you've considered
A clear and concise description of any alternative solutions or features you've considered.
Additional context
Add any other context or screenshots about the feature request here.
The text was updated successfully, but these errors were encountered: