Skip to content

Commit

Permalink
Update proposal with the results of the design doc
Browse files Browse the repository at this point in the history
Signed-off-by: Chris Marchbanks <[email protected]>
  • Loading branch information
csmarchbanks committed Mar 22, 2024
1 parent 6ce4170 commit e06d3d7
Showing 1 changed file with 30 additions and 7 deletions.
37 changes: 30 additions & 7 deletions proposals/2024-01-29_native_histograms_text_format.md
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@
* **Implementation Status:** Not implemented

* **Related Issues and PRs:**
* `<GH Issues/PRs>`
* [Native Histogram Support in client_python](https://github.com/prometheus/client_python/issues/918)

* **Other docs or links:**
* [Design Doc for choosing a proposal](https://docs.google.com/document/d/1qoHf24cKMpa1QHskIjgzyf3oFhIPvacyJj8Tbe6fIrY/edit#heading=h.5sybau7waq2q)
Expand All @@ -19,8 +19,6 @@ Today it is only possible to export native histograms using the Protocol Buffers

During a [dev summit in 2022](https://docs.google.com/document/d/11LC3wJcVk00l8w5P3oLQ-m3Y37iom6INAMEu2ZAGIIE/edit#bookmark=id.c3e7ur6rn5d2) there was consensus we would continue to support the text format. Including native histograms as part of the text format shows commitment to that consensus.

See the linked design doc in Google Docs for additional background information.

### Pitfalls of the current solution

Prometheus client libraries such as Python do not want to require a dependency on protobuf in order to expose native histograms, and in some languages protobuf is painful to use. Gating native histograms only to clients/users willing to use protobuf hurts adoption of native histograms, therefore, we would like a way to represent a native histogram in the text based format.
Expand All @@ -46,15 +44,40 @@ Client library maintainers, OpenMetrics, and Prometheus scrape maintainers.

## How

To be filled out after one of the proposals in the linked design doc has support.
Extend the OpenMetrics text format to allow structured values instead of only float values. This structured value will be used to encode a structure with the same fields as is exposed using the protobuf exposition format. Starting with an example and then breaking up the format:
```
# TYPE nativehistogram histogram
nativehistogram {sample_count:24,sample_sum:100,schema:0,zero_threshold:0.001,zero_count:4,positive_span:[0:2,1:2],negative_span:[0:2,1:2],positive_delta:[2,1,-2,3],negative_delta:[2,1,-2,3]}
```
The metric will have no "magic" suffixes, then the value for each series is a custom struct format with the following fields:
* `sum: float64` - The sum of all observations for this histogram. Could be negative in cases with negative observations.
* `count: uint64` - The number of samples that were observed for this histogram.
* `schema: int32` - The schema used for this histogram, currently supported values are -4 -> 8.
* `zero_threshold: float64` - The width of the zero bucket.
* `zero_count: uint64` - The number of observations inside the zero bucket.
* `negative_span: []BucketSpan` - The buckets corresponding to negative observations, optional.
* `negative_delta: []int64` - The delta of counts compared to the previous bucket.
* `positive_span: []BucketSpan` - The buckets corresponding to negative observations, optional.
* `positive_delta: []int64` - The delta of counts compared to the previous bucket.

A bucket span is the combination of an `int32` offset and a `uint32` length. It is encoded as `<offset>:<length>`. Lists/arrays are encoded within square brackets with elements separated by commas. Compared to JSON this avoids consistently repeating keys and curly braces.

Note that in this initial implementation float histograms are not supported.

## Alternatives

To be filled out after one of the proposals in the linked design doc has support.
### Do nothing

One valid option is to avoid this extra format and require anyone who desires to use native histograms to use protobuf for exposition. It would go against the consensus of the Prometheus team members from 2022 however.

### Alternate exposition formats

See the alternate exposition formats proposed in the [design document](https://docs.google.com/document/d/1qoHf24cKMpa1QHskIjgzyf3oFhIPvacyJj8Tbe6fIrY/edit#heading=h.5sybau7waq2q). The other formats generally added in additional readability/verbosity at the expense of performance.

## Action Plan

The tasks to do in order to migrate to the new idea.

* [ ] Task one <GH issue>
* [ ] Task two <GH issue> ...
* [ ] Implement an encoder and parser in client_python
* [ ] Implement an experimental parser in the Prometheus server
* [ ] Update OpenMetrics with formalized syntax

0 comments on commit e06d3d7

Please sign in to comment.