This PR introduces support for telemetry for `b7s` nodes. We have support for two main mechanisms: tracing and metrics.
Tracing
Tracing can be enabled with the `--enable-tracing` CLI flag and either the `--tracing-grpc-endpoint` or the `--tracing-http-endpoint` flag, depending on which protocol should be used to transfer tracing data.
Below is an example configuration for a config file:
Some more background about the implementation can be found in #158.
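To illustrate how the three tracing flags relate to each other, here is a minimal Go sketch that parses them and rejects a configuration where tracing is enabled but no endpoint is given. It uses the standard library `flag` package purely for illustration; the `tracingConfig` type and the `parseTracingFlags` function are hypothetical names, and the "at least one endpoint" rule is my reading of the description above, not necessarily what the node enforces.

```go
package main

import (
	"errors"
	"flag"
	"fmt"
	"os"
)

// tracingConfig is a hypothetical struct for this sketch.
// It does not mirror the actual b7s configuration types.
type tracingConfig struct {
	Enable       bool
	GRPCEndpoint string
	HTTPEndpoint string
}

// validate checks the assumed rule: if tracing is enabled,
// at least one of the two endpoints must be set.
func (c tracingConfig) validate() error {
	if !c.Enable {
		return nil
	}
	if c.GRPCEndpoint == "" && c.HTTPEndpoint == "" {
		return errors.New("tracing enabled but neither --tracing-grpc-endpoint nor --tracing-http-endpoint is set")
	}
	return nil
}

// parseTracingFlags parses the three tracing flags named in this PR
// from args and validates the result.
func parseTracingFlags(args []string) (tracingConfig, error) {
	var cfg tracingConfig
	fs := flag.NewFlagSet("tracing", flag.ContinueOnError)
	fs.BoolVar(&cfg.Enable, "enable-tracing", false, "enable tracing")
	fs.StringVar(&cfg.GRPCEndpoint, "tracing-grpc-endpoint", "", "endpoint for sending traces over gRPC")
	fs.StringVar(&cfg.HTTPEndpoint, "tracing-http-endpoint", "", "endpoint for sending traces over HTTP")
	if err := fs.Parse(args); err != nil {
		return cfg, err
	}
	return cfg, cfg.validate()
}

func main() {
	cfg, err := parseTracingFlags(os.Args[1:])
	if err != nil {
		fmt.Fprintln(os.Stderr, err)
		os.Exit(1)
	}
	fmt.Printf("%+v\n", cfg)
}
```

Running the sketch with `--enable-tracing --tracing-grpc-endpoint localhost:4317` (the endpoint value is a placeholder) prints the parsed configuration, while `--enable-tracing` on its own exits with an error.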
Metrics
Metrics can be enabled with the `--enable-metrics` and `--prometheus-address` CLI flags. The latter controls the address on which the worker node serves the metrics. In the case of the head node, the metrics are served on the REST API address. Metrics can be found on the `/metrics` endpoint.
Support for push metrics was dropped because it seems to go against the Prometheus guidelines. Where Prometheus cannot reach a node directly, pull-based scraping can still be enabled by using Prometheus PushProx.
For the config file, the metrics configuration might be:
More information about the implementation can be found in #164.
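As a rough illustration of the pull model described above, the following Go sketch serves a single counter in the Prometheus text exposition format on `/metrics`, at the address given by `--prometheus-address`. It uses only the standard library to stay self-contained; a real node would most likely rely on a Prometheus client library instead. The metric name `example_scrapes_total` and the function names are hypothetical.

```go
package main

import (
	"flag"
	"fmt"
	"log"
	"net/http"
	"sync/atomic"
)

// scrapesTotal is a hypothetical counter used only in this sketch.
var scrapesTotal atomic.Uint64

// renderMetrics formats one counter in the Prometheus text exposition format.
func renderMetrics(n uint64) string {
	return fmt.Sprintf("# TYPE example_scrapes_total counter\nexample_scrapes_total %d\n", n)
}

// metricsHandler counts the scrape and writes the current value.
func metricsHandler(w http.ResponseWriter, _ *http.Request) {
	n := scrapesTotal.Add(1)
	w.Header().Set("Content-Type", "text/plain; version=0.0.4; charset=utf-8")
	fmt.Fprint(w, renderMetrics(n))
}

func main() {
	enable := flag.Bool("enable-metrics", false, "enable metrics")
	addr := flag.String("prometheus-address", "", "address on which to serve metrics")
	flag.Parse()

	if !*enable {
		log.Println("metrics disabled")
		return
	}
	if *addr == "" {
		log.Fatal("--prometheus-address must be set when metrics are enabled")
	}

	mux := http.NewServeMux()
	mux.HandleFunc("/metrics", metricsHandler)
	log.Fatal(http.ListenAndServe(*addr, mux))
}
```

With the sketch running, a Prometheus server (or `curl`) pointed at `/metrics` on the configured address would receive the counter, incremented on each scrape.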
Note that these two PRs are not the final version. Some changes were made afterwards in #167, and on a minor scale in this PR, after rebasing the feature branch.
Subsequent changes mainly deal with splitting `tracing` and `metrics` and supporting enabling one or the other, instead of forcing `telemetry` as a single system.