Expose NVMe Volume Metrics #2216
Signed-off-by: torredil <[email protected]>
0fb096f to 428fb1d
/lgtm
Signed-off-by: torredil <[email protected]>
428fb1d to 5927a72
Looks great, left a few non-blocking unit test comments/questions.
/hold for @ElijahQuinones's review
Signed-off-by: torredil <[email protected]>
/lgtm
/label tide/merge-method-squash
/lgtm
/unhold
/approve
LGTM, thank you for also adding the service in this PR :)
[APPROVALNOTIFIER] This PR is APPROVED
This pull-request has been approved by: AndrewSirenko, ElijahQuinones
Hey, thanks for adding these metrics. Unfortunately, a few things about this change don't follow Prometheus best practices: https://prometheus.io/docs/practices/naming/ Would you be willing to accept changes that adapt the data to meet best practices?
Hi @SuperQ, thank you for reaching out about this and engaging with the community on Prometheus best practices. We are aware of that documentation, and it's one of the things our team carefully evaluated when implementing this feature. While we generally strive to follow best practices, in this specific case we had a requirement and intentionally decided to stay consistent with the underlying EBS metrics output, so that the metrics map directly to the AWS documentation and CloudWatch metrics. That said, could you share an example of the change you'd propose? I'd like to discuss with the team whether we can incorporate the naming best practices while still meeting the consistency requirement.
There's basically no way to make the underlying metric match best practices without breaking changes. My changes would be:
I also reviewed the code and found that the upstream data is missing the histogram cumulative sum data.
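For context on the histogram point: a Prometheus histogram is exposed as cumulative `_bucket` series (each labeled with an upper bound `le`), plus a `_count` and a `_sum` series. The sketch below assumes, hypothetically, that the source reports per-bin counts only. In that case the cumulative buckets and the count can be derived, but the sum of observed values cannot be recovered. The `bin` type, the `cumulative` helper, and the `example_latency_seconds` metric name are illustrative and not taken from this PR.

```go
package main

import "fmt"

// bin is a hypothetical per-bin record: the bin's upper bound and the number
// of observations that fell into that bin alone (not cumulative).
type bin struct {
	upper float64
	count uint64
}

// cumulative converts per-bin counts into Prometheus-style cumulative bucket
// counts, where each bucket counts every observation <= its upper bound.
// Bins are assumed to be sorted by ascending upper bound.
func cumulative(bins []bin) (buckets []bin, total uint64) {
	buckets = make([]bin, 0, len(bins))
	for _, b := range bins {
		total += b.count
		buckets = append(buckets, bin{upper: b.upper, count: total})
	}
	return buckets, total
}

func main() {
	// Made-up sample data for illustration only.
	bins := []bin{{0.001, 5}, {0.01, 3}, {0.1, 2}}
	buckets, total := cumulative(bins)
	for _, b := range buckets {
		fmt.Printf("example_latency_seconds_bucket{le=\"%g\"} %d\n", b.upper, b.count)
	}
	// The +Inf bucket always equals the total observation count.
	fmt.Printf("example_latency_seconds_bucket{le=\"+Inf\"} %d\n", total)
	fmt.Printf("example_latency_seconds_count %d\n", total)
	// No _sum line is printed: per-bin counts alone do not carry the sum
	// of the observed values.
}
```

Without a `_sum` series, queries that compute an average from `_sum` and `_count` are not possible, although quantile estimates from the buckets still are.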
That makes sense @SuperQ, I'll bring up this feedback for discussion during our next weekly sync and provide a response here shortly after. If the team decides to move forward with the proposal, we also need to consider the existing metrics exported by the controller plugin when adding or changing prefixes.
Is this a bug fix or adding new feature?
New feature.
What is this PR about? / Why do we need it?
This PR adds support for collecting detailed performance statistics for CSI managed devices. The log page for each device is retrieved at gather time, transformed into a Prometheus-compatible format, and exposed at a `/metrics` HTTP endpoint.
What testing is done?