Skip to content

Conversation

@aadam19
Copy link
Contributor

@aadam19 aadam19 commented Sep 12, 2025

This PR continues the work of my fellow noble and courageous intern @tudor-manea.
We were scraping already for the node_(gpu|efa|neuroncore)_request but somehow forgot to make the agent pick it up.
Link to IntegTests PR that made us realize: aws/amazon-cloudwatch-agent-test#569

@aadam19 aadam19 requested a review from a team as a code owner September 12, 2025 13:32
@aadam19 aadam19 force-pushed the tudorman/efa-gpu-metrics branch from 0e5acce to 3cf5dfd Compare September 12, 2025 13:45
@aadam19 aadam19 changed the title [fix] Added node_(gpu|efa|neuroncore)_request metric [Fix] Added node_(gpu|efa|neuroncore)_request metric Sep 12, 2025
- node_status_condition_unknown
- node_status_capacity_pods
- node_status_allocatable_pods
- node_gpu_request
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Are we sure we want this? I think this was left out in the past since request & limit are always the same for GPUs.

@aadam19 aadam19 closed this Sep 15, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants