Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add aggregated energy to solution to job views #261

Open
moebiusband73 opened this issue Apr 24, 2024 · 1 comment · May be fixed by #289
Open

Add aggregated energy to solution to job views #261

moebiusband73 opened this issue Apr 24, 2024 · 1 comment · May be fixed by #289
Labels
backend backend API related issues enhancement New feature or request frontend Svelte UI related issues
Milestone

Comments

@moebiusband73
Copy link
Member

Show total energy to solution for a job to the job meta data, the job list and job view.

This requires a aggregated energy value for a job. The cc-metric-collector already offers aggregated as well as incremental energy counters based on RAPL.

Discussion is required how other node agents could provide those metrics and how to make this configurable.

@moebiusband73 moebiusband73 added enhancement New feature or request frontend Svelte UI related issues backend backend API related issues labels Apr 24, 2024
@moebiusband73 moebiusband73 added this to the Release 1.4.0 milestone Apr 24, 2024
@giesselmann
Copy link
Collaborator

giesselmann commented Apr 24, 2024

I could imagine two ways to get energy to solution for a job:

  1. Submit an external value via stop_job API endpoint, that would support arbitrary installations but would require people's submit scripts to query their metric backends.
  2. Allow the user to configure a 'power-metric' or 'energy-metric' e.g. cpu_power or node_power and integrate/read those values in the job-archiving routines.

Ideally these methods could work together in a way, if value submitted from outside, internal calculation is skipped.

In my opinion job-specific integration is a task for the backend, not the metric store.
From our experience, only integrating the minute measurements is not accurate, a better job-energy is obtained from multiplying average node_power with the actual runtime.

@spacehamster87 spacehamster87 linked a pull request Oct 7, 2024 that will close this issue
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
backend backend API related issues enhancement New feature or request frontend Svelte UI related issues
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants