Skip to content

Latest commit

 

History

History
43 lines (29 loc) · 2.16 KB

README.md

File metadata and controls

43 lines (29 loc) · 2.16 KB

job-wrapper.py

This Python script is a super simple wrapper for jobs run via cron or systemd. It collects the status of the job and writes the result as OpenMetric files to a target folder. The metric files can form the basis of a visualization and alerting system for the jobs.

Requirements

  • Python 3
  • An OpenMetrics compatible collector to pick up the metric files (e.g. Alloy).
  • Some application to display the metrics (e.g. Prometheus + Grafana)

How to Use

Wrap the job to be executed with job-wrapper.py.

python3 job-wrapper.py <job-name> <job-command> [<command args>...]

  • job-name: The name of the job (will be used for naming the output files and labels for the metrics)
  • job-command: The command which should be executed
  • command args: Further arguments passed to the command

A $JOB_METRICS_FILE environment variable is passed to the job, which can be used to write custom metrics (e.g. backup size for a backup job).

Metrics

Metric Name Description Format
job_success Indicates if the last job run was successful. Returns 1 for success and 0 for failure. Gauge
job_execution_seconds The execution time of the last job run, provided in seconds. Gauge
job_last_success_timestamp_seconds The Unix timestamp of the last successful job run. Gauge
custom job metric Any custom metric which the job writes into the $JOB_METRICS_FILE file any

The three predefined metrics are sufficient to

  • Alert when a job has not run successfully for a specified amount of time.
  • Monitor job execution times.
  • Have a visualization to see which job is currently working or not.

Goals

  • Provide a simple wrapper for execution of jobs with cron or systemd
  • Manage output of metrics
  • Do not use any external libraries, to allow easy deployment

For more information, see the related blog post.