Skip to content
This repository has been archived by the owner on Feb 28, 2025. It is now read-only.

Metric AI service #1322

Open
wants to merge 70 commits into
base: main
Choose a base branch
from
Open

Metric AI service #1322

wants to merge 70 commits into from

Conversation

tybalex
Copy link
Contributor

@tybalex tybalex commented Apr 19, 2023

This is the python service implementation of the Epic attached.
a few concepts in this service:

  • A job: A list of namespaces of a cluster to be watched, selected by user, similar to the watchlist we already have in log anomaly detection
  • A job run: run metric analysis for a job at the requested timestamp.

What's included in this PR:

  1. Http server in main.py: provide a few APIs, the most important one is /run_job/{job_id}/, which is the logic to run a job. The other APIs are used by gateway plugin.
  2. filter_anomaly_metric.py defines the rules to filter out normal metrics.
  3. the model dir, cnn_model.py defines the structure of the CNN model and metric_pattern_classification.py defines the logic to train/eval/predict with the model.
  4. Grafana_dashboard_utils.py defines functions to create/delete dashboardjson.
  5. the tests dir: unit-test

kralicky and others added 29 commits March 2, 2023 13:08
…etrics, and it can correctly capture some spikes.
@tybalex tybalex linked an issue Apr 19, 2023 that may be closed by this pull request
7 tasks
@tybalex tybalex requested a review from kralicky April 28, 2023 03:13
@tybalex
Copy link
Contributor Author

tybalex commented Apr 28, 2023

Regard to the model file, @AmartC will move it to Opni's internal S3 bucket.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
Projects
None yet
2 participants