Commit

Update SOPs
Signed-off-by: Douglas Camata <[email protected]>
douglascamata committed Sep 28, 2023
1 parent d74fbec commit 597c8c5
Showing 1 changed file with 26 additions and 0 deletions.
26 changes: 26 additions & 0 deletions docs/sop/observatorium.md
@@ -34,6 +34,7 @@
* [Observatorium Gubernator Alerts](#observatorium-gubernator-alerts)
* [GubernatorIsDown](#gubernatorisdown)
* [Observatorium Obsctl Reloader Alerts](#observatorium-obsctl-reloader-alerts)
* [ObsCtlIsDown](#obsctlisdown)
* [ObsCtlRulesStoreServerError](#obsctlrulesstoreservererror)
* [ObsCtlFetchRulesFailed](#obsctlfetchrulesfailed)
* [ObsCtlRulesSetFailure](#obsctlrulessetfailure)
@@ -898,6 +899,31 @@ Observatorium rate-limiting service is not working.

# Observatorium Obsctl Reloader Alerts

## ObsCtlIsDown

### Impact

Tenants' rules are not being pushed to Observatorium, so they might be stale.

### Summary

The `obsctl-reloader` is most likely down, in a crashloop state.

### Severity

`critical`

### Access Required

- Console access to the production clusters that run Observatorium (this system isn't used in staging), currently [telemeter-prod-01 OSD](https://console-openshift-console.apps.telemeter-prod.a5j2.p1.openshiftapps.com/k8s/cluster/projects/observatorium-mst-production) and [rhobsp02ue1 OSD](https://console-openshift-console.apps.rhobsp02ue1.y9ya.p1.openshiftapps.com/).
- Edit access to the Observatorium namespaces:
  - `observatorium-mst-production`

### Steps

- In the OSD console for the affected cluster, check the pods belonging to the `obsctl-reloader` deployment and establish what is causing the crashloop.
- Further actions depend on the root cause found, which will most likely be something we have not encountered before.
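
For those who prefer the CLI over the console, the first step above could be sketched roughly as follows. This is only an illustration: the helper name `inspect_obsctl_reloader` is hypothetical, the namespace is taken from the access list above, and the deployment name `obsctl-reloader` is assumed to match the name used in this SOP (it may carry a prefix in practice).

```shell
# Assumptions: namespace from "Access Required"; deployment name as written in
# this SOP. Override either variable if the real names differ.
NAMESPACE="${NAMESPACE:-observatorium-mst-production}"
DEPLOYMENT="${DEPLOYMENT:-obsctl-reloader}"

# Hypothetical helper, not part of any existing tooling.
inspect_obsctl_reloader() {
  # List pods; look for reloader pods in CrashLoopBackOff or with many restarts.
  oc -n "$NAMESPACE" get pods

  # Show rollout status, conditions, and recent events for the deployment.
  oc -n "$NAMESPACE" describe "deployment/$DEPLOYMENT"

  # Logs of the previously crashed container usually contain the failure reason.
  oc -n "$NAMESPACE" logs "deployment/$DEPLOYMENT" --previous --tail=100

  # Namespace events in chronological order (OOM kills, failed probes, image pulls).
  oc -n "$NAMESPACE" get events --sort-by=.lastTimestamp
}
```

After `oc login` to the affected cluster, running `inspect_obsctl_reloader` prints the four outputs in sequence. If the pod has more than one container, `oc logs` may need an explicit `-c <container>`.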

## ObsCtlRulesStoreServerError

### Impact