You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
After deploying cos-alerter, I was investigating why alertmanager was sending Watchdog alerts very erratically.
I looked at the alertmanager logs, and found that it was constantly restarted by pebble (see the logs below).
To Reproduce
I did not initially deploy this COS cluster, nor do I have a k8s cluster at hand to reproduce. If it helps, here is the alertmanager config:
I'm of the opinion that we don't want to drop any logs, so for me drop_newest is not a viable option. https://vector.dev/docs/about/under-the-hood/architecture/buffering-model/#overflow-to-another-buffer-overflow seems like it would be exactly what we're searching for, but this is not yet suitable for production, so we'll need to track the status of that mode. Until then, we should add alert rules in vector to surface when the service either
a) goes into a crash loop, or
b) has assymetric input/output, indicating backpressure.
Bug Description
After deploying cos-alerter, I was investigating why alertmanager was sending Watchdog alerts very erratically.
I looked at the alertmanager logs, and found that it was constantly restarted by pebble (see the logs below).
To Reproduce
I did not initially deploy this COS cluster, nor do I have a k8s cluster at hand to reproduce. If it helps, here is the alertmanager config:
Environment
The COS bundle runs on a microk8s cluster. The charms are the latest/stable versions.
Relevant log output
Additional context
No response
The text was updated successfully, but these errors were encountered: