[FEATURE] Create an alert for tsars when the datatrail (registration?) service goes down at any of the sites #37

Open
kaitshin opened this issue Sep 10, 2023 · 0 comments
Labels
enhancement New feature or request

Is your feature request related to a problem? Please describe.
A few days ago, events that should have been registered could not be found with datatrail (slack thread). It appears that the registration service had stopped working and needed to be restarted, but nobody was aware of or alerted to the problem until it was noticed and raised manually.

Describe the solution you'd like
A Slack alert or a Grafana panel that shows when the datatrail registration service is down and needs to be restarted. Bonus points if it's something that an frb/outrigger tsar can fix themselves, or if it alerts frbadmin, who could fix it (so that the burden is not solely on Tarik).
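
As a starting point for discussion, here is a minimal sketch of what the Slack side could look like. It assumes each site's registration service exposes some HTTP endpoint that returns a 2xx status when healthy, which may not be the case. The site names and URLs below are placeholders, and `is_up`, `post_to_slack`, and `check_sites` are hypothetical helpers, not part of datatrail. The only external behaviour relied on is that a Slack incoming webhook accepts a JSON body with a `text` field.

```python
import http.client
import json
import urllib.request
from typing import Callable, Dict, List

# Placeholder site names and URLs: the real health endpoints are not
# given in this issue and would need to be filled in.
SITES: Dict[str, str] = {
    "site-a": "http://site-a.example/registration/health",
    "site-b": "http://site-b.example/registration/health",
}


def is_up(url: str, timeout: float = 5.0) -> bool:
    """Return True if the URL answers with a 2xx status within the timeout."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return 200 <= resp.status < 300
    except (OSError, http.client.HTTPException):
        # Covers connection errors, timeouts, and HTTP error statuses.
        return False


def post_to_slack(webhook_url: str, text: str) -> None:
    """Send a plain-text message to a Slack incoming webhook."""
    payload = json.dumps({"text": text}).encode("utf-8")
    req = urllib.request.Request(
        webhook_url,
        data=payload,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req, timeout=10):
        pass


def check_sites(
    sites: Dict[str, str],
    probe: Callable[[str], bool] = is_up,
    notify: Callable[[str], None] = print,
) -> List[str]:
    """Probe every site and send one message listing the sites that are down."""
    down = [name for name, url in sites.items() if not probe(url)]
    if down:
        notify(
            "datatrail registration service appears down at: "
            + ", ".join(down)
            + ". It may need to be restarted."
        )
    return down
```

Something like `check_sites(SITES, notify=lambda text: post_to_slack(WEBHOOK_URL, text))`, run periodically (for example from cron), would post to a channel that tsars and frbadmin watch. `WEBHOOK_URL` is likewise a placeholder. The `probe` and `notify` arguments are injectable so the logic can be tested without touching the network.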

Describe alternatives you've considered
N/A

Additional context
N/A

kaitshin added the enhancement label Sep 10, 2023