This repository has been archived by the owner on Apr 21, 2023. It is now read-only.

Create new engineering internal monitoring & dashboard #364

Open
Robcwilliams opened this issue Jan 5, 2018 · 0 comments

Engineering would like to add further layers of internal monitoring for awareness and early detection:

  • Alert if RDS goes down

  • Alert if there is a widespread AWS outage in a region where we are hosted
    https://status.aws.amazon.com/

  • Identify exec-level items that need to happen, or playbooks that need to be run, before escalating and pinging our engineers.
    -- These self-healing playbooks should be written / provided by Insights eng

  • Monitor & report the number of 500 errors per night

  • Monitor & report total uploads processed nightly
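
As a rough illustration of the last two items, here is a minimal sketch of a nightly report built from web access logs. Everything specific in it is an assumption rather than something this issue defines: the common-log-style line format, the hypothetical `/uploads` endpoint, treating any 5xx status as a "500 error", and counting a successful `POST` to that endpoint as one processed upload.

```python
import re
from collections import Counter
from datetime import datetime

# Assumed (hypothetical) access-log line shape, common-log style:
# 203.0.113.7 - - [05/Jan/2018:02:14:09 +0000] "POST /uploads HTTP/1.1" 201 12
LOG_LINE = re.compile(
    r'\[(?P<ts>[^\]]+)\] "(?P<method>[A-Z]+) (?P<path>\S+) [^"]*" (?P<status>\d{3})'
)


def nightly_report(lines, upload_path="/uploads"):
    """Summarise log lines per calendar day.

    Returns {"YYYY-MM-DD": {"errors_5xx": int, "uploads": int}}.
    `upload_path` is a placeholder; the real endpoint is not named in this issue.
    """
    errors = Counter()
    uploads = Counter()
    for line in lines:
        match = LOG_LINE.search(line)
        if match is None:
            continue  # skip lines that do not look like access-log entries
        stamp = datetime.strptime(match["ts"], "%d/%b/%Y:%H:%M:%S %z")
        day = stamp.date().isoformat()
        status = int(match["status"])
        if 500 <= status <= 599:
            errors[day] += 1
        is_upload = match["method"] == "POST" and match["path"] == upload_path
        if is_upload and 200 <= status <= 299:
            uploads[day] += 1
    days = sorted(set(errors) | set(uploads))
    return {d: {"errors_5xx": errors[d], "uploads": uploads[d]} for d in days}


if __name__ == "__main__":
    sample = [
        '203.0.113.7 - - [05/Jan/2018:02:14:09 +0000] "POST /uploads HTTP/1.1" 201 12',
        '203.0.113.7 - - [05/Jan/2018:02:15:00 +0000] "GET /reports HTTP/1.1" 500 0',
    ]
    for day, counts in nightly_report(sample).items():
        print(day, counts)
```

A scheduled job could feed the previous night's log file into `nightly_report` and post the counts to a dashboard or chat channel; where the numbers come from in practice (application logs, load balancer logs, or a metrics service) is still to be decided.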
