Skip to content

Latest commit

 

History

History
77 lines (54 loc) · 4.88 KB

disaster-recovery-testing.md

File metadata and controls

77 lines (54 loc) · 4.88 KB

Disaster Recovery testing

This document covers the Disaster Recovery testing procedure for applications hosted on the Teacher Services AKS clusters based on scenarios detailed in the Disaster recovery document.

Prerequisites

Documentation requirements

Copy the template DR testing document which will be a record of the scenarios run, time taken, and any issues.

Initial set-up

Participants must have access to Github and the repositories.

Schedule virtual meeting for the test to take place

  • teams or slack
  • invite the relevant stakeholders

Regularly provide updates on the service Slack channel to keep product owners abreast of developments.

Scenario 1: Loss of database instance

See DR scenario 1.

Delete the postgres database instance

Note that you must have a previously created backup on azure storage before starting this step. If not, create one now before continuing.

Follow the disaster recovery instructions.

Scenario 2: Loss of data

See DR scenario 2.

Delete data from the postgres database instance

Make a note of the time this step is being started as the restore point must be before you delete any data.

  • Delete a table manually
    • connect via konduit and delete the table
    • it must be possible to confirm the data has been deleted either within the app, by errors messages being logged, the app crashing or users observing inconsistent content.

Follow the disaster recovery instructions.

Post scenario steps

Documentation requirements

  • Complete the DR testing document and save in the DR test Reports folder
  • Update the service on the infra team sharepoint service list with the DR date and status (success/fail)

Post DR test review

  • Review the just completed DR test, and raise trello cards for any process improvements.
  • Review the contact list in the Teacher services list