Skip to content

Commit

Permalink
update (#27)
Browse files Browse the repository at this point in the history
  • Loading branch information
karl-johan-grahn authored Mar 14, 2024
1 parent c4d8333 commit 8dabcb8
Show file tree
Hide file tree
Showing 3 changed files with 59 additions and 2 deletions.
1 change: 1 addition & 0 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,2 +1,3 @@
# OpenShift as a Service Documentation

Docs Home for OpenShift as a Service (OaaS) Offering
52 changes: 52 additions & 0 deletions content/upgrades.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,52 @@
# OpenShift Upgrades

Stakater will perform one major OpenShift version upgrade per day per cluster.

Upgrades of test and staging clusters will typically be done first before production clusters.

The time it takes to upgrade the cluster depends on the number of nodes present on the cluster, typically ranging from 2 to 3 hours if no issues arise. However, in case any issues occur, the upgrade process may extend up to 6 hours.

The estimated times assume Stakater has cluster-admin access to the clusters.

Physical presence will only be required if any on-prem machines need to be restarted.

## Pre-Upgrade Steps

1. Run the following command on the cluster and share its output with Stakater:

```sh
oc get apirequestcount
```

1. Verify all nodes are in the ready state
1. Confirm that all cluster operators are in the ready state
1. Check the health of PVs (Persistent Volumes) and PVCs (Persistent Volume Claims) to ensure:
1. All PVs and PVCs are mounted
1. None of the PVs and PVCs are unmounted
1. None of the PVs and PVCs are stuck in the terminating state
1. No abnormal configurations exist
1. Verify the health of machineConfigPools:
1. Ensure `MACHINECOUNT` equals `READYMACHINECOUNT`
1. Ensure there are no machines stuck in `UPDATEDMACHINECOUNT` or `DEGRADEDMACHINECOUNT`
1. Check the health of the etcd cluster
1. Confirm that no critical alerts exist on the cluster
1. The Customer needs to take a backup of the etcd cluster to ensure data safety in case of any issues during the upgrade process, see the official [`Backing up etcd` Red Hat documentation for the applicable version](https://docs.openshift.com/container-platform/4.15/backup_and_restore/control_plane_backup_and_restore/backing-up-etcd.html)

## Upgrade Steps

1. Start the upgrade from the cluster console
1. Monitor the upgrade process closely for any errors or warnings
1. If any error occurs or manual intervention is required, Stakater will address it promptly

## Post-Upgrade Steps

1. Perform all upgrade steps again to ensure the stability of the cluster
1. Verify that all nodes are in the ready state after the upgrade
1. Confirm that all cluster operators are functioning properly post-upgrade
1. Check the health of PVs and PVCs again to ensure successful migration and mounting
1. Verify the health of `machineConfigPools` and ensure the machine counts are as expected
1. Conduct tests or checks to ensure the etcd cluster is functioning correctly after the upgrade
1. Ensure all applications and services are running as expected on the upgraded cluster
1. Document any issues encountered during the upgrade process and their resolutions
1. Update the system documentation with details of the upgrade process, including any configurations or settings changed during the upgrade
1. Provide post-upgrade support to address any lingering issues or concerns
8 changes: 6 additions & 2 deletions mkdocs.yml
Original file line number Diff line number Diff line change
Expand Up @@ -13,6 +13,8 @@ theme:
primary: white
features:
- navigation.tabs
- content.code.copy
- content.code.select

markdown_extensions:
- admonition
Expand All @@ -24,8 +26,8 @@ markdown_extensions:
format: !!python/name:pymdownx.superfences.fence_code_format
- attr_list
- pymdownx.emoji:
emoji_index: !!python/name:material.extensions.emoji.twemoji
emoji_generator: !!python/name:material.extensions.emoji.to_svg
emoji_index: !!python/name:materialx.emoji.twemoji
emoji_generator: !!python/name:materialx.emoji.to_svg

extra:
analytics:
Expand All @@ -50,6 +52,8 @@ nav:
- business-hours.md
- technical-support.md
- updating-clusters.md
- Runbooks:
- upgrades.md
- sla.md
- cost.md
- Stakater Home: https://www.stakater.com/
Expand Down

0 comments on commit 8dabcb8

Please sign in to comment.