Add cluster sizing quorum warning and link #2583

Open: wants to merge 2 commits into base: main

Conversation

mbrat2005
Contributor

Add an [!IMPORTANT] note and link to the Server Requirements section of the system requirements article to raise the visibility of how node count and quorum interact to affect availability. For example, a two-node cluster will be brought offline by a node failure 50% of the time.
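
The 50% figure can be reproduced with a simplified vote-counting model. The sketch below is illustrative only: it assumes a strict-majority rule, assumes that a two-node cluster without a witness ends up with a single voting node (the state dynamic quorum is understood to produce), and treats every node as equally likely to fail. The function names and vote lists are hypothetical and are not part of any Windows Server or Azure Stack HCI API.

```python
from typing import Sequence


def has_quorum(votes_present: int, total_votes: int) -> bool:
    """Strict majority: more than half of all votes must still be present."""
    return 2 * votes_present > total_votes


def survival_rate(node_votes: Sequence[int], witness_votes: int = 0) -> float:
    """Fraction of single-node failures that leave the cluster with quorum.

    Assumptions (illustrative only): each node is equally likely to fail,
    and the witness, if any, stays reachable.
    """
    total = sum(node_votes) + witness_votes
    survived = sum(
        has_quorum(total - vote, total)  # votes left after this node fails
        for vote in node_votes
    )
    return survived / len(node_votes)


# Two nodes, no witness, one node's vote removed (assumed dynamic quorum state).
two_node = survival_rate([1, 0])
# Two nodes plus one witness vote.
two_node_witness = survival_rate([1, 1], witness_votes=1)
# Three nodes, no witness.
three_node = survival_rate([1, 1, 1])

print(two_node, two_node_witness, three_node)  # prints: 0.5 1.0 1.0
```

In this model, adding a witness vote lets the two-node case survive any single node failure. The sketch covers cluster quorum only and says nothing about Storage Spaces Direct pool quorum, which is evaluated separately.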

@prmerger-automator
Contributor

@mbrat2005: Thanks for your contribution! The author(s) have been notified to review your proposed change.

@learn-build-service-prod
Contributor

Learn Build status updates of commit c34f88c:

✅ Validation status: passed

| File | Status | Preview URL | Details |
| --- | --- | --- | --- |
| azure-stack/hci/concepts/system-requirements.md | ✅ Succeeded | | |

For more details, please refer to the build report.

@learn-build-service-prod
Contributor

Learn Build status updates of commit bf4174f:

✅ Validation status: passed

| File | Status | Preview URL | Details |
| --- | --- | --- | --- |
| azure-stack/hci/concepts/system-requirements.md | ✅ Succeeded | | |

For more details, please refer to the build report.

@Jak-MS
Contributor

Jak-MS commented Sep 1, 2023

@JasonGerend

  • Can you review this PR?
  • IMPORTANT: When this content is ready to merge, you must add #sign-off in a comment or the approval may get overlooked.

#label:"aq-pr-triaged"
@MicrosoftDocs/public-repo-pr-review-team

@prmerger-automator bot added the aq-pr-triaged label (Tracking label for the PR review team) on Sep 1, 2023
Contributor

@JasonGerend left a comment

Thanks for making these helpful edits! I've suggested some changes to the wording and positioning of the quorum reference. You should be able to just accept my suggestions, and then we can approve the PR. Thanks again! :)

@@ -72,6 +72,9 @@ Here are the Azure requirements for your Azure Stack HCI cluster:

A standard Azure Stack HCI cluster requires a minimum of one server and a maximum of 16 servers.

> [!IMPORTANT]
Contributor

This would be better as a bullet rather than an Important reader element. I'll make a recommendation below.

@@ -72,6 +72,9 @@ Here are the Azure requirements for your Azure Stack HCI cluster:

A standard Azure Stack HCI cluster requires a minimum of one server and a maximum of 16 servers.

> [!IMPORTANT]
Contributor

Suggested change
> [!IMPORTANT]

@@ -72,6 +72,9 @@ Here are the Azure requirements for your Azure Stack HCI cluster:

A standard Azure Stack HCI cluster requires a minimum of one server and a maximum of 16 servers.

> [!IMPORTANT]
> When designing the number of nodes in your cluster, make sure to consider how both [Windows Failover Cluster and Storage Spaces Direct quorum](quorum.md) will impact your cluster's availability.
Contributor

Suggested change
> When designing the number of nodes in your cluster, make sure to consider how both [Windows Failover Cluster and Storage Spaces Direct quorum](quorum.md) will impact your cluster's availability.

@@ -72,6 +72,9 @@ Here are the Azure requirements for your Azure Stack HCI cluster:

A standard Azure Stack HCI cluster requires a minimum of one server and a maximum of 16 servers.
Contributor

Suggested change
A standard Azure Stack HCI cluster requires a minimum of one server and a maximum of 16 servers.

@@ -72,6 +72,9 @@ Here are the Azure requirements for your Azure Stack HCI cluster:

A standard Azure Stack HCI cluster requires a minimum of one server and a maximum of 16 servers.

> [!IMPORTANT]
> When designing the number of nodes in your cluster, make sure to consider how both [Windows Failover Cluster and Storage Spaces Direct quorum](quorum.md) will impact your cluster's availability.

Keep the following in mind for various types of Azure Stack HCI deployments:

Contributor

Suggested change
- An Azure Stack HCI system requires a minimum of one server and a maximum of 16 servers. If you're using 2-4 servers, we recommend using a [witness](quorum.md) to help determine which server is most up to date if there is contention.

Contributor Author

Thanks @JasonGerend!

The trigger for opening this PR was a customer who did not understand that Windows Failover Cluster (WFC) and Storage Spaces Direct (S2D) manage quorum separately. A Windows Failover Cluster allows you to configure an external witness resource for maintaining quorum, whereas S2D does not. I worry that changing the wording as above will result in customers configuring WFC quorum without considering how a node failure could impact S2D availability. I couldn't think of a concise way to spell this out, so I fell back on linking to the quorum article.

Another way to address this could be a page dedicated to cluster sizing, multiple clusters, and related considerations, which customers also seem to be asking for.

Contributor

Good feedback on that table - I agree it's a problem and that we're not yet clear enough on pool quorum. I'm asking some of the PMs for more info and will keep you posted! :)

Contributor Author

@JasonGerend Did you have any luck getting more information from the PG?

Contributor Author

@JasonGerend Have there been any updates on this conversation from the PG?
