
✨ Allow clusters without explicit availability zones #1253

Closed
mkjpryor wants to merge 4 commits

Conversation

mkjpryor
Contributor

@mkjpryor mkjpryor commented Jun 1, 2022

What this PR does / why we need it:

This PR adds the ability to create clusters without explicitly setting availability zones. The use case is discussed in detail in #1252.

Which issue(s) this PR fixes (optional, in fixes #<issue number>(, fixes #<issue_number>, ...) format, will close the issue(s) when PR gets merged):
Fixes #1252

Special notes for your reviewer:

Adds an additional, backwards-compatible flag to the OpenStack cluster spec.
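
(For illustration only: a minimal sketch of what such an optional, backwards-compatible flag might look like on the API type. The ignoreAvailabilityZones name is taken from the discussion further down this thread; the package name and surrounding details are assumptions, not the exact code in this PR.)

  // Sketch only; field and package names are illustrative.
  package v1alpha5

  type OpenStackClusterSpec struct {
      // ... existing fields are unchanged ...

      // IgnoreAvailabilityZones, when true, stops the cluster controller from
      // discovering compute availability zones and publishing them as failure
      // domains. It defaults to false, so existing clusters keep today's
      // behaviour.
      // +optional
      IgnoreAvailabilityZones bool `json:"ignoreAvailabilityZones,omitempty"`
  }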

TODOs:

  • squashed commits
  • if necessary:
    • includes documentation
    • adds unit tests

/hold

@netlify

netlify bot commented Jun 1, 2022

Deploy Preview for kubernetes-sigs-cluster-api-openstack ready!

🔨 Latest commit: b2e18ea
🔍 Latest deploy log: https://app.netlify.com/sites/kubernetes-sigs-cluster-api-openstack/deploys/629f0cf44f715b000aa83001
😎 Deploy Preview: https://deploy-preview-1253--kubernetes-sigs-cluster-api-openstack.netlify.app

@k8s-ci-robot k8s-ci-robot added do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. labels Jun 1, 2022
@k8s-ci-robot
Contributor

Hi @mkjpryor. Thanks for your PR.

I'm waiting for a kubernetes-sigs member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@k8s-ci-robot
Contributor

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: mkjpryor
To complete the pull request process, please assign seanschneeweiss after the PR has been reviewed.
You can assign the PR to them by writing /assign @seanschneeweiss in a comment when ready.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot requested review from apricote and mdbooth June 1, 2022 15:29
@k8s-ci-robot k8s-ci-robot added the size/M Denotes a PR that changes 30-99 lines, ignoring generated files. label Jun 1, 2022
@mkjpryor
Contributor Author

mkjpryor commented Jun 1, 2022

@mdbooth

Turns out it was basically as easy as I thought. This works like a dream for me. Can I get an /ok-to-test please?

@apricote
Member

apricote commented Jun 1, 2022

/ok-to-test

@k8s-ci-robot k8s-ci-robot added ok-to-test Indicates a non-member PR verified by an org member that is safe to test. and removed needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. labels Jun 1, 2022
@mkjpryor
Contributor Author

mkjpryor commented Jun 6, 2022

/retest

@mkjpryor
Contributor Author

mkjpryor commented Jun 7, 2022

@jichenjc

I added some docs for the new option - can you review and suggest changes if required?

Contributor

@mdbooth mdbooth left a comment


It might be an idea to merge the workers part of this fix separately. It's self-contained and very simple.

For the control plane, I worry that options like IgnoreFoo are at risk of polluting the API. I don't yet have a better suggestion, but I would like to fully consider options before adding to the API. I'm very much in favour of the effect of the change, btw, and not necessarily against the proposed API, but I'd like to think it all the way through first.

I have two threads of thought:

  1. We're working round behaviour which is defined by CAPI. We should discuss this with CAPI before making an API change in case they have any better ideas/imminent plans.

  2. We should write down the various ways Failure Domains might be implemented in an OpenStack cloud which are not AZ. What would an API look like which explicitly represented a failure domain in each of these models? Would it be compatible with CAPI? If not, what changes could we make to CAPI to represent more failure domain models?

On that second point, I have in mind something like:

  failureDomainModel: (AvailabilityZone|ServerGroup|None)

instead of IgnoreFailureDomain. This is barely a half-baked thought so read nothing into the detail of it, but the critical difference is that it defines what it is rather than what it is not.
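
(To make the shape of that suggestion concrete, a non-authoritative sketch of an enum-style type; nothing here exists in the codebase and every name is a placeholder.)

  // Placeholder sketch of an enum-valued failure domain model.
  package v1alpha5

  // FailureDomainModel names how failure domains are represented for a cluster.
  // +kubebuilder:validation:Enum=AvailabilityZone;ServerGroup;None
  type FailureDomainModel string

  const (
      // FailureDomainModelAvailabilityZone keeps the current behaviour: each
      // compute availability zone is published as a failure domain.
      FailureDomainModelAvailabilityZone FailureDomainModel = "AvailabilityZone"
      // FailureDomainModelServerGroup would use a Nova server group with an
      // anti-affinity policy instead of availability zones.
      FailureDomainModelServerGroup FailureDomainModel = "ServerGroup"
      // FailureDomainModelNone publishes no failure domains at all.
      FailureDomainModelNone FailureDomainModel = "None"
  )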

@mkjpryor
Contributor Author

@mdbooth

It might be an idea to merge the workers part of this fix separately. It's self-contained and very simple.

Happy to do this.

  1. We're working round behaviour which is defined by CAPI. We should discuss this with CAPI before making an API change in case they have any better ideas/imminent plans.

I'm not actually sure that we are. The InfraCluster.status.failureDomains field is explicitly optional in the spec (see https://cluster-api.sigs.k8s.io/developer/providers/cluster-infrastructure.html#infracluster-resources) and all this flag does is explicitly say that we don't care about AZs.
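
(For reference, the contract field in question is a map of named failure domains, roughly as defined upstream in sigs.k8s.io/cluster-api/api/v1beta1; reproduced here from memory and trimmed, so treat the details as approximate.)

  // Approximate reproduction of the upstream Cluster API types referenced above.
  package v1beta1

  // FailureDomains is a map of failure domain names to their properties.
  type FailureDomains map[string]FailureDomainSpec

  // FailureDomainSpec describes a single failure domain.
  type FailureDomainSpec struct {
      // ControlPlane determines if this failure domain is suitable for use by
      // control plane machines.
      ControlPlane bool `json:"controlPlane,omitempty"`
      // Attributes is a free-form map of attributes an infrastructure provider
      // might use or require.
      Attributes map[string]string `json:"attributes,omitempty"`
  }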

However I don't disagree with your comment that there might be a better approach.

On that second point, I have in mind something like:

  failureDomainModel: (AvailabilityZone|ServerGroup|None)

instead of IgnoreFailureDomain. This is barely a half-baked thought so read nothing into the detail of it, but the critical difference is that it defines what it is rather than what it is not.

This could actually work quite well - the only other thing I can think of is host aggregates.

I guess for my specific case I would use failureDomainModel: ServerGroup, which would put the control plane nodes in a server group with either a soft-anti-affinity or anti-affinity policy (this could be configurable). The way this could work in code is:

  1. OpenStackCluster reconciliation in CAPO creates a server group
  2. The ID of the server group is reported using OpenStackCluster.status.failureDomains with the flag that identifies it as suitable for control plane nodes
  3. This will cause CAPI to create control plane nodes with the server group ID as the failureDomain
  4. CAPO knows to use the failureDomain as the server group when creating the server

What do you think?
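
(A rough sketch of steps 2 and 4 in that flow, just to show the data path; the helper names and the reduced status type are invented for illustration, and only the clusterv1.FailureDomains shape comes from upstream Cluster API.)

  // Illustrative only: invented helpers showing how a server group ID could
  // travel through status.failureDomains and back into server creation.
  package sketch

  import (
      clusterv1 "sigs.k8s.io/cluster-api/api/v1beta1"
  )

  // OpenStackClusterStatus is reduced here to the one field the sketch needs.
  type OpenStackClusterStatus struct {
      FailureDomains clusterv1.FailureDomains `json:"failureDomains,omitempty"`
  }

  // Step 2: after reconciliation has created the server group, publish its ID
  // as a control-plane-capable failure domain so that CAPI hands it back as
  // the Machine's failureDomain.
  func publishServerGroupFailureDomain(status *OpenStackClusterStatus, serverGroupID string) {
      status.FailureDomains = clusterv1.FailureDomains{
          serverGroupID: clusterv1.FailureDomainSpec{ControlPlane: true},
      }
  }

  // Step 4: when building the Nova server, interpret the machine's failure
  // domain as a server group UUID rather than an availability zone.
  func serverGroupFromFailureDomain(failureDomain *string) string {
      if failureDomain == nil {
          return ""
      }
      return *failureDomain
  }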

@mkjpryor
Contributor Author

And I guess failureDomainModel: None would be essentially what I have implemented here as ignoreAvailabilityZones: true.

@mkjpryor
Contributor Author

@mdbooth

What if I change this PR to have failureDomainModel: AvailabilityZone | None instead of the flag, leaving us open for additional modes in the future?

Then submit another PR for #1256 that implements failureDomainModel: ServerGroup.

How does that sound as a plan?
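
(Sketched concretely, and only as an assumption of what the revised API might look like, that plan would swap the boolean flag for an enum-valued spec field that defaults to today's behaviour.)

  // Placeholder sketch of the revised field; names and markers are assumptions.
  package v1alpha5

  type OpenStackClusterSpec struct {
      // ... existing fields are unchanged ...

      // FailureDomainModel selects how failure domains are derived.
      // "AvailabilityZone" preserves current behaviour and "None" publishes no
      // failure domains (the effect of this PR); "ServerGroup" would be added
      // by the follow-up work in #1256.
      // +kubebuilder:validation:Enum=AvailabilityZone;None
      // +kubebuilder:default=AvailabilityZone
      // +optional
      FailureDomainModel string `json:"failureDomainModel,omitempty"`
  }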

@k8s-ci-robot
Contributor

@mkjpryor: PR needs rebase.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@k8s-ci-robot k8s-ci-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Jun 15, 2022
@k8s-triage-robot

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue or PR as fresh with /remove-lifecycle stale
  • Mark this issue or PR as rotten with /lifecycle rotten
  • Close this issue or PR with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Sep 13, 2022
@jichenjc
Contributor

/remove-lifecycle stale

@k8s-ci-robot k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Sep 14, 2022
@k8s-triage-robot

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue or PR as fresh with /remove-lifecycle stale
  • Mark this issue or PR as rotten with /lifecycle rotten
  • Close this issue or PR with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Dec 13, 2022
@k8s-triage-robot

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue or PR as fresh with /remove-lifecycle rotten
  • Close this issue or PR with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle rotten

@k8s-ci-robot k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Jan 12, 2023
@k8s-triage-robot

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages PRs according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the PR is closed

You can:

  • Reopen this PR with /reopen
  • Mark this PR as fresh with /remove-lifecycle rotten
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/close

@k8s-ci-robot
Contributor

@k8s-triage-robot: Closed this PR.


Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Labels
  • cncf-cla: yes - Indicates the PR's author has signed the CNCF CLA.
  • do-not-merge/hold - Indicates that a PR should not merge because someone has issued a /hold command.
  • lifecycle/rotten - Denotes an issue or PR that has aged beyond stale and will be auto-closed.
  • needs-rebase - Indicates a PR cannot be merged because it has merge conflicts with HEAD.
  • ok-to-test - Indicates a non-member PR verified by an org member that is safe to test.
  • size/M - Denotes a PR that changes 30-99 lines, ignoring generated files.
Development

Successfully merging this pull request may close these issues.

Allow clusters without explicit availability zones
7 participants