operator v1: ignore maint. mode errors if "not found" #329

Open
wants to merge 1 commit into
base: main

birdayz
Contributor

@birdayz birdayz commented Nov 26, 2024

This is a rare case that does not occur on the happy path. It requires both of the following:

  • A Pod is running whose broker has already been decommissioned.
  • A change is pending that requires a restart (the condition to roll the Pod is set on the Pod), e.g. a cluster property has drifted and needs a restart.

The code then tries to enable maintenance mode, which fails with 404: node not found, because the broker is already decommissioned. Since this results in an error and a requeue, the code never proceeds to the scaling logic that would remove the Pod. In effect, a pending Pod roll can prevent decommission from happening.

Since no broker is hosted on this Pod (it is already decommissioned), there is no point in enabling maintenance mode, so that step is skipped if the broker is not found.
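
For illustration only, here is a minimal Go sketch of the skip-on-not-found behaviour described above. It is not the code from this PR: `ErrBrokerNotFound`, `MaintenanceEnabler`, `enableMaintenanceIfPresent`, and `fakeAPI` are all hypothetical names, and the real admin client may report a 404 differently.

```go
package main

import (
	"errors"
	"fmt"
)

// ErrBrokerNotFound is a hypothetical sentinel standing in for the
// "404: node not found" response of the admin API.
var ErrBrokerNotFound = errors.New("node not found")

// MaintenanceEnabler is a hypothetical, minimal view of the admin client.
type MaintenanceEnabler interface {
	EnableMaintenanceMode(brokerID int) error
}

// enableMaintenanceIfPresent tries to enable maintenance mode for a broker.
// A "not found" error is treated as "nothing to do" so the caller can carry
// on (for example, to the scaling logic). Any other error is returned and
// would normally cause a requeue.
func enableMaintenanceIfPresent(api MaintenanceEnabler, brokerID int) (bool, error) {
	err := api.EnableMaintenanceMode(brokerID)
	switch {
	case err == nil:
		return true, nil
	case errors.Is(err, ErrBrokerNotFound):
		// Broker is already decommissioned: skip maintenance mode.
		return false, nil
	default:
		return false, fmt.Errorf("enabling maintenance mode on broker %d: %w", brokerID, err)
	}
}

// fakeAPI is a test double that answers with a fixed HTTP-like status.
type fakeAPI struct{ status int }

func (f fakeAPI) EnableMaintenanceMode(brokerID int) error {
	switch f.status {
	case 200:
		return nil
	case 404:
		return fmt.Errorf("admin API returned 404: %w", ErrBrokerNotFound)
	default:
		return fmt.Errorf("admin API returned %d", f.status)
	}
}
```

With this shape, `enableMaintenanceIfPresent(fakeAPI{status: 404}, 3)` returns `false, nil`, so the caller is not forced into the error/requeue path, while a status such as 500 still surfaces as an error.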
