[main] Always reset failure counts on existing plans #48109

Merged: 1 commit, Nov 26, 2024

HarrisonWAffel
Contributor

Issue: #47652

Problem

In all previously released versions of Rancher, the rancher-wins service does not have its underlying binary automatically upgraded to the latest version after the Rancher server is upgraded. This behavior differs from Linux and forces users to reprovision their Windows nodes to receive the latest enhancements in the system-agent (which is embedded in rancher-wins). As a result, Rancher has accumulated unnecessary conditional logic to account for behavior differences between nodes joined to the same cluster.

While implementing #46620 for Windows nodes, an issue was encountered where plan failures were not accurately reported, because outdated versions of rancher-wins did not contain the necessary system-agent changes. To address this, Rancher was updated to conditionally update the failure-count property on Windows plans so that it is only set for newly provisioned nodes running the latest version of rancher-wins.

This change was effectively a stopgap and should be considered tech debt.

Solution

Revert the changes made in #47651. The changes raised in rancher/wins#260 will update the Windows SUC image to automatically upgrade the underlying wins.exe binary used by rancher-wins, ensuring that the proper version of the system-agent is installed on all Windows nodes.
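
To illustrate the behavior change, here is a minimal sketch contrasting the stopgap logic with the reverted, unconditional logic. It is not the actual Rancher code: the `nodePlan` type, its fields, and both function names are hypothetical, and the values 1 and 5 are taken only from the testing steps in this PR.

```go
package main

import "fmt"

// Assumption: these values mirror the testing steps in this PR, where an
// existing Windows plan shows a failure count of 1 before the upgrade and
// 5 afterwards. The constant names are hypothetical.
const (
	legacyWindowsFailureCount = 1
	currentFailureCount       = 5
)

// nodePlan is a hypothetical, simplified stand-in for a node plan.
// It is not a Rancher type.
type nodePlan struct {
	Windows          bool
	NewlyProvisioned bool
	FailureCount     int
}

// stopgapFailureCount models the conditional behavior being reverted:
// existing Windows plans keep their old value, and only newly provisioned
// nodes (or non-Windows nodes) receive the current one.
func stopgapFailureCount(p nodePlan) int {
	if p.Windows && !p.NewlyProvisioned {
		return p.FailureCount
	}
	return currentFailureCount
}

// alwaysResetFailureCount models the behavior after this PR: the value is
// reset on every plan, regardless of operating system or provisioning time.
func alwaysResetFailureCount(_ nodePlan) int {
	return currentFailureCount
}

func main() {
	existing := nodePlan{
		Windows:          true,
		NewlyProvisioned: false,
		FailureCount:     legacyWindowsFailureCount,
	}
	fmt.Println("stopgap:", stopgapFailureCount(existing))
	fmt.Println("always reset:", alwaysResetFailureCount(existing))
}
```

The unconditional version is only safe once every Windows node runs a rancher-wins binary that contains the necessary system-agent changes, which is what the automatic wins.exe upgrade in rancher/wins#260 is meant to guarantee.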

Testing

  • Provision a Windows cluster using a commit associated with a released version of Rancher, such as 2.9.3
  • Ensure that the Windows node plan failure count is set to 1
  • Upgrade the Rancher server to a commit running a newer version of Rancher, such as 2.9.4 or 2.10
  • Ensure that the Windows node plan failure count is updated to 5

Engineering Testing

Manual Testing

I've done the above

Automated Testing

QA Testing Considerations

Regressions Considerations

@HarrisonWAffel HarrisonWAffel requested a review from a team November 15, 2024 18:22
@HarrisonWAffel HarrisonWAffel requested a review from a team November 15, 2024 20:43
@snasovich snasovich merged commit 6db452e into rancher:main Nov 26, 2024
18 checks passed
4 participants