gardener-node-agent
might end in a crash-loop in case of breaking changes affecting its own configuration
#11025
Labels
area/robustness
Robustness, reliability, resilience related
kind/bug
Bug
triage/accepted
Indicates an issue or PR is ready to be actively worked on.
How to categorize this issue?
/area robustness
/kind bug
What happened:
gardener-node-agent
updates its own binary and its configuration. Usually the config changes are applied before the binary is updated because of their sequence in OperatingSystemConfig.If there are breaking changes in
gardener-node-agent
(like adding a feature gate) it might end in a crash-loop in the following case.NodeAgentAuthorizer
in the concrete case).In this case the configuration of GNA already includes the feature gate parameter while the old GNA binary does not know it and refuses to start. Manual intervention is required to solve this problem.
What you expected to happen:
gardener-node-agent
should be resilient in this update case.How to reproduce it (as minimally and precisely as possible):
Anything else we need to know?:
The issue could be solved by adding a version suffix to the GNA config files and let GNA load configs of its own version only.
Environment:
kubectl version
):The text was updated successfully, but these errors were encountered: