-
Notifications
You must be signed in to change notification settings - Fork 349
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ovnkube-node crash loops when trying to restart #4654
Comments
looks like it might be related to recent changes that were done |
It looks to me like this is fixed by #4652 I deleted ovnk pods multiple times and not seeing the issue. Feel free to reopen if it happens again. |
I just tried on master:
And I get the same crash. I started kind with:
Then I deleted the ovnkube-node pod corresponding to ovn-worker:
ovnkube fails in the same way:
I'm not sure it's relevant but I'm using podman on that machine. |
I couldn't replicate the failure on main. Using docker. |
I couldn't replicate the failure on main with docker either. Originally I was using podman, will try again. |
@martinkennelly I moved back to using podman (just removed docker and installed podman and podman-docker) and now I get the same crash loop when deleting the ovnkube-node pod. |
This issue is stale because it has been open 60 days with no activity. Remove stale label or comment or this will be closed in 5 days. |
This issue was closed because it has been stalled for 5 days with no activity. |
What happened?
On a freshly started kind cluster (multi-network and network segmentation enabled):
Delete an ovnkube-node pod:
The new ovnkube-node pod fails and crash loops because it fails to start the node network controller:
Logs of the ovnkube-node pod (full logs attached):
ovnk-logs.txt
What did you expect to happen?
The new ovnkube-node pod should come up without issues.
How can we reproduce it (as minimally and precisely as possible)?
Described above.
Anything else we need to know?
No response
OVN-Kubernetes version
Kubernetes version
OVN version
OVS version
Platform
Is it baremetal? GCP? AWS? Azure?
OS version
Install tools
Container runtime (CRI) and version (if applicable)
The text was updated successfully, but these errors were encountered: