-
Notifications
You must be signed in to change notification settings - Fork 444
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] libovsdb keeps reconnecting to NorthDB every 4 seconds #4465
Comments
Could you please upgrade kube-ovn to the latest v1.12.22 and then set the kube-ovn-controller image tag to v1.12.23-x86? You also need to add two arguments to kube-ovn-controller after setting the image tag: args:
- /kube-ovn/start-controller.sh
- --ovsdb-con-timeout=30 # The seconds to wait ovsdb connect timeout
- --ovsdb-inactivity-timeout=180 # The seconds to wait ovsdb inactivity check timeout |
Yes, I tested it and found that the reason for this problem is that the monitor timeout is too short during reconnection. const timeout = 3 * time.Second
...
options := []client.Option{
client.WithReconnect(timeout, &backoff.ConstantBackOff{Interval: time.Second}),
client.WithLeaderOnly(true),
client.WithLogger(&logger),
} The timeout here is hard-coded to be 3 seconds. |
@cmdy What's your final timeout setting? |
I set 12 seconds under 10w LogicalRouterPolicy, and the program can reconnect normally and continue to work, but I think 30s is a reasonable value, Because our ultimate goal is 3k nodes, in this scenario, 100 subnets will have 30w+ LogicalRouterPolicies. However, I have not yet tested this scenario, because there are other issues that cause adding nodes to be too slow (I will open a new issue for this) . |
Kube-OVN Version
v1.12.21
Kubernetes Version
v1.28.11
Operation-system/Kernel Version
"CentOS Linux 7 (Core)"
6.6.35-2406.el7.bzl.x86_64
Description
libovsdb keeps reconnecting to OVN_Northbound every 4 seconds, Causes not connected.
Database tradition is normal
Steps To Reproduce
Large number of pod deletes
8w pods
Current Behavior
unable to connect to OVN_Northbound,delete logical router causes not connected
Expected Behavior
can connect to OVN_Northbound normally
The text was updated successfully, but these errors were encountered: