The csi-tencentcloud-cbs process occupies port 9099, which Calico uses for its health check, so networking on the node breaks.
I collected the following logs.
calico-node:
2022-01-21 07:33:49.433 [INFO][58] ipsets.go 304: Finished resync family="inet" numInconsistenciesFound=0 resyncDuration=1.018007ms
2022-01-21 07:33:49.433 [INFO][58] int_dataplane.go 765: Finished applying updates to dataplane. msecToApply=1.285159
2022-01-21 07:33:50.015 [ERROR][58] health.go 196: Health endpoint failed, trying to restart it... error=listen tcp 127.0.0.1:9099: bind: address already in use
2022-01-21 07:33:51.016 [ERROR][58] health.go 196: Health endpoint failed, trying to restart it... error=listen tcp 127.0.0.1:9099: bind: address already in use
2022-01-21 07:33:52.016 [ERROR][58] health.go 196: Health endpoint failed, trying to restart it... error=listen tcp 127.0.0.1:9099: bind: address already in use
2022-01-21 07:33:53.016 [ERROR][58] health.go 196: Health endpoint failed, trying to restart it... error=listen tcp 127.0.0.1:9099: bind: address already in use
2022-01-21 07:33:54.017 [ERROR][58] health.go 196: Health endpoint failed, trying to restart it... error=listen tcp 127.0.0.1:9099: bind: address already in use
2022-01-21 07:33:55.017 [ERROR][58] health.go 196: Health endpoint failed, trying to restart it... error=listen tcp 127.0.0.1:9099: bind: address already in use
2022-01-21 07:33:56.017 [ERROR][58] health.go 196: Health endpoint failed, trying to restart it... error=listen tcp 127.0.0.1:9099: bind: address already in use
Finding the process that occupies port 9099:
[root@k8s-dev-node10 ~]# netstat -nlpt
Active Internet connections (only servers)
Proto Recv-Q Send-Q Local Address           Foreign Address         State       PID/Program name
tcp        0      0 0.0.0.0:22              0.0.0.0:*               LISTEN      1437/sshd
tcp        0      0 127.0.0.1:42651         0.0.0.0:*               LISTEN      1311/containerd
tcp        0      0 127.0.0.1:10248         0.0.0.0:*               LISTEN      8271/kubelet
tcp        0      0 127.0.0.1:10249         0.0.0.0:*               LISTEN      21454/kube-proxy
tcp        0      0 0.0.0.0:6379            0.0.0.0:*               LISTEN      3472/redis-server
tcp        0      0 0.0.0.0:111             0.0.0.0:*               LISTEN      939/rpcbind
tcp        0      0 0.0.0.0:179             0.0.0.0:*               LISTEN      6513/bird
tcp6       0      0 :::9092                 :::*                    LISTEN      29090/java
tcp6       0      0 :::10250                :::*                    LISTEN      8271/kubelet
tcp6       0      0 :::9099                 :::*                    LISTEN      2564/csi-tencentclo
tcp6       0      0 :::6379                 :::*                    LISTEN      3472/redis-server
tcp6       0      0 :::10255                :::*                    LISTEN      8271/kubelet
tcp6       0      0 :::111                  :::*                    LISTEN      939/rpcbind
tcp6       0      0 :::10256                :::*                    LISTEN      21454/kube-proxy
tcp6       0      0 :::45813                :::*                    LISTEN      29090/java
[root@k8s-dev-node10 ~]# ps aux|grep 2564
root      2564  0.0  0.2 736272 17024 ?        Ssl  15:27   0:00 /csi-tencentcloud-cbs --v=5 --logtostderr=true --endpoint=unix:///csi/csi.sock
root      7360  0.0  0.0 112712   956 pts/0    D+   15:34   0:00 grep --color=auto 2564
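The manual scan of the netstat listing above can be narrowed to a single port. The following is a minimal sketch, not part of the original report: `port_owner` is a hypothetical helper name, and it assumes the column layout of `netstat -nlpt` shown above (local address in column 4, state in column 6, PID/program in column 7).

```shell
# Hypothetical helper: read `netstat -nlpt` output on stdin and print the
# PID/Program name of every TCP listener bound to the given port.
port_owner() {
  awk -v port="$1" '
    $6 == "LISTEN" {
      # The port is the last colon-separated field of the local address,
      # which covers both 127.0.0.1:9099 and :::9099.
      n = split($4, parts, ":")
      if (parts[n] == port) print $7
    }'
}

# Usage on the affected node (requires root to see program names):
# netstat -nlpt | port_owner 9099
```

Fed the listing from this report, the helper would print only the `2564/csi-tencentclo` entry, which is the process that `ps` then identifies as /csi-tencentcloud-cbs.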
Solution: add the metric_port parameter to the startup arguments.
Where should this be changed?
https://github.com/TencentCloud/kubernetes-csi-tencentcloud/blob/master/deploy/cbs/kubernetes/csi-node-new.yaml#L69
Add the metric_port startup parameter there.
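Before settling on a value for metric_port, it may help to confirm that the chosen port is not already taken on the node. The sketch below is an illustration rather than anything from the project: `port_is_free` is a hypothetical helper, 9100 is an arbitrary example port, and the `--metric_port=<port>` spelling is an assumption based on the style of the other flags in the `ps` output above.

```shell
# Hypothetical helper: read `netstat -nlpt` output on stdin and succeed
# (exit status 0) only when no TCP listener uses the given port.
port_is_free() {
  ! awk -v port="$1" '
    $6 == "LISTEN" {
      n = split($4, parts, ":")
      if (parts[n] == port) found = 1
    }
    # awk exits 0 when a listener was found; the leading "!" inverts that.
    END { exit !found }'
}

# Usage on the node; 9100 and the flag spelling are assumptions:
# netstat -nlpt | port_is_free 9100 && echo "candidate arg: --metric_port=9100"
```

A port that passes this check on one node can still be taken on another, so the check is worth repeating on each node that runs the CSI node plugin.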