Open
Description
Several pods failed to start when the server was restarted.
[root@iZj6cd6w5lquaue689jii0Z ~]# microk8s.kubectl get podsNAME READY STATUS RESTARTS AGE
stackstorm-job-st2-apikey-load-js2vt 0/1 Completed 0 27h
stackstorm-job-st2-key-load-rwmnw 0/1 Completed 0 27h
stackstorm-job-st2-register-content-l4hhf 0/1 Completed 0 27h
stackstorm-mongodb-0 1/1 Running 1 (96m ago) 27h
stackstorm-st2client-658d9d4645-6x8mm 1/1 Running 1 (96m ago) 27h
stackstorm-mongodb-2 1/1 Running 1 (96m ago) 27h
stackstorm-mongodb-1 1/1 Running 1 (96m ago) 27h
stackstorm-rabbitmq-0 1/1 Running 1 (96m ago) 27h
stackstorm-rabbitmq-1 1/1 Running 1 (96m ago) 27h
stackstorm-st2stream-6d75b4cff7-7w6br 1/1 Running 1 (96m ago) 27h
stackstorm-st2rulesengine-56456f8986-dd2lk 1/1 Running 1 (96m ago) 27h
stackstorm-st2notifier-7f76b6d97b-qmndn 1/1 Running 1 (96m ago) 27h
stackstorm-st2timersengine-5f7fd7585d-g7n4h 1/1 Running 1 (96m ago) 27h
stackstorm-st2notifier-7f76b6d97b-r6hhf 1/1 Running 1 (96m ago) 27h
stackstorm-st2stream-6d75b4cff7-2zt57 1/1 Running 1 (96m ago) 27h
stackstorm-st2rulesengine-56456f8986-p774j 1/1 Running 1 (96m ago) 27h
stackstorm-st2workflowengine-756db5df96-7r8m5 1/1 Running 1 (96m ago) 27h
stackstorm-st2actionrunner-76d5d6446b-4q2f8 1/1 Running 1 (96m ago) 27h
stackstorm-st2actionrunner-76d5d6446b-86xtf 1/1 Running 1 (96m ago) 27h
stackstorm-st2workflowengine-756db5df96-p985g 1/1 Running 1 (96m ago) 27h
stackstorm-st2actionrunner-76d5d6446b-lbppb 1/1 Running 1 (96m ago) 27h
stackstorm-st2auth-7f887c8b64-s65rn 1/1 Running 1 (96m ago) 27h
stackstorm-st2garbagecollector-59948bf897-97zq5 1/1 Running 1 (96m ago) 27h
stackstorm-st2actionrunner-76d5d6446b-frb9t 1/1 Running 1 (96m ago) 27h
stackstorm-st2auth-7f887c8b64-99w86 1/1 Running 1 (96m ago) 27h
stackstorm-st2actionrunner-76d5d6446b-vh2h9 1/1 Running 1 (96m ago) 27h
stackstorm-st2web-5cc49ccffc-m2jrz 1/1 Running 3 (95m ago) 27h
stackstorm-rabbitmq-2 1/1 Running 1 (96m ago) 27h
stackstorm-redis-node-2 2/2 Running 4 (95m ago) 27h
stackstorm-st2web-5cc49ccffc-tbnjc 1/1 Running 2 (95m ago) 27h
stackstorm-st2sensorcontainer-7fc7fcdd-9r5lg 1/1 Running 2 (94m ago) 27h
stackstorm-st2api-6769588f8c-mmpsq 0/1 CrashLoopBackOff 22 (4m46s ago) 27h
stackstorm-st2scheduler-585c948ccd-42b85 0/1 CrashLoopBackOff 22 (4m46s ago) 27h
stackstorm-redis-node-0 1/2 CrashLoopBackOff 24 (2m47s ago) 27h
stackstorm-redis-node-1 1/2 CrashLoopBackOff 24 (111s ago) 27h
stackstorm-st2api-6769588f8c-w7bhz 0/1 Error 23 (5m19s ago) 27h
stackstorm-st2scheduler-585c948ccd-h9br2 1/1 Running 23 (5m12s ago) 27h
These are the pods in question:
stackstorm-st2api-6769588f8c-mmpsq 0/1 CrashLoopBackOff 22 (4m46s ago) 27h
stackstorm-st2scheduler-585c948ccd-42b85 0/1 CrashLoopBackOff 22 (4m46s ago) 27h
stackstorm-redis-node-0 1/2 CrashLoopBackOff 24 (2m47s ago) 27h
stackstorm-redis-node-1 1/2 CrashLoopBackOff 24 (111s ago) 27h
stackstorm-st2api-6769588f8c-w7bhz 0/1 Error 23 (5m19s ago) 27h
stackstorm-st2scheduler-585c948ccd-h9br2 1/1 Running 23 (5m12s ago) 27h
view pod the error log :
amqp.exceptions.NotFound: Queue.declare: (404) NOT_FOUND - home node 'rabbit@stackstorm-rabbitmq-0.stackstorm-rabbitmq-headless.default.svc.cluster.local' of durable queue 'st2.preinit' in vhost '/' is down or inaccessible
2022-02-05 12:23:53,149 INFO [-] (PID=1) ST2 API is serving on http://0.0.0.0:9101.
2022-02-05 12:23:53,149 INFO [-] Creating st2api: StackStorm v3.6.0 as OpenAPI app.
2022-02-05 12:23:53,518 ERROR [-] (PID=1) ST2 API quit due to exception.
Traceback (most recent call last):
File "/opt/stackstorm/st2/lib/python3.6/site-packages/tooz/drivers/redis.py", line 42, in _translate_failures
yield
File "/opt/stackstorm/st2/lib/python3.6/site-packages/tooz/drivers/redis.py", line 450, in _start
self._server_info = self._client.info()
File "/opt/stackstorm/st2/lib/python3.6/site-packages/redis/client.py", line 1304, in info
return self.execute_command('INFO')
File "/opt/stackstorm/st2/lib/python3.6/site-packages/redis/client.py", line 898, in execute_command
conn = self.connection or pool.get_connection(command_name, **options)
File "/opt/stackstorm/st2/lib/python3.6/site-packages/redis/connection.py", line 1192, in get_connection
connection.connect()
File "/opt/stackstorm/st2/lib/python3.6/site-packages/redis/sentinel.py", line 44, in connect
self.connect_to(self.connection_pool.get_master_address())
File "/opt/stackstorm/st2/lib/python3.6/site-packages/redis/sentinel.py", line 107, in get_master_address
self.service_name)
File "/opt/stackstorm/st2/lib/python3.6/site-packages/redis/sentinel.py", line 219, in discover_master
raise MasterNotFoundError("No master found for %r" % (service_name,))
redis.sentinel.MasterNotFoundError: No master found for 'mymaster'
How can I solve this problem?????