Skip to content

Failed to deploy stackStorm HA through MicroK8S #281

Open
@simonli866

Description

@simonli866

Several pods failed to start when the server was restarted.

[root@iZj6cd6w5lquaue689jii0Z ~]# microk8s.kubectl get podsNAME                                              READY   STATUS             RESTARTS         AGE
stackstorm-job-st2-apikey-load-js2vt              0/1     Completed          0                27h
stackstorm-job-st2-key-load-rwmnw                 0/1     Completed          0                27h
stackstorm-job-st2-register-content-l4hhf         0/1     Completed          0                27h
stackstorm-mongodb-0                              1/1     Running            1 (96m ago)      27h
stackstorm-st2client-658d9d4645-6x8mm             1/1     Running            1 (96m ago)      27h
stackstorm-mongodb-2                              1/1     Running            1 (96m ago)      27h
stackstorm-mongodb-1                              1/1     Running            1 (96m ago)      27h
stackstorm-rabbitmq-0                             1/1     Running            1 (96m ago)      27h
stackstorm-rabbitmq-1                             1/1     Running            1 (96m ago)      27h
stackstorm-st2stream-6d75b4cff7-7w6br             1/1     Running            1 (96m ago)      27h
stackstorm-st2rulesengine-56456f8986-dd2lk        1/1     Running            1 (96m ago)      27h
stackstorm-st2notifier-7f76b6d97b-qmndn           1/1     Running            1 (96m ago)      27h
stackstorm-st2timersengine-5f7fd7585d-g7n4h       1/1     Running            1 (96m ago)      27h
stackstorm-st2notifier-7f76b6d97b-r6hhf           1/1     Running            1 (96m ago)      27h
stackstorm-st2stream-6d75b4cff7-2zt57             1/1     Running            1 (96m ago)      27h
stackstorm-st2rulesengine-56456f8986-p774j        1/1     Running            1 (96m ago)      27h
stackstorm-st2workflowengine-756db5df96-7r8m5     1/1     Running            1 (96m ago)      27h
stackstorm-st2actionrunner-76d5d6446b-4q2f8       1/1     Running            1 (96m ago)      27h
stackstorm-st2actionrunner-76d5d6446b-86xtf       1/1     Running            1 (96m ago)      27h
stackstorm-st2workflowengine-756db5df96-p985g     1/1     Running            1 (96m ago)      27h
stackstorm-st2actionrunner-76d5d6446b-lbppb       1/1     Running            1 (96m ago)      27h
stackstorm-st2auth-7f887c8b64-s65rn               1/1     Running            1 (96m ago)      27h
stackstorm-st2garbagecollector-59948bf897-97zq5   1/1     Running            1 (96m ago)      27h
stackstorm-st2actionrunner-76d5d6446b-frb9t       1/1     Running            1 (96m ago)      27h
stackstorm-st2auth-7f887c8b64-99w86               1/1     Running            1 (96m ago)      27h
stackstorm-st2actionrunner-76d5d6446b-vh2h9       1/1     Running            1 (96m ago)      27h
stackstorm-st2web-5cc49ccffc-m2jrz                1/1     Running            3 (95m ago)      27h
stackstorm-rabbitmq-2                             1/1     Running            1 (96m ago)      27h
stackstorm-redis-node-2                           2/2     Running            4 (95m ago)      27h
stackstorm-st2web-5cc49ccffc-tbnjc                1/1     Running            2 (95m ago)      27h
stackstorm-st2sensorcontainer-7fc7fcdd-9r5lg      1/1     Running            2 (94m ago)      27h
stackstorm-st2api-6769588f8c-mmpsq                0/1     CrashLoopBackOff   22 (4m46s ago)   27h
stackstorm-st2scheduler-585c948ccd-42b85          0/1     CrashLoopBackOff   22 (4m46s ago)   27h
stackstorm-redis-node-0                           1/2     CrashLoopBackOff   24 (2m47s ago)   27h
stackstorm-redis-node-1                           1/2     CrashLoopBackOff   24 (111s ago)    27h
stackstorm-st2api-6769588f8c-w7bhz                0/1     Error              23 (5m19s ago)   27h
stackstorm-st2scheduler-585c948ccd-h9br2          1/1     Running            23 (5m12s ago)   27h

These are the pods in question:
stackstorm-st2api-6769588f8c-mmpsq                0/1     CrashLoopBackOff   22 (4m46s ago)   27h
stackstorm-st2scheduler-585c948ccd-42b85          0/1     CrashLoopBackOff   22 (4m46s ago)   27h
stackstorm-redis-node-0                           1/2     CrashLoopBackOff   24 (2m47s ago)   27h
stackstorm-redis-node-1                           1/2     CrashLoopBackOff   24 (111s ago)    27h
stackstorm-st2api-6769588f8c-w7bhz                0/1     Error              23 (5m19s ago)   27h
stackstorm-st2scheduler-585c948ccd-h9br2          1/1     Running            23 (5m12s ago)   27h

view pod the error log :

amqp.exceptions.NotFound: Queue.declare: (404) NOT_FOUND - home node 'rabbit@stackstorm-rabbitmq-0.stackstorm-rabbitmq-headless.default.svc.cluster.local' of durable queue 'st2.preinit' in vhost '/' is down or inaccessible
2022-02-05 12:23:53,149 INFO [-] (PID=1) ST2 API is serving on http://0.0.0.0:9101.
2022-02-05 12:23:53,149 INFO [-] Creating st2api: StackStorm v3.6.0 as OpenAPI app.
2022-02-05 12:23:53,518 ERROR [-] (PID=1) ST2 API quit due to exception.
Traceback (most recent call last):
  File "/opt/stackstorm/st2/lib/python3.6/site-packages/tooz/drivers/redis.py", line 42, in _translate_failures
    yield
  File "/opt/stackstorm/st2/lib/python3.6/site-packages/tooz/drivers/redis.py", line 450, in _start
    self._server_info = self._client.info()
  File "/opt/stackstorm/st2/lib/python3.6/site-packages/redis/client.py", line 1304, in info
    return self.execute_command('INFO')
  File "/opt/stackstorm/st2/lib/python3.6/site-packages/redis/client.py", line 898, in execute_command
    conn = self.connection or pool.get_connection(command_name, **options)
  File "/opt/stackstorm/st2/lib/python3.6/site-packages/redis/connection.py", line 1192, in get_connection
    connection.connect()
  File "/opt/stackstorm/st2/lib/python3.6/site-packages/redis/sentinel.py", line 44, in connect
    self.connect_to(self.connection_pool.get_master_address())
  File "/opt/stackstorm/st2/lib/python3.6/site-packages/redis/sentinel.py", line 107, in get_master_address
    self.service_name)
  File "/opt/stackstorm/st2/lib/python3.6/site-packages/redis/sentinel.py", line 219, in discover_master
    raise MasterNotFoundError("No master found for %r" % (service_name,))
redis.sentinel.MasterNotFoundError: No master found for 'mymaster'

How can I solve this problem?????

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't workinghelp wantedExtra attention is neededquestionFurther information is requested

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions