You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It's possible that tasks executed via ansible hang. For example, there were a few cases of podman system prune -f indefinitely blocking. That's likely because of a bug in the container runtime. Nevertheless, do.sh should not indefinitely hang.
It happened again. What I ca see is that we have a zombie process in the container and conmon doesn't reap it. While stopping the container we hit the Error: cannot remove container d9f655ac066b5af3f4c2df875e897207d027e5a5db2c900d958fdf176b4ec4cf as it could not be stopped: given PIDs did not die within timeout. This is likely triggering SIGKILL that kills parents and they have no chance to wait for a child, causing a PID 1 problem essentially.
It's possible that tasks executed via ansible hang. For example, there were a few cases of
podman system prune -f
indefinitely blocking. That's likely because of a bug in the container runtime. Nevertheless,do.sh
should not indefinitely hang.We should instead add timeouts to ansible tasks. One option is to set a global timeout:
https://docs.ansible.com/ansible/latest/reference_appendices/config.html#task-timeout
It's possible though that some tasks need a longer timeout than others. This needs to be investigated further.
The text was updated successfully, but these errors were encountered: