Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Some builds are stuck in the running state #2936

Closed
FrostyX opened this issue Oct 2, 2023 · 4 comments
Closed

Some builds are stuck in the running state #2936

FrostyX opened this issue Oct 2, 2023 · 4 comments
Assignees
Labels
fedora-copr-admin Tasks that need to be done by Fedora Copr administrator

Comments

@FrostyX
Copy link
Member

FrostyX commented Oct 2, 2023

Some builds appear to be stuck in the running state. I briefly looked at the logs and it seems there is more than one cause.

Screenshot_2023-10-02_12-27-04

@FrostyX FrostyX added the fedora-copr-admin Tasks that need to be done by Fedora Copr administrator label Oct 2, 2023
@FrostyX
Copy link
Member Author

FrostyX commented Oct 2, 2023

There was no way for us to switch them back to a pending state, obtain a new builder, and try again. I needed to fail all those builds, so I apologize for any inconvenience if this affected you. Please resubmit your builds if needed.

@FrostyX FrostyX closed this as completed Oct 2, 2023
@praiskup
Copy link
Member

praiskup commented Oct 3, 2023

I needed to fail all those builds, so I apologize for any inconvenience if this affected you.

Killing the background worker (at least with 9) should indeed restart the build (worker disappears without telling the dispatcher, so the dispatcher starts a new worker instead). If this isn't the actual behavior, it is a bug.

@FrostyX
Copy link
Member Author

FrostyX commented Oct 3, 2023

For all of the builds I did

[root@copr-be ~][PROD]# ps ax |grep 6449298
4095001 ?        Sl    12:08 Builder for task 6449298-mageia-9-x86_64: Job 6449298-mageia-9-x86_64, host info: ResallocHost, ticket_id=4374032, hostname=2620:52:3:1:dead:beef:cafe:c14a (command: /usr/bin/copr-backend-process-build --daemon
 --build-id 6449298 --chroot mageia-9-x86_64 --worker-id rpm_build_worker:6449298-mageia-9-x86_64)

[root@copr-be ~][PROD]# kill 4095001

and they all ended up failed. So, maybe a bug.

@praiskup
Copy link
Member

praiskup commented Oct 3, 2023

hm, I suppose kill -9 would be better next time (the background worker could actually partly recover from INT and have a chance to mark the build as failed).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
fedora-copr-admin Tasks that need to be done by Fedora Copr administrator
Projects
Archived in project
Development

No branches or pull requests

2 participants