-
Notifications
You must be signed in to change notification settings - Fork 27
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Improve client handling around director outages #1389
Comments
I encountered this bug (or version of it) this afternoon, around 1:45 PM CDT, directly using the pelican binary in Ubuntu 22.04 (on WSL, Windows 11).
Managed to catch the error with
Oddly, if I typed the command manually, I did not receive an error. As demonstrated live to @turetske , @jhiemstrawisc , if I copied the command from the website (using the "Copy" button in the code block, or highlighting then Later attempts to reproduce the error worked as expected; presumably the OSDF director was back online. I did change my wireless network in between tests; perhaps the first network was doing something funky? |
@aowen-uwmad - that's a puzzler. The headers indicate you're talking to a HTTP server that's proxied through CloudFlare. Needless to say -- that's not where our director lives! Is there anything special about your laptop's networking setup that might be relevant? Next time this occurs, could you also run Another idea - could you and @williamnswanson try reproducing it for the ITB director? |
My laptop setup has a history of funky networking things, but it hasn't acted up in quite a while. |
A client unlucky enough to start while the director is restarted will get either a 404 or a 500 from the Traefik ingress (I think it depends slightly on where things are in the restart process).
This is a fatal outage; here's a few example hold messages generated:
A few thoughts:
The text was updated successfully, but these errors were encountered: