You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Last saturday (Nov 23, 2024) we got main archive to become unresponsive because of being unable to communicate to API server. To the visitor it hanged for awhile eventually showing
and specific "trigger" to the situation is webdav needing to make per-asset requests on a heavy in number of assets dandiset 000026. The particular issue to be addressed to allow for more efficient API is
but the point of this issue is different. IMHO API service should be made more robust against DoS situations where one client or some specific set of IPs hog it up preventing others entirely. Possibly it could be done via limiting but I think it is worth looking into some "QoS" (quality of service) balancing/throttling.
The text was updated successfully, but these errors were encountered:
Possibly it could be done via limiting but I think it is worth looking into some "QoS" (quality of service) balancing/throttling.
The standard solution is rate limiting. QoS balancing/throttling sounds orders of magnitude more complex. If we want to look into it as a research-type solution, that's fine, but rate limiting is really the thing to do.
Last saturday (Nov 23, 2024) we got main archive to become unresponsive because of being unable to communicate to API server. To the visitor it hanged for awhile eventually showing
more on the situation could be discovered in slack: https://app.slack.com/client/E01044K0LBZ/GMRLT5RQ8
Sample of logs from around that point in time happen someone decides to check:
and specific "trigger" to the situation is webdav needing to make per-asset requests on a heavy in number of assets dandiset 000026. The particular issue to be addressed to allow for more efficient API is
but the point of this issue is different. IMHO API service should be made more robust against DoS situations where one client or some specific set of IPs hog it up preventing others entirely. Possibly it could be done via limiting but I think it is worth looking into some "QoS" (quality of service) balancing/throttling.
The text was updated successfully, but these errors were encountered: