Set /pdf requests per node 8 to 4, adds retry. #402

bdc34 · 2023-10-10T22:15:36Z

This:

decreases the concurrent per-webnode /pdf requests from 8 to 4
adds retry to /pdf requests
adds retry to upload to GS
increases the wait time for a pdf to be built from 3 minutes to 5 minutes
adds a log msg on successful GET /pdf

Both retries use the default settings: an initial delay of 1 sec, a max delay of 60 sec, a multiplier of 2, and a timeout of 120 sec.

The docs describe the timeout as follows: "Timeout: the maximum duration of time after which a certain operation must terminate (successfully or with an error). The countdown begins right after an operation was started. For example, if an operation was started at 09:24:00 with timeout of 75 seconds, it must terminate no later than 09:25:15."
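As a rough illustration of what those defaults imply, the sketch below computes the nominal pause schedule for an initial delay of 1 sec, a max delay of 60 sec, a multiplier of 2 and a timeout of 120 sec. This is not the retry library's code: `nominal_backoff` is a hypothetical helper, and it ignores any jitter the library may apply as well as the time spent inside each request, so the real number of retries could be lower.

```python
def nominal_backoff(initial=1.0, maximum=60.0, multiplier=2.0, timeout=120.0):
    """Return the nominal pauses (in seconds) that fit inside the timeout.

    Hypothetical helper for illustration only. It assumes each attempt
    takes zero time and that no jitter is added to the pauses.
    """
    delays = []
    elapsed = 0.0
    delay = initial
    # Keep scheduling pauses while the next one still ends before the timeout.
    while elapsed + delay <= timeout:
        delays.append(delay)
        elapsed += delay
        # Grow the pause geometrically, capped at the maximum delay.
        delay = min(delay * multiplier, maximum)
    return delays


schedule = nominal_backoff()
print(schedule)       # nominal pauses between attempts
print(sum(schedule))  # total time spent pausing
```

Under these assumptions the pauses double from 1 sec up to 32 sec and add up to 63 sec; the next pause would be capped at 60 sec and would overrun the 120 sec timeout, so no further retry would be scheduled.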

Decreases the per node concurrent requests from 8 to 4. Increases the time to build from 3 to 6 min.

bdc34 · 2023-10-10T22:21:48Z

script/sync_prod_to_gcp/sync_published_to_gcp.py

@@ -262,6 +278,31 @@ def path_to_bucket_key(pdf) -> str:
        raise ValueError(f"Cannot convert PDF path {pdf} to a GS key")


+@retry.Retry(predicate=retry.if_exception_type(PDF_RETRY_EXCEPTIONS))
+def get_pdf(session, pdf_url) -> None:


Moves the GET /pdf out into its own function so that a retry can be applied to it.

cbf66

It'd be nice if the log messages could record the time it took to make each PDF, but we can add that tomorrow.

DavidLFielding

These changes look good to me.

Keep in mind, when not in redirect mode, that other requests will also trigger compilations. This may result in some 503 errors.

The 503 handling currently does not appear to do anything to slow things down. Will not be an issue when requests and max processes are in sync. Other user requests may result in compilations and 503 errors.

bdc34 · 2023-10-11T13:24:39Z

> These changes look good to me.
>
> Keep in mind, when not in redirect mode, that other requests will also trigger compilations. This may result in some 503 errors.
>
> The 503 handling currently does not appear to do anything to slow things down. Will not be an issue when requests and max processes are in sync. Other user requests may result in compilations and 503 errors.

The @retry.Retry() decorator on get_pdf will cause a retry to happen on a 503, but only after a pause. After the first 503 the pause will be 1 sec, then 2 sec, then 4 sec, etc. There are parameters to tune how these pauses happen.

There is no mechanism in place in this code for slowing down beyond that. I think this should be fine, since returning the 503 from /pdf should be low cost and low load.
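To make the pause behaviour concrete, here is a minimal sketch of a retry decorator with exponential backoff. It is a hand-rolled stand-in, not the library decorator used in the PR, and it adds no jitter. The names `retry_with_backoff`, `ServiceUnavailable` and `flaky_get_pdf` are hypothetical, and the `sleep` and `clock` parameters exist only so the example can run without actually waiting.

```python
import functools
import time


class ServiceUnavailable(Exception):
    """Hypothetical stand-in for the error raised when /pdf returns a 503."""


def retry_with_backoff(exc_types, initial=1.0, maximum=60.0, multiplier=2.0,
                       timeout=120.0, sleep=time.sleep, clock=time.monotonic):
    """Retry the wrapped call on exc_types, pausing longer after each failure."""
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            deadline = clock() + timeout
            delay = initial
            while True:
                try:
                    return func(*args, **kwargs)
                except exc_types:
                    # Give up once the next pause would run past the deadline.
                    if clock() + delay > deadline:
                        raise
                    sleep(delay)
                    # Double the pause, capped at the maximum delay.
                    delay = min(delay * multiplier, maximum)
        return wrapper
    return decorator


pauses = []              # records requested pauses instead of sleeping
attempts = {"count": 0}


@retry_with_backoff((ServiceUnavailable,), sleep=pauses.append)
def flaky_get_pdf():
    """Fails three times with a simulated 503, then succeeds."""
    attempts["count"] += 1
    if attempts["count"] <= 3:
        raise ServiceUnavailable("503 from /pdf")
    return "ok"


result = flaky_get_pdf()
print(result, pauses)
```

In this sketch the only throttling is the growing pause between attempts of a single call, which matches the point above: nothing slows the overall request rate beyond the per-call backoff.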

bdc34 added 3 commits October 10, 2023 17:53

sync_published_to_gcp.py: decreases per node concurrent /pdf req

31cba01

Decreases the per node concurrent requests from 8 to 4. Increases the time to build from 3 to 6 min.

sync_published_to_gcp.py: Adds retrys to PDF get and upload to GS

6972fb7

Moves get_pdf out of ensure_pdf

065299d

bdc34 requested review from ntai-arxiv, DavidLFielding, cbf66, jonathanhyoung and bmaltzan October 10, 2023 22:15

Removes unused exception classes

bf72ca4

cbf66 approved these changes Oct 10, 2023

DavidLFielding approved these changes Oct 10, 2023

bdc34 merged commit 6ea67d7 into develop Oct 11, 2023

bdc34 deleted the sync-per-node-8-to-4 branch October 11, 2023 19:37
