
Task/WG-383: add queue for heavy tasks #220

Merged (29 commits, Oct 31, 2024)

Conversation

nathanfranklin
Collaborator

Overview:

Add a heavy queue for computationally intensive tasks. In this PR, the queue is used by:

  • Point cloud processing, which uses a lot of CPU and memory when handling large point clouds.
  • Street view uploading, which can take a long time (even though it doesn't use much CPU or memory).

The number of workers for each queue:

  • The default queue uses Celery's default behavior: nproc worker processes.
  • The heavy queue is capped at nproc/2 workers so intensive or long-running tasks don't monopolize the system (see the sketch below).
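
For illustration, a minimal sketch of how the heavy worker's concurrency could be derived from nproc at startup. This is only an assumption about how it might be wired up; the actual worker command in this PR (shown later in this thread) hardcodes the concurrency value.

# sketch only: size the heavy-queue worker at half the available cores
HEAVY_CONCURRENCY=$(( $(nproc) / 2 ))
celery -A geoapi.celery_app worker -l info -Q heavy --concurrency="${HEAVY_CONCURRENCY}" -n heavy_worker@geoapi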

Note: While we've discussed categorizing tasks based on their true intensity/need (e.g., differentiating between small and large point cloud processing), this PR doesn't implement that. Instead, it makes an incremental improvement by routing large tasks to a separate queue with fewer workers, to i) reduce the risk of memory exhaustion during large point cloud processing, and ii) prevent intensive tasks from monopolizing the system.

Related Jira tickets:

Summary of Changes:

Testing Steps:

  1. Run the stack, then load features such as images and also point clouds to confirm that both queues are processing tasks (see the check below).
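
If useful, the workers and the queues they consume can also be checked with Celery's built-in inspection command (assuming the same geoapi.celery_app module used elsewhere in this PR):

celery -A geoapi.celery_app inspect active_queues
# expected: default_worker@geoapi consuming 'default' and heavy_worker@geoapi consuming 'heavy'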

Also, fixes nsf log snippet application.
This was high as we used to support direct file upload instead of using TAPIS
@nathanfranklin nathanfranklin changed the base branch from main to task/WG-285-update-potree-converter October 10, 2024 16:11
Base automatically changed from task/WG-285-update-potree-converter to main October 28, 2024 19:46
Contributor

@rstijerina rstijerina left a comment


LGTM

sh -c '
# start a worker for each queue in the background, then wait so the shell stays alive
celery -A geoapi.celery_app worker -l info -Q default -n default_worker@geoapi &
celery -A geoapi.celery_app worker -l info -Q heavy --concurrency=6 -n heavy_worker@geoapi &
wait
'
Contributor


Just curious - is this preferable over letting the process hang on the celery command?

Collaborator Author


@rstijerina As we have two celery worker commands, at least the first one needed to be run in the background; the second one could have been run in the foreground and the wait skipped.
I didn't see a way to run a single celery command and say "I want X concurrency on this queue and Y concurrency on this other queue."
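
For comparison, the variant described above (first worker backgrounded, second left in the foreground, no wait) would look roughly like this, reusing the same commands; a sketch only:

sh -c '
celery -A geoapi.celery_app worker -l info -Q default -n default_worker@geoapi &
celery -A geoapi.celery_app worker -l info -Q heavy --concurrency=6 -n heavy_worker@geoapi
'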

@@ -167,7 +167,7 @@ def _handle_point_cloud_conversion_error(pointCloudId, userId, files, error_desc
f"Processing failed for point cloud ({pointCloudId})!")


-@app.task(rate_limit="1/s")
+@app.task(queue='heavy')
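
As a side note on this routing approach: with the queue set in the task decorator, existing call sites shouldn't need changes, and a specific call could still be sent to another queue explicitly. A rough sketch (the task name here is hypothetical):

# default: routed to the 'heavy' queue by the decorator option
process_point_cloud.delay(point_cloud_id)
# a particular call could still be overridden per invocation if ever needed
process_point_cloud.apply_async(args=[point_cloud_id], queue='default')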
Contributor


Just a suggestion - should we add a logger to let us know which queue a task is using? Probably not needed, just curious if that would help us down the line.

Collaborator Author


The task itself gets logged, but not which queue it was in. We could have a base celery task that our tasks use and log that info there. Good idea for when we make improvements to this.
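
A rough sketch of what such a base task could look like; the class name, import path, and use of the before_start hook (Celery 5.2+) are assumptions for illustration, not part of this PR:

import logging

from celery import Task

from geoapi.celery_app import app  # assumed import path for the Celery app

logger = logging.getLogger(__name__)


class QueueLoggingTask(Task):
    """Base task that logs which queue delivered each task."""

    def before_start(self, task_id, args, kwargs):
        # delivery_info holds the exchange/routing key the broker used
        delivery_info = getattr(self.request, "delivery_info", None) or {}
        logger.info("Task %s (%s) received via queue %r",
                    self.name, task_id, delivery_info.get("routing_key", "unknown"))


@app.task(base=QueueLoggingTask, queue="heavy")
def example_heavy_task(some_id):
    # hypothetical task, just to show wiring the base class
    ...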

@nathanfranklin nathanfranklin merged commit 9027c0f into main Oct 31, 2024
3 checks passed
@nathanfranklin nathanfranklin deleted the task/WG-383-add-queue-for-heavy-tasks branch October 31, 2024 19:20