Add panoptes-production-sidekiq-alt pod type #4425

lcjohnso · 2024-12-20T19:23:40Z

To address high latency in lower priority queues while UserSeenSubjectsWorker jobs in data_high are dominating available threads, this PR adds a single dedicated pod that will service the default (most important), data_medium, and data_low queues. This change adds an additional pod to the minimum total number of pods, but this cost makes sense in the short term (i.e., while panoptes replica DB is out of service during migration).

lcjohnso · 2024-12-20T19:38:16Z

On hold for now due to more aggressive autoscaling resolving backups w/o need for queue-specialized pods.

yuenmichelle1

Approving and merging. Issue is still happening. Will create a write up on Slack thread.

Essentially: UserSeenSubjectsWorker hits a connection timeout on the update transaction. This leads to potential data locking. Not only that but UserSeenSubjectsWorker is set to retry (I believe until success), which leads to a backup of jobs in queue.

Current theory is that because we shut off our read-replica, even though this query and others do not utilize the read-replica, there are current heavy queries that were using read-replica that are now sharing resources from production db and making it harder for these smaller transactions to clear through.

However, because these workers are typically fast, CPU never hits threshold of autoscaler until job queue backup hits a really high amount, (by then latency of data_medium and lower priority queues are hours behind).

To mitigate, as Cliff has mentioned, we create a dedicated pod to deal with the lower priority queues. (Data_medium and lower)

Add new panoptes-production-sidekiq-alt pod for queue clearing

e427e7e

lcjohnso requested a review from yuenmichelle1 December 20, 2024 19:23

yuenmichelle1 approved these changes Dec 23, 2024

View reviewed changes

yuenmichelle1 merged commit 3cb6280 into master Dec 23, 2024
8 checks passed

yuenmichelle1 deleted the sidekiq-addaltpod-1 branch December 23, 2024 15:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add panoptes-production-sidekiq-alt pod type #4425

Add panoptes-production-sidekiq-alt pod type #4425

lcjohnso commented Dec 20, 2024

lcjohnso commented Dec 20, 2024

yuenmichelle1 left a comment •

edited

Loading

Add panoptes-production-sidekiq-alt pod type #4425

Add panoptes-production-sidekiq-alt pod type #4425

Conversation

lcjohnso commented Dec 20, 2024

lcjohnso commented Dec 20, 2024

yuenmichelle1 left a comment • edited Loading

Choose a reason for hiding this comment

yuenmichelle1 left a comment •

edited

Loading