-
Notifications
You must be signed in to change notification settings - Fork 217
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Image data refresh's "update media popularity constants" step hung on last run #1357
Comments
I've marked this as "critical" because we are unable to run the image data refresh until this issue is resolved. We may be able to enable the DAG and skip the popularity recalculation steps since those only run once a month, but if there's a root cause for this we should work to try and find it. |
It looks like I failed to update the |
@AetherUnbound Starting the data refresh today sounds fine. Could you do that and let folks know in Slack when it's been kicked off? Two questions as well:
|
Since we're just stuffing that as a field into the
We could potentially try running the matview refresh command directly on postgres, but that would require one of us to have a connection to postgres open for the duration of the query which seems less feasible than simply kicking off the DAG. Good questions! |
I've started the image data refresh and will report back here with the results tomorrow. |
I wonder if it was related to our increased scraping traffic from the past few weeks. Interesting! |
I would be really surprised, since this is all happening on the catalog database which should be isolated from user traffic! |
Description
The
refresh_popularity_metrics_and_constants.update_media_popularity_constants_view
task for the image data refresh DAG (which typically takes about 10 hours) ended up running for over 9 days. The query was still running on the postgres backend at that time. We took action to pause the DAG and kill the query. If we're able to reproduce this behavior, we need to investigate why it's happening.Additional context
Resolution
The text was updated successfully, but these errors were encountered: