Task session_cleanup "session_cleanup" failed: ResponseError: search_phase_execution_exception Root causes: no_shard_available_action_exception #205146
Labels
Team:Security
Team focused on: Auth, Users, Roles, Spaces, Audit Logging, and more!
We are seeing thousands of failures per day on serverless regarding the session cleanup task where it encounters no_shard_available_action_exception errors. While this error is not new, it is negatively impacting the task run success SLO where we have to keep a lower SLO target to accommodate for these failures.
The issue seems to be caused by the reliance of point in time functionality where it needs all shards to be available to succeed. I am opening this issue for the security team to investigate whether we need Elasticsearch to solve these issues or if we can make the task more resilient / not reliant on PIT.
Sample stack trace
The text was updated successfully, but these errors were encountered: