cancelling a waiter does not free all associated memory until a slot is returned #59

problame · 2024-10-28T18:11:20Z

Problem

Assume all slots are in use by submitted operations.

If we now want to perform another tokio-epoll-uring operation, we enqueue in SlotsInnerState::Open::waiters here:

If we now drop the returned rx, the tx remains stored in waiters until return_slot gets called and we run

That happens only if one of the submitted operations completes.

In the meantime, the txes pile up in waiters.

Memory exhaustion and hence OOM kill in pathological cases.

Unlikely, but, could happen if more and more ops are requested and filesystem hangs completely.

The OOM could make it harder to understand the root cause in such cases.

Proactive cleanup of the waiters array as part of Drop impl?
Wrap waiters in a semaphore?
Follow long-standing TODO to replace waiters with a bounded channel (cap=SLOT_SIZE) that in idle state stores all the slot indices?

I found this bug while

The text was updated successfully, but these errors were encountered: