
fix: race when bumping items while loading a snapshot #4564

Open
wants to merge 1 commit into base: main
Conversation

kostasrim
Contributor

The original issue was submitted in #4497 and we supplied a fix in #4507. However, the fix ignored that RdbLoader::Load() runs per flow/shard thread, and the "poison pill" of updating the loading state at the end of RdbLoader::Load() introduced a race condition:

shard_set->Add(i, [bc]() mutable {
  namespaces->GetDefaultNamespace().GetCurrentDbSlice().SetLoadInProgress(false);
  bc->Dec();
});

Any flow F that finishes loading its own snapshot first (relative to the rest of the flows) will call SetLoadInProgress(false) on ALL shard threads. The consequence is that while the other flows are not yet done (their respective RdbLoader::Load() is still processing), the next time they use the db slice API they will start bumping items, because load-in-progress is now false.

The fix is to update the state only after all shard flows are done and, similarly, to update all shard flows before we start Load(), which provides a consistent state/view among all shard threads.

Should resolve #4554

P.S. we might be able to simplify the new db slice state via the global loading state. That's something I will need to follow up on, but I won't do it as part of this PR.

@kostasrim kostasrim self-assigned this Feb 5, 2025
Collaborator

@romange romange left a comment


Maybe I misunderstand something, but why do we need to orchestrate SetLoadInProgress on all the shards? Can we do it locally on each shard, i.e. independently for each shard?

@kostasrim
Contributor Author

Maybe I misunderstand something, but why do we need to orchestrate SetLoadInProgress on all the shards? Can we do it locally on each shard, i.e. independently for each shard?

Good question! Because master and replica might have different numbers of proactors. So imagine the following case:

Flow 1 -> Sets the shard's 1 loading state to true.
Flow 2 -> Still hasn't set its local loading state to true (the flow fiber did not even start because the proactor was busy)

Flow 1 had only one item, so its Load finished instantly; it then calls FinishLoad, which calls FlushShardAsync, which unfortunately calls LoadItemsBuffer on shard number 2. Boom, the loading state there is false. In other words, each flow can, at the end, dispatch to multiple shards which might not yet have had their state updated. And we can't really rely on the order of submissions to the task/shard queue (since we don't have a guarantee of sequential task execution, from what you have said in the past -- and I could be wrong here).

However, saying this, I think there is a better solution! When we call FlushShardAsync we can first check whether the loading state is updated; if it's not, we can set it to true. That way we "save" the first scatter part of the operation (dispatching to all shard threads to update the state). We only keep the "gather" step (updating the state to false after all the loaders have completed).

@romange
Copy link
Collaborator

romange commented Feb 5, 2025

I did not analyse the code, but maybe using an unsigned counter instead of a boolean in db_slice would simplify things?
I.e. every time we start we increment, and when we stop we decrement.

Development

Successfully merging this pull request may close these issues.

SIGSEGV with 1.26.2 and cache_mode=true