fix: pop logs in log segment after cleaning up logs #3064

ion-elgreco · 2024-12-16T22:16:46Z

Description

The description of the main changes of your pull request

Related Issue(s)

closes #3062 (cleanup_expired_logs_for doesn't update logSegment after removal)

roeap

We definitely should not be producing corrupt state and risk querying an invalid snapshot.

However, I hope we can think a bit more about the implications of some of these changes and the root cause we are looking at.

I need to dive a bit deeper, but right now I don't really understand how this can happen. If the metadata is part of the snapshot, it should still be valid? Or rather, why would the logs get cleaned up if they are part of the active snapshot, without any operations in between that would cause the actions to no longer be part of a version?

I am certain I am missing something, I just don't know what it is.

More generally, log cleanup would usually not be on "the critical path"; it seems these jobs would even run in completely separate processes? So rather than trying to update the internal log, could we just consume the state and trigger a new replay? Or something along those lines?

In the "pure" way of thinking, snapshots are immutable. Incremental updates are an optimization, but really they just treat the current state as a snapshot and replay new files on top of it.
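
One way to picture this "pure" view, as a sketch only: state is a value derived by folding commits, and an incremental update is the same fold started from an existing state. The action format used here (`("add", path)` and `("remove", path)` tuples) and the function `replay` are simplifications invented for illustration; they are not delta-rs or kernel types.

```python
def replay(state, commits):
    """Return a new frozenset of live files; the input state is never mutated."""
    live = set(state)
    for commit in commits:
        for action, path in commit:
            if action == "add":
                live.add(path)
            elif action == "remove":
                live.discard(path)
    return frozenset(live)


commits = [
    [("add", "a.parquet")],
    [("add", "b.parquet")],
    [("remove", "a.parquet"), ("add", "c.parquet")],
]

# Full replay from an empty state.
full = replay(frozenset(), commits)

# Incremental update: treat the state at version 1 as the snapshot
# and replay only the newer commit on top of it.
at_v1 = replay(frozenset(), commits[:2])
incremental = replay(at_v1, commits[2:])
```

In this toy model the full and incremental results are equal, and `at_v1` is left unchanged by the later replay.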

Last but not least, I am hoping to integrate more with kernel and already did some promising experiments. Ideally we would keep the surface area of the snapshot minimal until we have some more clarity.

ion-elgreco · 2024-12-17T22:02:25Z

@roeap The snapshot is not valid anymore after you remove the logs from the object store. The logs don't exist, hence the snapshot should not reference them anymore.

However, in its current state the log segment still references files that have been deleted from the object store, which causes vacuum to fail because it tries to read each file in the log segment.
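
The failure mode described here can be reproduced with a toy model. The sketch below is not delta-rs code: `ToyStore`, `ToyLogSegment`, and `read_all` are hypothetical names, and a dict stands in for the object store. It only illustrates that a segment listing paths which were deleted underneath it fails as soon as something reads every listed file.

```python
class ToyStore:
    """Hypothetical stand-in for an object store: maps path -> bytes."""

    def __init__(self):
        self.objects = {}

    def put(self, path, data):
        self.objects[path] = data

    def get(self, path):
        if path not in self.objects:
            raise FileNotFoundError(path)
        return self.objects[path]

    def delete(self, path):
        del self.objects[path]


class ToyLogSegment:
    """Hypothetical in-memory list of commit file paths held by a snapshot."""

    def __init__(self, commit_files):
        self.commit_files = list(commit_files)

    def read_all(self, store):
        # Reads every referenced file, as the vacuum path is described as doing.
        return [store.get(path) for path in self.commit_files]


store = ToyStore()
paths = [f"_delta_log/{v:020d}.json" for v in range(3)]
for p in paths:
    store.put(p, b"{}")

segment = ToyLogSegment(paths)

# Cleanup deletes the oldest commit from the store but leaves the segment untouched.
store.delete(paths[0])

try:
    segment.read_all(store)
    stale_read_failed = False
except FileNotFoundError:
    stale_read_failed = True
```

The segment still lists three files after the delete, so the read fails until the segment is either updated or rebuilt.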

roeap · 2024-12-17T22:05:39Z

The snapshot is not valid anymore after you remove the logs from the object store.

I still don't understand how that situation arises :) - are we deleting data?

hence the snapshot should not reference them anymore.

could we just discard the snapshot then? i.e. have the function just consume the snapshot? It is a snapshot after all :) which can go invalid if the table is updated ...

ion-elgreco · 2024-12-17T22:13:08Z

The snapshot is not valid anymore after you remove the logs from the object store.

I still don't understand how that situation arises :) - are we deleting data?

Yes, cleanup_metadata removes the physical log files, and also post_commit_hook removes logs if the interval for it has been met.

So there are two situations in which the snapshot becomes invalid, and only forcefully reloading the table fixes it. The linked issue describes the behaviour.

hence the snapshot should not reference them anymore.

could we just discard the snapshot then? i.e. have the function just consume the snapshot? It is a snapshot after all :) which can go invalid if the table is updated ...

Replaying the log stream seems more compute-intensive than just popping the log files you know got deleted from the object store.
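
The two options being weighed can be compared in a toy model. This is a sketch under assumptions: `pop_deleted` and `reload` are hypothetical helpers, the store is a plain set of paths, and none of it reflects the actual delta-rs `LogSegment` internals. Popping filters the in-memory list using what the cleanup already knows it deleted; reloading lists the store again and rebuilds the list.

```python
def pop_deleted(commit_files, deleted):
    """Drop entries that cleanup reported as deleted; no store access needed."""
    deleted = set(deleted)
    return [p for p in commit_files if p not in deleted]


def reload(store_paths, prefix="_delta_log/"):
    """Rebuild the list by listing the (toy) store again."""
    return sorted(
        p for p in store_paths if p.startswith(prefix) and p.endswith(".json")
    )


commit_files = [f"_delta_log/{v:020d}.json" for v in range(5)]
store_paths = set(commit_files)

# Cleanup removes the two oldest commits and reports which ones.
deleted = commit_files[:2]
store_paths -= set(deleted)

popped = pop_deleted(commit_files, deleted)
reloaded = reload(store_paths)
```

In this model both approaches end with the same list. The difference is that `reload` has to go back to the store, and a real replay would also have to re-read file contents, which is the cost argument made above.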

ion-elgreco requested review from wjones127, fvaleye, roeap, rtyler and hntd187 as code owners December 16, 2024 22:16

github-actions bot added the binding/python (Issues for the Python package) and binding/rust (Issues for the Rust crate) labels Dec 16, 2024

ion-elgreco force-pushed the fix--pop-logs-in-logsegment-after-cleanup branch 3 times, most recently from 42e9a94 to e4af46f December 16, 2024 22:20

fix: pop logs in log segment

b096bab

ion-elgreco force-pushed the fix--pop-logs-in-logsegment-after-cleanup branch from e4af46f to b096bab December 17, 2024 19:21

roeap requested changes Dec 17, 2024

ion-elgreco closed this Jan 6, 2025

ion-elgreco mentioned this pull request Jan 16, 2025

Very weird behavior with merge + checkpoints + optimization #3133

Closed
