[server] Recover log and index file for unclean shutdown #1749

LiebingYu · 2025-09-24T03:30:51Z

Purpose

Linked issue: close #1716

Brief change log

Tests

API and Format

Documentation

LiebingYu · 2025-09-24T08:23:21Z

Ready for CR @wuchong @swuferhong

swuferhong

@LiebingYu Thanks for your work. I left some comments.

swuferhong · 2025-09-25T09:29:56Z

fluss-server/src/main/java/org/apache/fluss/server/log/LogLoader.java

+                new WriterStateManager(
+                        logSegments.getTableBucket(),
+                        logTabletDir,
+                        this.writerStateManager.writerExpirationMs());


Why do we need to create a new WriterStateManager here? Maybe we could add a clear() method instead?

LiebingYu · Sep 25, 2025

This logic follows Kafka, and creating a new WriterStateManager is a lightweight operation, so I think it's OK.
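
To make the two options in this exchange concrete, here is a minimal sketch. It assumes a simplified, hypothetical state holder (`SimpleWriterState`, `freshCopy`, and all field names are invented for illustration and are not the Fluss `WriterStateManager` API). It only contrasts "replace with a fresh instance" against "reset in place with `clear()`".

```java
import java.util.HashMap;
import java.util.Map;

/** Illustrative only: a simplified stand-in, not the Fluss WriterStateManager. */
final class WriterStateResetSketch {

    /** Hypothetical state holder with one config value and two pieces of mutable state. */
    static final class SimpleWriterState {
        private final long expirationMs;
        private final Map<Long, Integer> lastSequenceByWriter = new HashMap<>();
        private long endOffset = 0L;

        SimpleWriterState(long expirationMs) {
            this.expirationMs = expirationMs;
        }

        /** Records the latest sequence for a writer and advances the end offset. */
        void update(long writerId, int sequence, long newEndOffset) {
            lastSequenceByWriter.put(writerId, sequence);
            endOffset = newEndOffset;
        }

        long expirationMs() {
            return expirationMs;
        }

        int writerCount() {
            return lastSequenceByWriter.size();
        }

        long endOffset() {
            return endOffset;
        }

        /** Option B: reset in place. Every mutable field has to be listed here. */
        void clear() {
            lastSequenceByWriter.clear();
            endOffset = 0L;
        }
    }

    /** Option A: build a fresh instance, carrying over only the configuration. */
    static SimpleWriterState freshCopy(SimpleWriterState old) {
        return new SimpleWriterState(old.expirationMs());
    }
}
```

In this sketch both options end in the same empty state. The difference is maintenance: a fresh instance cannot retain stale state, while `clear()` must be kept in sync whenever a new mutable field is added.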

fluss-server/src/test/java/org/apache/fluss/server/log/LogLoaderTest.java

swuferhong · 2025-09-25T09:52:47Z

fluss-server/src/main/java/org/apache/fluss/server/log/LogLoader.java

+                        numUnflushed,
+                        logSegments.getTableBucket());
+
+                int truncatedBytes = -1;


Do we need to be this aggressive here? Deleting all subsequent logSegments just because one cannot be repaired might pose a risk of data loss. Also, we don't have test coverage for this logic.

LiebingYu · Sep 26, 2025

This logic also follows Kafka. From my perspective, data loss is unlikely because the data is stored in multiple replicas. Once the file is truncated to the correct position, the replica can synchronize the latest data from the leader. If truncation is not carried out, the file appears to be unrecoverable, and if this host later becomes the leader, unforeseen problems might occur.

LiebingYu · Sep 26, 2025

I have also added a test to cover this.
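
The behaviour under discussion can be sketched roughly as follows. This is a simplified in-memory model, not the code in `LogLoader.java`: `Segment`, `recover`, and `recoverAll` are hypothetical names, and a boolean per record stands in for whatever validity check the real recovery performs. The idea shown is that once a segment has to be truncated, later segments would no longer be contiguous with it, so they are dropped and the replica is expected to re-fetch from the leader.

```java
import java.util.ArrayList;
import java.util.List;

/** Illustrative only: an in-memory model of "truncate, then drop later segments". */
final class RecoveryTruncationSketch {

    /** Hypothetical segment; each flag says whether the record at that position is valid. */
    static final class Segment {
        final long baseOffset;
        final List<Boolean> recordsValid;

        Segment(long baseOffset, List<Boolean> recordsValid) {
            this.baseOffset = baseOffset;
            this.recordsValid = new ArrayList<>(recordsValid);
        }

        /** Cuts the segment at the first invalid record; returns how many records were removed. */
        int recover() {
            int firstBad = recordsValid.indexOf(Boolean.FALSE);
            if (firstBad < 0) {
                return 0;
            }
            int removed = recordsValid.size() - firstBad;
            recordsValid.subList(firstBad, recordsValid.size()).clear();
            return removed;
        }

        long endOffset() {
            return baseOffset + recordsValid.size();
        }
    }

    /**
     * Recovers segments in offset order. The list must be mutable.
     * Returns the log end offset after recovery.
     */
    static long recoverAll(List<Segment> segments) {
        for (int i = 0; i < segments.size(); i++) {
            int truncated = segments.get(i).recover();
            if (truncated > 0) {
                // Keeping later segments would leave an offset gap after the cut,
                // so they are removed; a follower can re-fetch them from the leader.
                segments.subList(i + 1, segments.size()).clear();
                break;
            }
        }
        return segments.isEmpty() ? 0L : segments.get(segments.size() - 1).endOffset();
    }
}
```

In this model, with segments starting at offsets 0, 3, and 6 and a bad record at offset 4, recovery keeps offsets 0 to 3, discards the third segment, and leaves the log end offset at 4.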

LiebingYu force-pushed the fix-corrupt-index-new branch 2 times, most recently from b9b5f2d to ceaf704 Compare September 24, 2025 07:50

swuferhong reviewed Sep 25, 2025

LiebingYu added 2 commits September 25, 2025 19:48

[server] Recover log and index file for unclean shutdown

574140e

fix comments

1e8a582

LiebingYu force-pushed the fix-corrupt-index-new branch from ceaf704 to 1e8a582 Compare September 25, 2025 14:06

add tests

ed5f1c2
