fix: detached volume replicas not evicted #3293

c3y1huang · 2024-11-27T05:53:43Z

Which issue(s) this PR fixes:

Issue longhorn/longhorn#9781, longhorn/longhorn#9857

What this PR does / why we need it:

We need to allow the volume eviction controller to handle detached volumes. This proposes another approach to address longhorn/longhorn#9781: proceed with volume attachment for replica eviction only when suitable node/disk candidates are available.

Special notes for your reviewer:

Additional documentation or context

coderabbitai · 2024-11-27T05:53:49Z

Walkthrough

The changes in this pull request enhance the VolumeEvictionController by integrating a new dependency on the ReplicaScheduler for managing replica evictions. A new method checks for valid disk candidates before proceeding with eviction, ensuring that attachment tickets are only created when appropriate. Additionally, a new error constant related to eviction failures is introduced to improve error handling. These modifications aim to refine the decision-making process during volume eviction.

Changes

File	Change Summary
controller/volume_eviction_controller.go	- Added field `scheduler *scheduler.ReplicaScheduler` to `VolumeEvictionController` struct. - Modified `reconcile` method to check for disk candidates before creating attachment tickets. - Introduced method `hasDiskCandidateForReplicaEviction` to validate disk candidates.
k8s/pkg/apis/longhorn/v1beta2/node.go	- Added error constant `ErrorReplicaScheduleEvictReplicaFailed` with message "evict replica failed".

Assessment against linked issues

Objective	Addressed	Explanation
Volume should return to detached state during node eviction (#9781)	✅

Possibly related PRs

fix: detached volume reattached after node eviction #3270: This PR modifies the VolumeEvictionController in the same file and enhances the logic for managing volume attachments, closely related to the changes made in this PR regarding eviction processes.

Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media?

❤️ Share

🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

Review comments: Directly reply to a review comment made by CodeRabbit. Example:
- I pushed a fix in commit <commit_id>, please review it.
- Generate unit testing code for this file.
- Open a follow-up GitHub issue for this discussion.
Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
- @coderabbitai generate unit testing code for this file.
- @coderabbitai modularize this function.
PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
- @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
- @coderabbitai read src/utils.ts and generate unit testing code.
- @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
- @coderabbitai help me debug CodeRabbit configuration file.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

@coderabbitai pause to pause the reviews on a PR.
@coderabbitai resume to resume the paused reviews.
@coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
@coderabbitai full review to do a full review from scratch and review all the files again.
@coderabbitai summary to regenerate the summary of the PR.
@coderabbitai resolve resolve all the CodeRabbit review comments.
@coderabbitai configuration to show the current CodeRabbit configuration for the repository.
@coderabbitai help to get help.

Other keywords and placeholders

Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
Add @coderabbitai anywhere in the PR title to generate the title automatically.

Documentation and Community

Visit our Documentation for detailed information on how to use CodeRabbit.
Join our Discord Community to get help, request features, and share feedback.
Follow us on X/Twitter for updates and announcements.

c3y1huang · 2024-11-27T05:54:11Z

@coderabbitai review

coderabbitai · 2024-11-27T05:54:16Z

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

coderabbitai

Actionable comments posted: 0

🧹 Outside diff range and nitpick comments (2)

controller/volume_eviction_controller.go (2)

209-211: Ensure proper handling when no disk candidates are found during eviction

By adding the condition if vec.hasDiskCandidateForReplicaEviction(replicas, vol), the attachment ticket is only created if there are valid disk candidates. Consider implementing additional logic to handle scenarios where no disk candidates are available, such as updating the volume's condition status or notifying the user that the eviction cannot proceed. This will improve transparency and allow users to take corrective actions.

219-242: Consider evaluating all replicas before returning

Currently, the hasDiskCandidateForReplicaEviction function returns false immediately upon encountering a replica without disk candidates. It might be beneficial to evaluate all replicas to identify every replica that cannot be scheduled for eviction. This approach provides comprehensive feedback, which can be valuable for debugging and informing users about all scheduling issues.

📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between 612db95 and 97ff87a.

📒 Files selected for processing (2)

controller/volume_eviction_controller.go (4 hunks)
k8s/pkg/apis/longhorn/v1beta2/node.go (1 hunks)

🔇 Additional comments (4)

k8s/pkg/apis/longhorn/v1beta2/node.go (1)

59-59: LGTM! Verify the usage of this new error constant.

The new error constant follows the existing naming pattern and provides a clear message for replica eviction failures, which aligns with the PR objectives.

Let's verify how this constant is used in the codebase:

✅ Verification successful

Let me check the usage context in the volume eviction controller to ensure proper implementation.

The error constant is properly implemented and used in the volume eviction controller

The constant is used appropriately in controller/volume_eviction_controller.go when there are no disk candidates available for evicting a replica. The error is aggregated with other potential errors and logged with proper context, following the codebase's error handling patterns.

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Description: Check the usage of the new error constant in the codebase
# Expected: This constant should be used in volume eviction related code

# Search for direct usage of the constant
rg "ErrorReplicaScheduleEvictReplicaFailed" --type go

# Search for the error message to catch string literal usage
rg "evict replica failed" --type go

Length of output: 472

Script:

#!/bin/bash
# Check the implementation context in volume_eviction_controller.go
rg -B 5 -A 5 "ErrorReplicaScheduleEvictReplicaFailed" controller/volume_eviction_controller.go

Length of output: 586

controller/volume_eviction_controller.go (3)

24-25: Addition of scheduler and util package imports is appropriate

The imports for "github.com/longhorn/longhorn-manager/scheduler" and "github.com/longhorn/longhorn-manager/util" are necessary for the new scheduling functionality and utility methods.

43-43: Introduce scheduler field in VolumeEvictionController

Adding the scheduler *scheduler.ReplicaScheduler field to the VolumeEvictionController struct integrates the ReplicaScheduler for managing replica evictions.

69-70: Initialize ReplicaScheduler in NewVolumeEvictionController

Initializing the scheduler with scheduler.NewReplicaScheduler(ds) ensures that the controller can utilize the scheduling logic for replica eviction decisions.

derekbit

LGTM. Let @PhanLe1010 and @shuo-wu review as well.

longhorn/longhorn-9781 longhorn/longhorn-9857 Signed-off-by: Chin-Ya Huang <[email protected]>

coderabbitai

Actionable comments posted: 0

🧹 Outside diff range and nitpick comments (2)

controller/volume_eviction_controller.go (2)

69-70: Consider adding error handling for scheduler initialization

While the scheduler initialization is correct, consider handling potential initialization errors by modifying the scheduler's constructor to return an error, similar to other components in the codebase.

-vec.scheduler = scheduler.NewReplicaScheduler(ds)
+var err error
+vec.scheduler, err = scheduler.NewReplicaScheduler(ds)
+if err != nil {
+    return nil, errors.Wrap(err, "failed to initialize replica scheduler")
+}

219-242: Consider performance optimizations for disk candidate checking

The implementation is solid, but could benefit from the following optimizations:

Add early return for empty replicas map
Consider parallel processing for large replica sets
Add context support for potential cancellation

-func (vec *VolumeEvictionController) hasDiskCandidateForReplicaEviction(replicas map[string]*longhorn.Replica, volume *longhorn.Volume) bool {
+func (vec *VolumeEvictionController) hasDiskCandidateForReplicaEviction(ctx context.Context, replicas map[string]*longhorn.Replica, volume *longhorn.Volume) bool {
+    if len(replicas) == 0 {
+        return true
+    }
+
+    type result struct {
+        replica *longhorn.Replica
+        found   bool
+        err     error
+    }
+
+    resultChan := make(chan result, len(replicas))
+    for _, replica := range replicas {
+        go func(r *longhorn.Replica) {
+            replicaCopy := r.DeepCopy()
+            replicaCopy.Spec.HardNodeAffinity = ""
+
+            diskCandidates, multiError, err := vec.scheduler.FindDiskCandidates(replicaCopy, replicas, volume)
+            if err != nil {
+                resultChan <- result{replica: r, found: false, err: err}
+                return
+            }
+
+            if len(diskCandidates) == 0 {
+                aggregatedReplicaScheduledError := util.NewMultiError(longhorn.ErrorReplicaScheduleEvictReplicaFailed)
+                if multiError != nil {
+                    aggregatedReplicaScheduledError.Append(multiError)
+                }
+                vec.logger.Warnf("No disk candidates for evicting replica %q: %v", r.Name, aggregatedReplicaScheduledError.Join())
+                resultChan <- result{replica: r, found: false, err: nil}
+                return
+            }
+            resultChan <- result{replica: r, found: true, err: nil}
+        }(replica)
+    }
+
+    for range replicas {
+        select {
+        case <-ctx.Done():
+            return false
+        case res := <-resultChan:
+            if res.err != nil {
+                vec.logger.WithError(res.err).Warnf("Failed to find disk candidates for evicting replica %q", res.replica.Name)
+                return false
+            }
+            if !res.found {
+                return false
+            }
+        }
+    }
+
+    vec.logger.Infof("Found disk candidates for evicting replicas of volume %q", volume.Name)
+    return true
}

📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between 97ff87a and a4af88d.

📒 Files selected for processing (2)

controller/volume_eviction_controller.go (4 hunks)
k8s/pkg/apis/longhorn/v1beta2/node.go (1 hunks)

🚧 Files skipped from review as they are similar to previous changes (1)

k8s/pkg/apis/longhorn/v1beta2/node.go

🔇 Additional comments (3)

controller/volume_eviction_controller.go (3)

24-25: LGTM: Clean struct extension

The addition of the scheduler dependency and corresponding struct field is well-organized and follows Go conventions.

Also applies to: 43-43

209-211: LGTM: Improved eviction control logic

The additional check for disk candidates before creating the attachment ticket aligns well with the PR objectives. This should prevent unnecessary eviction attempts when no suitable disk candidates are available.

Let's verify the impact on volume eviction:

✅ Verification successful

Based on the search results, I can now verify the impact of the volume eviction changes. The code shows that:

The volume eviction controller creates attachment tickets to coordinate volume attachment/detachment during eviction
The attachment ticket system is used consistently across multiple controllers (backup, clone, snapshot, etc.)
The condition hasDiskCandidateForReplicaEviction() is used to prevent unnecessary attachment tickets when no suitable disk candidates are available

The change aligns with the existing attachment ticket pattern and adds an important optimization to prevent unnecessary eviction attempts.

LGTM: Verified eviction control logic

The additional check for disk candidates before creating the attachment ticket is a valid optimization that prevents unnecessary volume attachment/detachment cycles when no suitable disk candidates are available for eviction.

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Description: Check for any existing volume eviction patterns or issues
# Look for patterns of volume eviction in the codebase

# Check for volume eviction related error patterns
rg -A 3 "volume.*evict|evict.*volume" --type go

# Check for attachment ticket usage patterns
rg -A 3 "AttachmentTicket" --type go

Length of output: 67463

224-224: Verify scheduler's FindDiskCandidates implementation

Let's ensure the scheduler's FindDiskCandidates method properly handles all edge cases for replica eviction.

✅ Verification successful

FindDiskCandidates implementation is robust and well-tested

The implementation in scheduler/replica_scheduler.go is thoroughly designed with:

Comprehensive error handling using MultiError for non-fatal cases
Proper node candidate filtering with multiple validation checks
Disk candidate selection with proper resource and tag validation
Well-documented interface and implementation
Proper test coverage in scheduler/replica_scheduler_test.go

The usage in volume_eviction_controller.go correctly handles all error cases and empty disk candidates scenarios.

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Description: Check scheduler's FindDiskCandidates implementation
# Look for the implementation and its test coverage

# Find the implementation
ast-grep --pattern 'func $_($$) (_, _, error) {
  $$$
  FindDiskCandidates
  $$$
}'

# Check test coverage
rg -A 5 "TestFindDiskCandidates" --type go

Length of output: 128

Script:

#!/bin/bash
# Let's try a different approach to find the FindDiskCandidates implementation

# Search for the method declaration
ast-grep --pattern 'func $_FindDiskCandidates($$$) ($$$) {
  $$$
}'

# Also search for interface definition
ast-grep --pattern 'type $_ interface {
  $$$
  FindDiskCandidates($$$)
  $$$
}'

# Backup search using ripgrep
rg "FindDiskCandidates" -A 10 --type go

Length of output: 4115

Script:

#!/bin/bash
# Let's examine the scheduler implementation more thoroughly

# Check the getNodeCandidates implementation since it's crucial for FindDiskCandidates
ast-grep --pattern 'func $_ getNodeCandidates($$$) ($$$) {
  $$$
}'

# Look for test coverage of the scheduler
rg "TestReplicaScheduler" -A 5 --type go

# Check for any potential error handling patterns
rg "MultiError" -B 2 -A 2 scheduler/replica_scheduler.go

Length of output: 7348

…-evict

derekbit · 2024-12-03T15:17:08Z

@mergify backport v1.7.x v1.6.x

mergify · 2024-12-03T15:17:19Z

backport v1.7.x v1.6.x

✅ Backports have been created

#3303 fix: detached volume replicas not evicted (backport #3293) has been created for branch v1.7.x
#3304 fix: detached volume replicas not evicted (backport #3293) has been created for branch v1.6.x but encountered conflicts

c3y1huang self-assigned this Nov 27, 2024

c3y1huang mentioned this pull request Nov 27, 2024

[BUG] Detached Volume Stuck in Attached State During Node Eviction longhorn/longhorn#9781

Closed

coderabbitai bot reviewed Nov 27, 2024

View reviewed changes

c3y1huang marked this pull request as ready for review November 27, 2024 06:27

c3y1huang mentioned this pull request Nov 27, 2024

[BUG] Test case test_node_eviction_multiple_volume failed to reschedule replicas after volume detached longhorn/longhorn#9857

Closed

c3y1huang requested review from derekbit and a team November 27, 2024 06:30

derekbit requested review from PhanLe1010 and shuo-wu November 28, 2024 04:37

derekbit approved these changes Nov 28, 2024

View reviewed changes

fix: detached volume replicas not evicted

a4af88d

longhorn/longhorn-9781 longhorn/longhorn-9857 Signed-off-by: Chin-Ya Huang <[email protected]>

derekbit force-pushed the 9781-fix-detached-volume-replica-failed-to-evict branch from 97ff87a to a4af88d Compare December 3, 2024 08:47

coderabbitai bot reviewed Dec 3, 2024

View reviewed changes

Merge branch 'master' into 9781-fix-detached-volume-replica-failed-to…

873db45

…-evict

derekbit merged commit bb0a41d into longhorn:master Dec 3, 2024
9 checks passed

This was referenced Dec 3, 2024

fix: detached volume replicas not evicted (backport #3293) #3303

Merged

fix: detached volume replicas not evicted (backport #3293) #3304

Merged

c3y1huang deleted the 9781-fix-detached-volume-replica-failed-to-evict branch December 4, 2024 03:22

This was referenced Dec 11, 2024

fix: clear fast-failover status when node goes not ready #3340

Merged

feat: support v2 volume live migration #3323

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: detached volume replicas not evicted #3293

fix: detached volume replicas not evicted #3293

c3y1huang commented Nov 27, 2024

coderabbitai bot commented Nov 27, 2024 •

edited

Loading

Chat

CodeRabbit Commands (Invoked using PR comments)

Other keywords and placeholders

Documentation and Community

c3y1huang commented Nov 27, 2024

coderabbitai bot commented Nov 27, 2024

coderabbitai bot left a comment

derekbit left a comment

coderabbitai bot left a comment

derekbit commented Dec 3, 2024

mergify bot commented Dec 3, 2024 •

edited

Loading

fix: detached volume replicas not evicted #3293

fix: detached volume replicas not evicted #3293

Conversation

c3y1huang commented Nov 27, 2024

Which issue(s) this PR fixes:

What this PR does / why we need it:

Special notes for your reviewer:

Additional documentation or context

coderabbitai bot commented Nov 27, 2024 • edited Loading

Walkthrough

Changes

Assessment against linked issues

Possibly related PRs

Chat

CodeRabbit Commands (Invoked using PR comments)

Other keywords and placeholders

Documentation and Community

c3y1huang commented Nov 27, 2024

coderabbitai bot commented Nov 27, 2024

coderabbitai bot left a comment

Choose a reason for hiding this comment

derekbit left a comment

Choose a reason for hiding this comment

coderabbitai bot left a comment

Choose a reason for hiding this comment

derekbit commented Dec 3, 2024

mergify bot commented Dec 3, 2024 • edited Loading

✅ Backports have been created

coderabbitai bot commented Nov 27, 2024 •

edited

Loading

mergify bot commented Dec 3, 2024 •

edited

Loading