[Feature Request] Auto-recovery of cluster after hardware failure w/ remote store #11921

Open
Bukhtawar opened this issue Jan 18, 2024 · 0 comments
Labels
enhancement Enhancement or improvement to existing feature or request Storage:Durability Issues and PRs related to the durability framework

Comments

@Bukhtawar
Collaborator

Is your feature request related to a problem? Please describe

Today, auto-restore cannot recover a cluster automatically because of cases such as isolated primaries (#3706), especially when an index has no replicas. For those cases we need a robust mechanism to ensure we don't end up with divergent writes.

Describe the solution you'd like

One such mechanism to support zero replicas is to use an empty replica that hosts no data, only the shard's metadata, so that it doesn't add storage cost. This replica would perform continuous no-op replication on every indexing request and, on failure of the primary, could be promoted to primary once the data has been synced from S3. This simplifies the problems with isolated writers and makes the replication protocol easy to reason about.
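To make the idea concrete, here is a rough toy model of the flow, not OpenSearch code. All names (`RemoteStore`, `MetadataReplica`, `Primary`, `noop_replicate`, `promote`) are hypothetical, and the ordering (fence check on the replica first, then upload to the remote store) is an assumption made for illustration. The replica tracks only the primary term and the highest sequence number, and a promotion bumps the term so that a stale primary is rejected on its next write.

```python
from dataclasses import dataclass, field
from typing import Optional


class StalePrimaryError(Exception):
    """Raised when a fenced (isolated) primary tries to complete a write."""


@dataclass
class RemoteStore:
    # Stand-in for the remote store (for example S3): seq_no -> document.
    docs: dict = field(default_factory=dict)


@dataclass
class MetadataReplica:
    # Holds shard metadata only; no documents are stored here.
    primary_term: int = 1
    max_seq_no: int = -1

    def noop_replicate(self, term: int, seq_no: int) -> None:
        # Reject operations coming from a primary with an older term.
        if term < self.primary_term:
            raise StalePrimaryError(f"term {term} < {self.primary_term}")
        self.primary_term = term
        self.max_seq_no = max(self.max_seq_no, seq_no)

    def promote(self, remote: RemoteStore) -> "Primary":
        # Bump the term first so the old primary is fenced,
        # then hydrate the new primary from the remote store.
        self.primary_term += 1
        new_primary = Primary(term=self.primary_term, remote=remote)
        new_primary.local_docs = dict(remote.docs)
        new_primary.next_seq_no = self.max_seq_no + 1
        return new_primary


@dataclass
class Primary:
    term: int
    remote: RemoteStore
    replica: Optional[MetadataReplica] = None
    local_docs: dict = field(default_factory=dict)
    next_seq_no: int = 0

    def index(self, doc: str) -> int:
        seq_no = self.next_seq_no
        # No-op replication: only (term, seq_no) is sent, never the document.
        if self.replica is not None:
            self.replica.noop_replicate(self.term, seq_no)
        # Assumed ordering: upload only after the replica accepted the op.
        self.remote.docs[seq_no] = doc
        self.local_docs[seq_no] = doc
        self.next_seq_no += 1
        return seq_no


remote = RemoteStore()
replica = MetadataReplica()
old_primary = Primary(term=1, remote=remote, replica=replica)
old_primary.index("a")
old_primary.index("b")

# The primary is presumed failed; the metadata-only replica takes over.
new_primary = replica.promote(remote)
```

In this sketch, a later `old_primary.index("c")` raises `StalePrimaryError` before anything reaches the remote store, which is the divergent-write protection described above. A real design would also need to allocate a fresh metadata replica for the new primary and handle upload failures after the replica has accepted an operation; both are left out here.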

Related component

Storage:Durability

Describe alternatives you've considered

No response

Additional context

No response

@Bukhtawar Bukhtawar added enhancement Enhancement or improvement to existing feature or request untriaged labels Jan 18, 2024
@github-actions github-actions bot added the Storage:Durability Issues and PRs related to the durability framework label Jan 18, 2024
@Bukhtawar Bukhtawar moved this from 🆕 New to Later (6 months plus) in Storage Project Board May 16, 2024
Projects
Status: Later (6 months plus)