[Feature Request] Reduce refresh lag for remote store #11898

Open
Bukhtawar opened this issue Jan 16, 2024 · 2 comments
Labels
enhancement (Enhancement or improvement to existing feature or request), Storage:Remote

Comments

@Bukhtawar
Collaborator

Is your feature request related to a problem? Please describe

With remote store, each refresh requires the primary to upload its local segments to the remote store, after which the replica must download the same segments on its end. This round trip adds significant lag on the replica, so queries served by the replica can return stale data.

Describe the solution you'd like

We can optimise search queries with an optimistic protocol that sends a request to the primary to check whether it has the blocks of data needed to serve real-time query results. If it does, the data blocks can be sent to the replica over the wire, avoiding the S3 upload/download path, and real-time queries can then be served from the replica.
There are caveats around data sync and the amount of data that needs to be copied, which depends on ingestion volume and refresh rates. The approach needs to be benchmarked and tested at scale before it can be fully realised.
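
To make the idea concrete, here is a rough sketch of the flow, not a proposed implementation. All names (`Primary`, `Replica`, `RemoteStore`, `try_fetch`, `ensure_blocks`) are hypothetical and do not correspond to existing OpenSearch classes; blocks are modelled as plain byte strings keyed by an id, and the remote store is an in-memory stand-in for S3.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterable, List, Optional


@dataclass
class Primary:
    """Hypothetical primary shard holding freshly refreshed blocks."""
    local_blocks: Dict[str, bytes] = field(default_factory=dict)

    def try_fetch(self, block_ids: Iterable[str]) -> Optional[Dict[str, bytes]]:
        # Optimistic check: answer only if every requested block is present.
        ids = list(block_ids)
        if all(block_id in self.local_blocks for block_id in ids):
            return {block_id: self.local_blocks[block_id] for block_id in ids}
        return None


@dataclass
class RemoteStore:
    """In-memory stand-in for the remote segment store (for example S3)."""
    blocks: Dict[str, bytes] = field(default_factory=dict)

    def download(self, block_ids: Iterable[str]) -> Dict[str, bytes]:
        return {block_id: self.blocks[block_id] for block_id in block_ids}


@dataclass
class Replica:
    """Hypothetical replica that prefers the primary over the remote store."""
    primary: Primary
    remote: RemoteStore
    local_blocks: Dict[str, bytes] = field(default_factory=dict)

    def ensure_blocks(self, needed: List[str]) -> str:
        """Make `needed` blocks available locally and report where they came from."""
        missing = [b for b in needed if b not in self.local_blocks]
        if not missing:
            return "local"
        fetched = self.primary.try_fetch(missing)
        if fetched is not None:
            # Fast path: blocks copied over the wire, no remote round trip.
            self.local_blocks.update(fetched)
            return "primary"
        # Fallback: the existing remote store download path.
        self.local_blocks.update(self.remote.download(missing))
        return "remote"
```

The sketch only shows the decision between the wire path and the remote path. It deliberately ignores the caveats above, such as how much data a single request may copy and how the primary is protected from extra load.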

Related component

Storage:Remote

Describe alternatives you've considered

No response

Additional context

No response

Bukhtawar added the enhancement and untriaged labels on Jan 16, 2024
@andrross
Member

Do you have some idea of what the user experience would be here? Would this be something for a user to opt in to? For example, use this feature if you want replication delays equal to or better than document replication, but do not use this feature if you want to maximize ingest and search throughput.

@linuxpi
Collaborator

linuxpi commented May 2, 2024

[Storage Triage - attendees 1 2 3 4 5 6 7 8 9 10 11 12]

@Bukhtawar thanks for opening this issue. This looks promising for certain use cases.

We would require deeper investigation and some benchmarks to make a decision here.

linuxpi moved this from 🆕 New to Later (6 months plus) in Storage Project Board on May 2, 2024