[Feature Request] Reduce refresh lag for remote store #11898

Bukhtawar · 2024-01-16T11:52:41Z

Is your feature request related to a problem? Please describe

With remote store, each refresh requires the primary to upload its local segments to the remote store, and the replica then needs to download the same segments on its end. This adds significant lag on the replica, leading to stale data when queries are served from the replica.

Describe the solution you'd like

We can optimise search queries to use an optimistic protocol that sends a request to the primary to check whether it has the blocks of data needed to serve real-time query results. If it does, the data blocks can be sent to the replica over the wire, avoiding the S3 upload/download path, and real-time queries can be served from the replica.
There are caveats around data sync and the amount of data that needs to be copied over, which depends on ingestion volume and refresh rates. The approach needs to be benchmarked and tested at scale before it can be fully realised.
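
To make the proposed flow concrete, here is a rough sketch of the replica-side decision. It is a toy in-memory model, not OpenSearch code: `Primary`, `Replica`, `fetch_blocks` and `ensure_segments` are hypothetical names, the remote store is a plain dict standing in for S3, and the all-or-nothing answer from the primary is an assumption.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Primary:
    # Segment files from the latest refresh, keyed by file name (hypothetical model).
    local_segments: dict[str, bytes] = field(default_factory=dict)

    def fetch_blocks(self, names: list[str]) -> dict[str, bytes] | None:
        """Return the requested blocks only if the primary holds all of them."""
        if all(name in self.local_segments for name in names):
            return {name: self.local_segments[name] for name in names}
        return None


@dataclass
class Replica:
    primary: Primary
    remote_store: dict[str, bytes]  # stands in for the S3-backed remote store
    local_segments: dict[str, bytes] = field(default_factory=dict)

    def ensure_segments(self, names: list[str]) -> str:
        """Make `names` available locally and report which path supplied them."""
        missing = [n for n in names if n not in self.local_segments]
        if not missing:
            return "local"
        # Optimistic request to the primary, over the wire.
        blocks = self.primary.fetch_blocks(missing)
        if blocks is not None:
            self.local_segments.update(blocks)
            return "primary"
        # Fallback: the existing remote store download path.
        # Raises KeyError if a segment has not been uploaded yet.
        for n in missing:
            self.local_segments[n] = self.remote_store[n]
        return "remote"
```

For example, with a primary holding `_0.si` and `_1.si` and a remote store that only has `_0.si` so far, `Replica(primary, remote_store).ensure_segments(["_0.si", "_1.si"])` takes the `"primary"` path. The sketch does not model how much data gets copied per refresh, which is the part that needs benchmarking.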

Related component

Storage:Remote

Describe alternatives you've considered

No response

Additional context

No response

andrross · 2024-01-16T19:10:35Z

Do you have some idea of what the user experience would be here? Would this be something for a user to opt in to? For example, use this feature if you want replication delays equal to or better than document replication, but do not use this feature if you want to maximize ingest and search throughput.

linuxpi · 2024-05-02T15:31:20Z

[Storage Triage - attendees 1 2 3 4 5 6 7 8 9 10 11 12]

@Bukhtawar thanks for opening this issue. This looks promising for certain use cases.

We would require deeper investigation and some benchmarks to make a decision here.

Bukhtawar added the enhancement (Enhancement or improvement to existing feature or request) and untriaged labels Jan 16, 2024

github-actions bot added the Storage:Remote label Jan 16, 2024

Bukhtawar removed the untriaged label Jan 16, 2024

Bukhtawar added this to Storage Project Board Feb 15, 2024

github-project-automation bot moved this to 🆕 New in Storage Project Board Feb 15, 2024

linuxpi moved this from 🆕 New to Later (6 months plus) in Storage Project Board May 2, 2024
