Is your feature request related to a problem? Please describe
With remote store, each refresh requires the primary to upload its local segments to the remote store, and the replica then has to download the same segments on its end. This round trip adds significant lag on the replica, so queries served from the replica can return stale data.
Describe the solution you'd like
We can optimise search queries with an optimistic protocol: the replica sends a request to the primary to check whether it has the blocks of data needed to serve real-time query results. If it does, the data blocks can be sent to the replica over the wire, avoiding the S3 upload/download path, and real-time queries can be served from the replica.
There are caveats with data sync and the amount of data that needs to be copied over based on ingestion volume and refresh rates. The approach needs to be benchmarked and tested for scale before this can be fully realised.
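To make the proposed flow concrete, here is a minimal in-memory sketch of the optimistic path with a fallback to the remote store. It is only an illustration, not OpenSearch code: `Primary`, `RemoteStore`, `Replica`, `try_fetch`, and `max_transfer_bytes` are all hypothetical names, and the size cap is an assumed stand-in for the data-volume caveat mentioned above.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional, Set


@dataclass
class Primary:
    """Hypothetical primary shard holding freshly refreshed segments locally."""
    local_segments: Dict[str, bytes] = field(default_factory=dict)
    max_transfer_bytes: int = 1024  # assumed cap on what is sent over the wire

    def try_fetch(self, wanted: Set[str]) -> Optional[Dict[str, bytes]]:
        """Return the wanted segments, or None if any is missing or the payload is too large."""
        if not wanted.issubset(self.local_segments):
            return None
        payload = {name: self.local_segments[name] for name in wanted}
        if sum(len(data) for data in payload.values()) > self.max_transfer_bytes:
            return None
        return payload


@dataclass
class RemoteStore:
    """Hypothetical stand-in for the remote (e.g. S3-backed) segment store."""
    objects: Dict[str, bytes] = field(default_factory=dict)

    def download(self, wanted: Set[str]) -> Dict[str, bytes]:
        # Only segments that have already been uploaded can be downloaded.
        return {name: self.objects[name] for name in wanted if name in self.objects}


@dataclass
class Replica:
    segments: Dict[str, bytes] = field(default_factory=dict)

    def sync(self, latest: Set[str], primary: Primary, remote: RemoteStore) -> str:
        """Bring the replica up to `latest`, preferring the direct primary path."""
        missing = latest - set(self.segments)
        if not missing:
            return "up-to-date"
        payload = primary.try_fetch(missing)  # optimistic request to the primary
        if payload is not None:
            self.segments.update(payload)
            return "primary"
        # Fall back to the existing upload/download path through the remote store.
        self.segments.update(remote.download(missing))
        return "remote"
```

The replica asks the primary first and uses the remote store only when the primary declines, so the existing path keeps working unchanged. Whether a byte cap is the right criterion for declining is exactly the kind of question the benchmarking would need to answer.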
Related component
Storage:Remote
Describe alternatives you've considered
No response
Additional context
No response
Do you have some idea of what the user experience would be here? Would this be something for a user to opt in to? For example, use this feature if you want replication delays equal to or better than document replication, but do not use this feature if you want to maximize ingest and search throughput.