
[Remote Vector Index Build] Add vector data upload implementation to RemoteIndexBuildStrategy #2550

Merged. 1 commit merged into opensearch-project:main from remote-vector-dev on Feb 26, 2025.

Conversation


@jed326 jed326 commented Feb 21, 2025

Description

This PR adds the implementation and tests for uploading vector and doc ID blobs to a remote repository via RemoteIndexBuildStrategy. It also introduces a new knn.remote_index_build.size_threshold setting that controls the size threshold above which the remote index build service is used.


Vector Upload Details

For a detailed explanation on the vector uploads, please see the Parallel Blob Upload section from the LLD: #2465

At a high level, the flow is as follows (currently only repository-s3 supports parallel blob uploads via this interface, so the steps below reference S3 directly):

  1. The k-NN plugin provides an InputStream supplier to asyncBlobUpload (see: RemoteIndexBuildStrategy#getTransferPartStreamSupplier).
  2. repository-s3 determines the number of parts (N) to split the blob into based on repository settings. By default we use 16 MB parts.
  3. The repository uses the supplier from [1] to create N VectorValueInputStreams, each representing only a specific part of the underlying KNNVectorValues.
  4. repository-s3 performs a multipart upload request with each stream as an upload part. The number of parts uploaded in parallel depends on the upload priority and thread pool configuration in the repository settings.

Because KNNVectorValues is an iterator, we need to create N instances of it; otherwise, iterating through all of the vectors would still happen sequentially, which would greatly reduce throughput. VectorValueInputStream takes a constructor parameter to position the head of the input stream at a specific byte offset in the KNNVectorValues, including offsets within a given vector. For example, for doc ID X the vector could be read partially by InputStream part 1 and partially by InputStream part 2.
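The offset arithmetic described above can be sketched as follows. This is a hypothetical illustration only: the class and method names are assumptions, not the plugin's actual API. Given fixed-size vectors, a part's starting byte offset maps to a vector index plus a byte offset within that vector:

```java
// Hypothetical sketch of the part-positioning arithmetic described above.
// Class and method names are illustrative, not the plugin's actual classes.
public class PartOffsets {
    /**
     * For part `partIndex` of an upload split into parts of `partSizeBytes`,
     * return { index of the vector the part starts in, byte offset within that vector }.
     */
    public static long[] partStart(long partIndex, long partSizeBytes, int bytesPerVector) {
        long byteOffset = partIndex * partSizeBytes;        // absolute start of this part
        long vectorIndex = byteOffset / bytesPerVector;     // which vector it lands in
        long offsetInVector = byteOffset % bytesPerVector;  // position inside that vector
        return new long[] { vectorIndex, offsetInVector };
    }
}
```

For example, with 384-dimensional float vectors (1536 bytes each) and 16 MB parts, part 1 begins 1024 bytes into vector 10922, which is exactly the mid-vector split case described above.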


This PR DOES NOT include:

For reference, this is the blob path being used:

<BASE_PATH>/<index UUID>_vectors/<UUID>_<field name>_<segment name>.knnvec|.knndid

Related Issues

Relates #2465
Relates #2392
Relates #2391

Check List

  • [x] New functionality includes testing.
  • [ ] New functionality has been documented.
  • [ ] API changes companion pull request created.
  • [x] Commits are signed per the DCO using --signoff.
  • [ ] Public documentation issue/PR created.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

@jed326

jed326 commented Feb 21, 2025

@jmazanec15 @shatejas @Vikasht34 @navneet1v could you please help review? thanks!


@navneet1v navneet1v left a comment


@jed326 can you please add the flow of functions used by the repository service for the InputStream? Since there are some overridden functions, it's hard to follow which function is called when.

It would also be good if you could add a package-info.java explaining the flow and usage of the InputStream in more detail.

}

bytesRemaining--;
return currentBuffer.get() & 0xFF;
Collaborator

Why do we need the & here?


@jed326 jed326 Feb 21, 2025


We technically don't (I added a code comment explaining this); however, I think it's safer to keep it here. IntelliJ also warns if you don't do the conversion, and all the base InputStream implementations do the same.

I also added some tests to verify that read() and read(byte[] b, int offset, int length) read the same data over a given stream.
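For background on the discussion above: Java's byte is signed, so widening it to int without masking can collide with InputStream's end-of-stream sentinel. A minimal illustration (not the PR's code):

```java
// Illustrative only: why read() should mask with 0xFF.
// InputStream.read() must return an int in [0, 255], or -1 for end-of-stream.
// The byte 0xFF widens to the int -1, which a caller would misread as EOF.
public class MaskExample {
    public static int widenedDirectly(byte b) {
        return b; // 0xFF becomes -1: indistinguishable from end-of-stream
    }

    public static int widenedWithMask(byte b) {
        return b & 0xFF; // 0xFF becomes 255: the correct unsigned value
    }
}
```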

@navneet1v

@jed326 let's avoid using the skip-changelog tag; update the changelog instead.

@jed326 jed326 force-pushed the remote-vector-dev branch 2 times, most recently from b3c4c85 to af72c3b (February 24, 2025 19:04)
@jed326 jed326 changed the title Add vector data upload implementation to RemoteIndexBuildStrategy [Remote Vector Index Build] Add vector data upload implementation to RemoteIndexBuildStrategy Feb 24, 2025
@jed326 jed326 force-pushed the remote-vector-dev branch 2 times, most recently from a8063b5 to 8f996e3 (February 25, 2025 04:18)

@jmazanec15 jmazanec15 left a comment


Overall, looks good to me. A few minor comments, but nothing blocking.

StopWatch stopWatch;
long time_in_millis;
try {
stopWatch = new StopWatch().start();
// We create a new time based UUID per file in order to avoid conflicts across shards. It is also very difficult to get the
Member

What makes this time based?


jmazanec15 previously approved these changes Feb 25, 2025

@shatejas shatejas left a comment


Looks good overall. A few things related to maintenance and UTs


assertTrue(asyncUpload.get());
assertTrue(upload.get());
verify(mockBlobStore).blobContainer(any());
Collaborator

Since you already have a testBasePath, can you replace any() with it?

Contributor Author

Unfortunately, BlobPath path = getRepository().basePath().add(indexSettings.getUUID() + VECTORS_PATH); creates a new BlobPath instance instead of modifying the existing one, so I would need to mock BlobPath itself as well as set up all the mocks for IndexSettings. Since we're using our own custom blobContainer implementation here, I don't think that's necessary for this test.


@navneet1v navneet1v left a comment


Overall, the code looks good to me. Please address @shatejas's comments, especially those related to creating the repo/data-access class.

@jed326

jed326 commented Feb 26, 2025

Thanks @navneet1v @shatejas @jmazanec15, I've added a new VectorRepositoryAccessor interface with a default implementation, DefaultVectorRepositoryAccessor, to handle all of the KNNVectorValues -> InputStream conversions. Please take another look whenever you get a chance, thanks!

.expectedChecksum(null)
.build();

AtomicReference<Exception> exception = new AtomicReference<>();
Collaborator

Since we are dealing with callbacks and an async flow, a better option than AtomicReference would be CompletableFuture.

Contributor Author

I looked into this some more, and I think the existing implementation is still preferable to CompletableFuture. The reason is that asyncBlobUpload does not throw an exception on upload failures; instead, it propagates the failure to whatever ActionListener is passed into it.

If we wanted to do this with CompletableFuture without an AtomicReference, we would need to throw the exception within the ActionListener, which is not correct behavior, as the listener is not intended to interrupt execution flows.
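For context on this exchange, the standard way to bridge the two styles is to complete a future from inside the listener rather than throwing from it. The sketch below is a generic adapter for illustration; the ActionListener interface here is a simplified stand-in, not OpenSearch's actual class:

```java
import java.util.concurrent.CompletableFuture;
import java.util.function.Consumer;

// Illustrative sketch: bridge a callback-style listener to a CompletableFuture
// without throwing from inside the listener.
public class ListenerFutureAdapter {
    // Simplified stand-in for a listener interface; not OpenSearch's ActionListener.
    public interface ActionListener<T> {
        void onResponse(T response);
        void onFailure(Exception e);
    }

    public static <T> CompletableFuture<T> toFuture(Consumer<ActionListener<T>> asyncCall) {
        CompletableFuture<T> future = new CompletableFuture<>();
        asyncCall.accept(new ActionListener<T>() {
            @Override public void onResponse(T response) { future.complete(response); }
            @Override public void onFailure(Exception e) { future.completeExceptionally(e); }
        });
        return future;
    }
}
```

The listener never throws; success and failure both flow into the future, and the caller decides whether to block (join) or compose further stages.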

Collaborator

This is how it should work.

CompletableFuture<Void> vectorUploadFuture;

if (blobContainer instanceof AsyncMultiStreamBlobContainer) {
    log.info("Using parallel upload for repository: {}", repository);
    vectorUploadFuture = CompletableFuture.runAsync(() -> {
        try {
            uploadVectorsAsync(blobContainer, blobName, vectorBlobLength, vectorDataType, knnVectorValuesSupplier);
        } catch (IOException e) {
            throw new RuntimeException("Vector upload failed", e);
        }
    });
} else {
    log.info("Using sequential upload for repository: {}", repository);
    vectorUploadFuture = CompletableFuture.runAsync(() -> {
        try {
            uploadVectorsSequentially(blobContainer, blobName, vectorBlobLength, vectorDataType, knnVectorValues);
        } catch (IOException e) {
            throw new RuntimeException("Sequential vector upload failed", e);
        }
    });
}

CompletableFuture<Void> docIdUploadFuture = CompletableFuture.runAsync(() -> {
    try {
        writeDocIds(knnVectorValues, totalLiveDocs, blobName, blobContainer);
    } catch (IOException e) {
        throw new RuntimeException("Doc ID upload failed", e);
    }
});

// Wait for both tasks to complete
CompletableFuture.allOf(vectorUploadFuture, docIdUploadFuture).join();


@Override
public int read(byte[] b, int off, int len) throws IOException {
if (currentBuffer == null) {
Collaborator

nit: Can we have a boolean instead of making currentBuffer null, like an end-of-stream flag?

Contributor Author

I'm not against using a boolean, but I think it's important to set currentBuffer to null to enforce that we do not read extra bytes out of the buffer. An NPE here would then mean a fatal/unrecoverable error in the remote upload process, so we want to fail and fall back to the fallbackStrategy. Without setting currentBuffer to null, I think we're at risk of reading extra bytes.
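The null-as-end-of-stream convention being discussed can be illustrated with a minimal stream. This is a hypothetical class for illustration, not the PR's VectorValuesInputStream:

```java
import java.io.InputStream;
import java.nio.ByteBuffer;

// Minimal illustration of the pattern under discussion: the buffer reference
// is nulled out at end-of-stream so any further buffered access fails fast
// (with an NPE) rather than silently returning stale bytes.
public class NullSentinelStream extends InputStream {
    private ByteBuffer currentBuffer;

    public NullSentinelStream(byte[] data) {
        this.currentBuffer = ByteBuffer.wrap(data);
    }

    @Override
    public int read() {
        if (currentBuffer == null) {
            return -1; // stream exhausted
        }
        int value = currentBuffer.get() & 0xFF; // mask to return an unsigned value
        if (!currentBuffer.hasRemaining()) {
            currentBuffer = null; // mark end of stream
        }
        return value;
    }
}
```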


@shatejas shatejas left a comment


Looks good, no major concerns as such.


@Vikasht34 Vikasht34 left a comment


LGTM

@jmazanec15 jmazanec15 merged commit 5873add into opensearch-project:main Feb 26, 2025
35 checks passed