Add auto-generation of index mappings & settings based on processors #552

ohltyler · 2024-12-27T22:47:34Z

Description

This PR adds default index mappings and index settings as ML processors are added / removed from ingest pipelines in real-time. The main idea is to add/remove any knn_vector field mappings, and update index.knn to true/false as users add or remove ML-related ingest processors, that contain any known embedding models. These 2 settings are critical and required for proper knn search to happen. Currently, the backend does not show any errors if these aren't configured correctly, and leads to user confusion when later seeing that there are no embeddings added. By adding this, this helps minimize users missing these configurations if knn search is desired.

Testing:

tested all different output map transformation types on the UI, and how adding / removing one/multiple ML processors, including the individual output maps, ensure the behavior is as expected
ensured existing presets still work as expected

Demo video, showing all of the different output map transform types, and how adding/removing processors updates the fields in real-time.

screen-capture.14.webm

Issues Resolved

Resolves #547

Check List

Commits are signed per the DCO using --signoff

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

Signed-off-by: Tyler Ohlsen <[email protected]>

…552) Signed-off-by: Tyler Ohlsen <[email protected]> (cherry picked from commit 08b97d3) Signed-off-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

…552) (#553) (cherry picked from commit 08b97d3) Signed-off-by: Tyler Ohlsen <[email protected]> Signed-off-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

ohltyler added 3 commits December 27, 2024 09:33

Refactor dimension fetching to standalone fn

d0986f4

Signed-off-by: Tyler Ohlsen <[email protected]>

Update index settings based on ingest processors

29e3a3c

Signed-off-by: Tyler Ohlsen <[email protected]>

Auto-update mappings

56b61c2

Signed-off-by: Tyler Ohlsen <[email protected]>

ohltyler added enhancement New feature or request backport 2.x workflow editor labels Dec 27, 2024

ohltyler requested review from dbwiddis, owaiskazi19, joshpalis, amitgalitz, jackiehanyang, minalsha and saimedhi as code owners December 27, 2024 22:47

cleanup

9a24f70

Signed-off-by: Tyler Ohlsen <[email protected]>

joshpalis approved these changes Dec 28, 2024

View reviewed changes

ohltyler merged commit 08b97d3 into opensearch-project:main Dec 28, 2024
6 of 7 checks passed

ohltyler deleted the index-auto-config branch December 28, 2024 02:44

opensearch-trigger-bot bot mentioned this pull request Dec 28, 2024

[Backport 2.x] Add auto-generation of index mappings & settings based on processors #553

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add auto-generation of index mappings & settings based on processors #552

Add auto-generation of index mappings & settings based on processors #552

ohltyler commented Dec 27, 2024 •

edited

Loading

Add auto-generation of index mappings & settings based on processors #552

Add auto-generation of index mappings & settings based on processors #552

Conversation

ohltyler commented Dec 27, 2024 • edited Loading

Description

Issues Resolved

Check List

ohltyler commented Dec 27, 2024 •

edited

Loading