Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add auto-generation of index mappings & settings based on processors #552

Merged
merged 4 commits into from
Dec 28, 2024

Conversation

ohltyler
Copy link
Member

@ohltyler ohltyler commented Dec 27, 2024

Description

This PR adds default index mappings and index settings as ML processors are added / removed from ingest pipelines in real-time. The main idea is to add/remove any knn_vector field mappings, and update index.knn to true/false as users add or remove ML-related ingest processors, that contain any known embedding models. These 2 settings are critical and required for proper knn search to happen. Currently, the backend does not show any errors if these aren't configured correctly, and leads to user confusion when later seeing that there are no embeddings added. By adding this, this helps minimize users missing these configurations if knn search is desired.

Testing:

  • tested all different output map transformation types on the UI, and how adding / removing one/multiple ML processors, including the individual output maps, ensure the behavior is as expected
  • ensured existing presets still work as expected

Demo video, showing all of the different output map transform types, and how adding/removing processors updates the fields in real-time.

screen-capture.14.webm

Issues Resolved

Resolves #547

Check List

  • Commits are signed per the DCO using --signoff

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

Signed-off-by: Tyler Ohlsen <[email protected]>
@ohltyler ohltyler merged commit 08b97d3 into opensearch-project:main Dec 28, 2024
6 of 7 checks passed
@ohltyler ohltyler deleted the index-auto-config branch December 28, 2024 02:44
opensearch-trigger-bot bot pushed a commit that referenced this pull request Dec 28, 2024
…552)

Signed-off-by: Tyler Ohlsen <[email protected]>
(cherry picked from commit 08b97d3)
Signed-off-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
ohltyler pushed a commit that referenced this pull request Dec 28, 2024
…552) (#553)

(cherry picked from commit 08b97d3)

Signed-off-by: Tyler Ohlsen <[email protected]>
Signed-off-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[ENHANCEMENT] Improve index settings & mappings configurations
2 participants