-
Notifications
You must be signed in to change notification settings - Fork 509
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[DOC] Term Query documentation does not warn of performance impact of case_insensitive searches #9028
Comments
We may also want to treat this as a bug in OpenSearch. Specifically, we should set a value for Elsewhere -- like in IMO, we should:
That way, folks using 2.19 can at least safeguard themselves by explicitly setting |
I'm going to turn my previous message into an issue for https://github.com/opensearch-project/OpenSearch |
@dbwiddis: Assigned this to you. |
@msfroh do you think we'll address your bug before 2.19.0? That may change how I write this. |
I'll try to get a fix in for next week. I'd really like to have the opt-in safety in 2.19 before making it opt-out in 3.0. |
I'll track that change and do the doc PR for it here! |
What do you want to do?
Tell us about your request. Provide a summary of the request.
The documentation for term query includes the
case_insensitive
parameter.Due to the implementation details of this type of search, every (alphabetic) character in such a query doubles the complexity of the search, consuming a lot of heap memory and potentially crashing nodes due to high CPU with GC thrashing. Even a relatively short search term (about 16 characters) could result in nearly 8 GB of heap.
This potential impact should be highlighted in the documentation as a warning. Additionally there are preferred strategies for doing case insensitive searches that should be presented as an alternative in the docs.
I'm happy to write such docs but would hope to get some assistance from @msfroh in validating them technically.
Version: List the OpenSearch version to which this issue applies, e.g. 2.14, 2.12--2.14, or all.
all
The text was updated successfully, but these errors were encountered: