clp-s: Several issues searching for logs that contain escaped characters. #590
Comments
Reproduced with permission from Zulip, here is a longer example:
Double quotes (4000): searching on the log field doesn't work, even when escaping or double-escaping the quotes. A workaround is to use wildcards.
The draft PR fixes most of these issues, except for 4002 and 4003. Those two cases seem to run into another issue in Grep::process_raw_query, where seemingly correct query strings generate no relevant subqueries. Interestingly, this appears to be related to the last quote, e.g. the query
The issue with 4001 is actually just a bash issue: it turns out that bash does not provide any mechanism to escape single quotes (
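To illustrate the shell quoting point, here is a minimal Python sketch using the standard library's shlex module, which follows POSIX shell quoting rules rather than bash itself. The value "it's" is a hypothetical stand-in and is not taken from the 4001 case. It shows that a backslash inside a single-quoted string stays literal, and that the usual workaround is to close the single-quoted string, emit the quote inside double quotes, and reopen.

```python
import shlex

# Hypothetical value containing a single quote (not the actual 4001 log text).
value = "it's"

# shlex.quote produces a form that is safe to paste into a POSIX shell command.
# It closes the single-quoted string, adds the quote inside double quotes,
# then reopens the single-quoted string.
quoted = shlex.quote(value)

# Splitting the quoted form with shell rules recovers the original value.
round_tripped = shlex.split(quoted)

# Inside single quotes a backslash is an ordinary character, so it cannot be
# used to escape a single quote: the quote after it ends the string.
backslash_inside_single_quotes = shlex.split(r"'a\'")

print(quoted)
print(round_tripped)
print(backslash_inside_single_quotes)
```

When passing a query on the command line, the same close-and-reopen pattern can be typed by hand, or the query can be wrapped in double quotes with the inner double quotes backslash-escaped.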
Bug
Search does not work as expected when searching against JSON values that contain escaped characters. This is likely an issue with how string predicates are un-escaped, both for clp-style search and for wildcard matching.
Importantly, clp-s does not un-escape raw JSON values before ingesting them, which causes some edge cases that we are not currently considering during search.
For example, the value in
{"key": "a: \"bcde\""}
gets ingested verbatim as
a: \"bcde\"
However, the search
*: "a: \"bcde\""
fails to return the matching result.
CLP version
0.1.2
Environment
clp-json package.
Reproduction steps
Ingest
{"key": "a: \"bcde\""}
Perform the query
*: "a: \"bcde\""
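The mismatch described above can be sketched in a few lines of Python. This is only an illustration of one plausible cause, not clp-s's actual code: the helpers unescape_once and query_value are hypothetical, and the sketch assumes that the stored value keeps its JSON escapes while the query literal is un-escaped once before comparison.

```python
import json

# The document from the reproduction steps, exactly as written.
RAW_DOCUMENT = r'{"key": "a: \"bcde\""}'

# Assumption: the value is stored verbatim, with its JSON escapes left in place.
STORED_VALUE = r'a: \"bcde\"'

# The quoted value from the query in the reproduction steps.
QUERY_LITERAL = r'"a: \"bcde\""'


def unescape_once(text: str) -> str:
    """Remove one level of backslash escaping (hypothetical helper)."""
    out = []
    i = 0
    while i < len(text):
        if text[i] == "\\" and i + 1 < len(text):
            out.append(text[i + 1])  # keep the escaped character only
            i += 2
        else:
            out.append(text[i])
            i += 1
    return "".join(out)


def query_value(literal: str) -> str:
    """Strip the surrounding quotes from a query literal and un-escape it once."""
    return unescape_once(literal[1:-1])


decoded_value = json.loads(RAW_DOCUMENT)["key"]  # what a JSON parser would return
searched_value = query_value(QUERY_LITERAL)      # what the search would look for

matches_decoded = searched_value == decoded_value
matches_stored = searched_value == STORED_VALUE

# Re-escaping the searched value as a JSON string restores the stored form.
reescaped_value = json.dumps(searched_value)[1:-1]
matches_after_reescape = reescaped_value == STORED_VALUE

print(matches_decoded, matches_stored, matches_after_reescape)
```

Under these assumptions the un-escaped query equals the decoded JSON value but not the verbatim stored text, so the comparison only succeeds if both sides are brought to the same escaping level.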