Make tantivy queries a bit lighter #171

pudo · 2024-08-27T09:19:01Z

This reduces the number of clauses in the tantivy query a bit, in the hope that lighter queries will make xref faster.

WIP

jbothma · 2024-08-27T09:51:44Z

nomenklatura/index/tantivy_index.py

+                for word in clean.split(WS):
+                    fields[type.name].add(word)


Do we need to index each word individually? I thought it was sufficient to index the full name, as long as we query for each part.

tantivy definitely has its own tokenizer. I just thought that if we're going to split the name up for phonetics anyway, why not use those tokens :) But yeah, we can obviously rely on the real indexer instead.
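
To illustrate the point in this thread, here is a minimal sketch of why pre-splitting should be redundant when the index tokenizes text fields itself. It is a toy inverted index in plain Python, not tantivy's API: `ToyIndex`, `tokenize`, `add_value` and `search_any` are hypothetical names, and the lowercase-and-split-on-whitespace tokenizer is only an assumed stand-in for whatever tokenizer the real index is configured with.

```python
from collections import defaultdict


def tokenize(text: str) -> list[str]:
    """Toy tokenizer (assumption): lowercase, then split on whitespace."""
    return text.lower().split()


class ToyIndex:
    """Hypothetical in-memory inverted index mapping token -> entity IDs."""

    def __init__(self) -> None:
        self.postings: defaultdict[str, set[str]] = defaultdict(set)

    def add_value(self, entity_id: str, value: str) -> None:
        # The index tokenizes whatever string it is handed.
        for token in tokenize(value):
            self.postings[token].add(entity_id)

    def search_any(self, text: str) -> set[str]:
        # One clause per query token, combined as a union ("should" semantics).
        hits: set[str] = set()
        for token in tokenize(text):
            hits |= self.postings.get(token, set())
        return hits


# Variant A: index the full name and let the index tokenize it.
whole = ToyIndex()
whole.add_value("e1", "John Smith")

# Variant B: split the name first and add each word on its own.
split = ToyIndex()
for word in "John Smith".split():
    split.add_value("e1", word)

# Both variants end up with the same postings, so querying by part
# finds the entity either way.
same_postings = dict(whole.postings) == dict(split.postings)
hits_for_part = whole.search_any("smith")
```

Under these assumptions the two variants are interchangeable, which is the argument for relying on the indexer's tokenizer. The equivalence only holds if index-time and query-time tokenization agree.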

pudo added 2 commits August 26, 2024 17:28

experiment with less complex queries against the tantivy index

4501f5e

remove imports

159ed1f

jbothma reviewed Aug 27, 2024

simplify a bit more

2916dd9
