Query encoding of sparse retrieval models directly in Java via ONNX #1772

lintool · 2022-02-20T17:18:50Z

lintool
Feb 20, 2022
Maintainer

ONNX has been around for a while, but last time I looked at it, it wasn't very mature. Perhaps this has changed:
https://www.reddit.com/r/LanguageTechnology/comments/pcygyp/run_transformers_models_directly_in_javascript/

For sparse retrieval models, we need to perform inference on the queries at search time. Currently, we sidestep this by using pre-encoded queries (generated from pyserini). With something like the above, perhaps it is now possible to do query encoding directly in Java.

Worth taking another look?

tteofili · 2022-02-22T13:38:09Z

tteofili
Feb 22, 2022
Collaborator

I don't have first hand experience with ONNX, I've just had a look at how Apache OpenNLP did the same for using NER / DocCat models from Huggingface; it seems something specific for sparse retrieval can be done in a similar fashion.

0 replies

lintool · 2022-04-03T15:02:06Z

lintool
Apr 3, 2022
Maintainer Author

Another option to consider? https://github.com/deepjavalibrary/djl

0 replies

lintool · 2023-12-27T17:24:31Z

lintool
Dec 27, 2023
Maintainer Author

We've done this with ONNX! Closing.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Query encoding of sparse retrieval models directly in Java via ONNX #1772

{{title}}

Replies: 3 comments

{{title}}

{{title}}

{{title}}

Select a reply

Query encoding of sparse retrieval models directly in Java via ONNX #1772

lintool Feb 20, 2022 Maintainer

Replies: 3 comments

tteofili Feb 22, 2022 Collaborator

lintool Apr 3, 2022 Maintainer Author

lintool Dec 27, 2023 Maintainer Author

lintool
Feb 20, 2022
Maintainer

tteofili
Feb 22, 2022
Collaborator

lintool
Apr 3, 2022
Maintainer Author

lintool
Dec 27, 2023
Maintainer Author