-
Notifications
You must be signed in to change notification settings - Fork 17
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[FEATURE] sltr queries with minimum_should_match features #20
Comments
We need to better understand how the @jhinch-at-atlassian-com -- do you have any ideas of how @noCharger -- Can you look into this? Would be a good place to get started on understanding the plugin. Thanks! |
The best place to start looking is from RankerQuery.RankerWeight#scorer and RankerQuery.DisjunctionDISI#advance. You would need to compare this to how the equivalent functionality in |
@jhinch-at-atlassian-com I like this plan and the approach we're taking to support |
|
Is your feature request related to a problem?
Non-linear scoring functions, particularly gradient boost decisions trees can be used as a technique used to deal with combining scores together for features which have different magnitudes and score distributions. However, currently
sltr
queries functions similar tobool
query with aminimum_should_match
of0
with a custom scoring function meaning it cannot be used conveniently within the initial query and currently is encouraged to only be used in rescore blocks.For example given the following featureset definition:
and a model
example_model
which was created using the above featureset, the followingsltr
query:Can be thought conceptually as:
What solution would you like?
It would be great if the features used by the model could have a requirement of a minimum which should match so that the
sltr
:which would translates to roughly the following:
What alternatives have you considered?
Its possible to work around this by having a surrounding
bool
query and duplicate the features as filters in thatbool
query:However this has the problem that it executes the query blocks twice and it requires duplicating the definitions and ensuring the featureset and query remain in sync.
Do you have any additional context?
This is the equivalent feature request as o19s/elasticsearch-learning-to-rank#476 but to the OpenSearch fork.
The text was updated successfully, but these errors were encountered: