You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am exploring using questioning_authority with the Getty AAT vocabulary, for a use case involving auto-complete on a staff metadata editing form.
I am finding that the Getty AAT vocabularly is not performing the way I expect it to to be suitable for that use case for us.
There is another long-open Getty related ticket at #84 , but it's not clear what the actual goals/problems of that ticket are, so I'm filing another one. But that ticket has useful links to Getty documentation and relevant listserv, and suggestions from a Getty developer of possible implementation improvements that may be relevant here.
For my uses:
I want search to be "left-anchored"/truncated, search for parch should find terms involving parchment. But it doesn't, only whole-word matches are currently returned.
The search right now is strangely case-sensitive in ways I don't expect. For instance, in fact entering parchment also returns no hits, but entering Parchment (capital-P) returns 4 (although they all come back in lowercase, only a search with initial capital-P can find them!)
Those are the main barriers, but as I investigate the code/implementation, other issues where it's not doing what I want but I'm not certain what other people would want/expect include:
Should the results be ranked by lucene relevance/best match or alphabetical? Currently they are alphabetical, I think maybe I prefer relevane (I think FAST searches in qa are relevance).
Should the number of results be capped/limited? Probably only feasible if result sort is "relevance", otherwise it doesn't necessarily make sense to limit it after so many ordered alphabetically. But unlimited, you can in some cases get so many results that it's difficult to do in a dropdown menu UX. (FAST search is, I beleive, limited results).
What should happen with multi-word input? Currently it looks like the code tries to do an "AND" search for all terms, but I'm not sure if this is desired or consistent with other qa vocabularies. Should it be a phrase search instead, with only the last term being "right-anchored" partial-word-match allowed, for an auto-complete use case?
Should all Getty lookups behave the same, or is there a reason for them to differ? Currently the three Getty vocabularies supported all use slightly different queries, I'm not sure if this is intentional/useful, or if they should all be made consistent. See [1][2][3]
The hard part is knowing what the desired outcome is. Does my sense of what it "should" do match what other people think? What parts of current implementation woudl be considered bugs vs working as intended, even if as intended is not what I personally want/need for my use case?
I could maybe find some time to PR improvements here, perhaps as part of a maintenance hours pledge. If it was clear what the consensus was on "correct behavior" or what the community expects.
It might be helpful to understand the most commonly used vocabularies in qa, to assume that they are not buggy but working as designed, and then try to make Getty AAT and/or other Getty work similarly to the popular ones? I am not sure if anyone is currently using Getty in production in a way that it works for their use cases, or if it's currently just buggy and un-used.
The text was updated successfully, but these errors were encountered:
Are any of you currently using Getty vocabs in qa in deployed apps?
It would be helpful to have feedback from anyone currently using Getty vocabs via qa on whether AAT and other Getty vocabs are currently working as anyone expects/desires, or are currently broken/un-used; and then what behavior would be desired/acceptable, whether to make it usable or as improvements.
I am exploring using questioning_authority with the Getty AAT vocabulary, for a use case involving auto-complete on a staff metadata editing form.
I am finding that the Getty AAT vocabularly is not performing the way I expect it to to be suitable for that use case for us.
There is another long-open Getty related ticket at #84 , but it's not clear what the actual goals/problems of that ticket are, so I'm filing another one. But that ticket has useful links to Getty documentation and relevant listserv, and suggestions from a Getty developer of possible implementation improvements that may be relevant here.
For my uses:
parch
should find terms involvingparchment
. But it doesn't, only whole-word matches are currently returned.parchment
also returns no hits, but enteringParchment
(capital-P) returns 4 (although they all come back in lowercase, only a search with initial capital-P can find them!)Those are the main barriers, but as I investigate the code/implementation, other issues where it's not doing what I want but I'm not certain what other people would want/expect include:
The hard part is knowing what the desired outcome is. Does my sense of what it "should" do match what other people think? What parts of current implementation woudl be considered bugs vs working as intended, even if as intended is not what I personally want/need for my use case?
I could maybe find some time to PR improvements here, perhaps as part of a maintenance hours pledge. If it was clear what the consensus was on "correct behavior" or what the community expects.
It might be helpful to understand the most commonly used vocabularies in qa, to assume that they are not buggy but working as designed, and then try to make Getty AAT and/or other Getty work similarly to the popular ones? I am not sure if anyone is currently using Getty in production in a way that it works for their use cases, or if it's currently just buggy and un-used.
The text was updated successfully, but these errors were encountered: