Indexing: N/A values should not be indexed #1883
Comments
We should not be indexing these N/A values.
Note that with production data we also see several N/As.
@scolapasta please review a195b17 (pushed to a branch) and let me know if it's ok to merge in. The question I have is for the dataset level. In that commit the logic is to never index an "N/A" value for any dataset field that has a controlled vocabulary that includes "N/A" (not just the "subject" field).
I chatted with @scolapasta and it sounds like we want to avoid indexing "N/A" for as many fields as possible, including fields that don't have controlled vocabularies (which is most fields). Since the logic I'll need to change is spread across several places in the code, I ran a query on our migrated data to make sure I adjust indexing for as many fields as possible. The main ones to adjust will be for authors and descriptions.
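For anyone following along, the rule discussed above (skip "N/A" for every field, with or without a controlled vocabulary) could be sketched roughly as below. This is only an illustration, not the code in the commits referenced in this thread: the `NaFilter` class and its method names are hypothetical, and treating `null` and whitespace-padded "N/A" the same as "N/A" is an assumption.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration; not taken from the Dataverse code base.
class NaFilter {
    // Placeholder value that migrated or SWORD-created datasets may carry.
    static final String NA_VALUE = "N/A";

    // True when the value is missing or is the "N/A" placeholder.
    // Assumption: surrounding whitespace is ignored before comparing.
    static boolean isNotApplicable(String value) {
        return value == null || NA_VALUE.equals(value.trim());
    }

    // Returns only the values that should be passed on to the search index.
    static List<String> indexableValues(List<String> values) {
        List<String> result = new ArrayList<>();
        for (String value : values) {
            if (!isNotApplicable(value)) {
                result.add(value);
            }
        }
        return result;
    }
}
```

The idea would be to run every field's values (authors, descriptions, subjects) through something like `indexableValues` before building the search document, so the check lives in one place instead of being repeated per field.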
@scolapasta please review the new-and-improved version at 269081b and let me know if I should merge it to master.
prevent N/A values from being indexed #1883
@scolapasta merged #1897, but we decided to change the status to Ready for QA.
@scolapasta @kcondon I can't test this in build since we need to see how migrated local studies got indexed. Can you please tell me when this fix is on the new production site (harvard.dataverse.edu) or vm6?
Searched for N/A in production and was unable to find it indexed. Also looked at facets to confirm this was the case. Closing this ticket.
"I noticed that in #1246 that we don't want to display N/A values in UI, but as of this writing it's easy to create a dataset like this via SWORD by simply not including any dcterms:subject elements in the XML"
I first made that comment a month ago at #1430 (comment) but I'm opening this ticket (with a screenshot) to make sure we're ok with showing N/A in the Subject facet:
This happens all the time on https://apitest.dataverse.org right now where I'm not specifying a subject in the XML I use to create a dataset via SWORD.