-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Create metadata blocks for CAFE's collection of climate and geospatial data #232
Comments
I'd also like to use this GitHub issue to record the concerns/risks with this effort, similar to how we noted in other GitHub issues that metadata fields in metadata blocks created for other collections in HDV serve purposes that overlap with fields that are already available, such as fields in the Citation metadata block, and that facilitate describing data that others in the community have expressed interest in, like the metadata block for 3D Data discussed in #144. Metadata added in custom metadata block won't be in most metadata exports Showing or hiding fields based on what's entered in other fields so that depositors see only relevant fields Those first two fields are dropdown menus where the options are "Yes" and "No". So if a depositor chooses "No" for the "Geospatial File Type" field, depositors shouldn't enter metadata in the other fields that describe a geospatial file, since there isn't a geospatial file. Since Dataverse will always show all of the fields, the CAFE folks plan to address this with instructions in a dataset template and/or training. Letting depositors type in and enter a term in a field that uses a vocabulary The external controlled vocabulary mechanism handles this in a more common and arguably better way by using a UI component that let's depositors choose a term from a vocabulary and also enter their own terms in the same field. But this mechanism works only for vocabularies hosted externally and not for vocabularies that are defined in metadata block TSV files. Custom metadata block about data location versus geospatial metadata block that ships with Dataverse Describing geospatial files in the dataset-level metadata Overlap among fields in the "Metadata Block About Data Sources" and fields in Citation metadata block I also mentioned that once Dataverse can send metadata about related resources to DataCite (IQSS/dataverse#5277), we'll need to think about if and how to include the related datasets described in their custom metadata block. Automatic layout of child fields might make it hard for depositors to fill fields the way we expect We've seen and talked about how this design also confuses depositors who use other compound fields like the Related Publication fields in the Citation metadata block. There's related discussion in IQSS/dataverse#5277. Metadata in "Metadata Block About Data Sources" is hard to read when viewing metadata on dataset page |
2023/11/13
|
@jggautier Just to confirm - am I installing both customCAFEDataLocation.tsv and customCAFEDataSources.tsv in prod.? |
Ah yes. That CAFE collection's managers would like both of those metadata blocks available for the collection. I'll update this issue's title. |
I looked into this briefly, and I'm wondering if it would be better for new blocks to go through more of a qa process, like we do with everything else, before deploying them in prod. My biggest questions/concern were with the GeoSpatialResolution fields in these blocks, since we just had to spend so much effort addressing issues with similar fields in the Geospatial block. There may be some similar issues with validation for the values in this block. Namely, the values are defined as floats. So it is impossible to enter anything that does not parse as a decimal fractional. This would be the right behavior when "Decimal degrees" is selected in the "Unit" pulldown. But you can also select “Degrees-minutes-seconds” in the same pulldown - but it is then impossible to enter such a value: (there are other notations for formatting "degrees-minutes-seconds" values of course, but none of them will parse as a valid decimal fractional). I feel like if we want this field to support all the notations listed, the only way to achieve that would be to switch it back to text, and add custom validation methods, like we did with the Geospatial block fields. |
I was also told that there were some technical issues with bringing up a test instance for the researchers involved to experiment with. But I feel like that part must be something we can figure out. |
@sbarbosadataverse, it was agreed to continue testing this and the other metadata block after they were added to Harvard Dataverse. But can we bring up this concern, about validation, with the collection's manager Keith, and ask if user testing can be done before these metadatablocks are added? |
We had a quick chat about this on slack. It sounded like I should clarify what I said above:
By "like we do with everything else" I didn't meant literally the same process as how we QA dev. issues - deploying it on dataverse-internal, have the same QA person test it, etc. etc. I meant more like the same idea of testing and confirming that everything works properly before trying it in prod. It sounds like this should be a somewhat different process for custom blocks - focused more on letting the researchers who requested the block do the testing and confirming that everything works the way they like. |
Basically, rather than using the production for testing these blocks, let's let the collection admin(s) experiment with them on a test instance. |
Thanks @landreev for applying the changes to the ec2 test instance! I see the changes and everything's working as expected. I'll let the collection admin know that:
Does that all sound good? I'll wait to hear back from you before I email the collection admin. |
That sounds perfect. |
Okay, they wrote that they're okay with it |
OK, I'll look into the 2 datasets with the populated fields next. Will report. |
For the record - nope, you cannot remove these CAFE*-blocks fields in the UI. |
Looking into it now. |
[edit: n/m!] @jggautier Could you please confirm that these are the versions of the blocks that are currently installed in production? (Jan. 3 commits) - https://github.com/IQSS/dataverse.harvard.edu/blob/f4a79a733a9d5b816080cf463914e8743a761fff/metadatablocks/customCAFEDataLocation.tsv Not super crucial; I just want to replicate the prod. setup on my own dev. box 1:1 to test the delete queries. |
successfully removed "Location"... |
OK, all the existing field values and fields have been erased, and both custom blocks uninstalled, like they were never there. |
(no, I didn't get to installing the blocks last night, but will do shortly) |
Thanks. Sorry I didn't get to help look at those Jan. 3 commits yesterday. Got caught up in other stuff. I checked those two published datasets. They're editable and I don't see any traces of the new blocks' fields in the forms. I do see the metadata in the JSON exports, like https://dataverse.harvard.edu/api/datasets/export?exporter=dataverse_json&persistentId=doi:10.7910/DVN/Y1WNU7. Just pointing that out just in case. |
Correct, I didn't bother re-exporting the 2 datasets. |
The 2 blocks have been installed (again). Please review/double-check that these are the correct versions. |
I just reviewed them and they're the correct versions. Thanks! |
@jggautier Can we close it, or do you want to keep it open until they enter all the metadata they need? |
Ah, yes we can close it. I'll do that now. Wasn't sure if there was anything else we needed to do for re-adding the metadata blocks. We can track the remaining tasks, mostly about those two published datasets you edited, in our email thread with the collection admin. |
This GitHub issue is being used to track progress of the creation of a metadata block or metadata blocks I'm helping design for a Dataverse collection that the BUSPH-HSPH Climate Change and Health Research Coordinating Center (CAFE) will be managing on Harvard Dataverse. Their unpublished collection is at https://dataverse.harvard.edu/dataverse/cafe.
In this repo at https://github.com/IQSS/dataverse.harvard.edu/tree/master/metadatablocks, I've added the .tsv and .properties files that define the metadata fields, and I'll continue updating those files as the CAFE folks review and improve the metadata fields.
This screenshot shows the metadata block we're planning to add, as of 2023-11-07, so that depositors can describe the geospatial data:
This screenshot shows the metadata block we're planning to add, as of 2023-11-07, so that depositors can describe the source datasets of the dataset being deposited:
The text was updated successfully, but these errors were encountered: