You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
currently when a user uses a public pymovements dataset it is assumed that they will do the proper research within our method to attribute/cite the dataset correctly.
a disclaimer should be integated
Description of a solution
integrate a general disclaimer about dataset attribution if the dataset is used in their project.
add citation key(s) from pymovements bibliography to each dataset
add a dataset description property (similar to the docstring)
Minimum acceptance criteria
disclaimer automatically appears when downloading a dataset
>>> import pymovements as pm
>>> pm.Dataset('BSC', 'data').download()
You are downloading the BSC dataset. Please be aware that pymovements does not
host or distribute any dataset resources and only provides a convenient interface to
download the public dataset resources that were published by their respective authors.
property showing general dataset information
>>> import pymovements as pm
>>> pm.Dataset('BSC', 'data').description
Some general description of the BSC dataset.
add citation_keys property
>>> import pymovements as pm
>>> pm.Dataset('BSC', 'data').citation_keys
BSC
add test checking that citation keys are not the empty string for public datasets
add citation (key) o disclaimer
>>> import pymovements as pm
>>> pm.Dataset('BSC', 'data').download()
You are downloading the BSC dataset. Please be aware that pymovements does not
host or distribute any dataset resources and only provides a convenient interface to
download the public dataset resources that were published by their respective authors.
You can lookup the citation in our bibliography using ...
Sample Code
to implement the disclaimer add to dataset.py
defdisclaimer(self) ->str:
_disclaimer_text=f"""\You are downloading the {self.name} dataset. Please be aware that pymovements does nothost or distribute any dataset resources and only provides a convenient interface todownload the public dataset resources that were published by their respective authors."""return_disclaimer_text
add to download:
self.disclaimer()
add to dataset_definition.py
citation_keys: Sequence[str] =''
this can also be a list or tuple in case one dataset spans multiple citations without changing the type annotation
For the disclaimer we need the fields name and citation (which should be in a human-readable citation format).
Also I wouldn't add anything to Dataset except for Dataset.download(disclaimer: bool = True).
There's not much use in explicitly calling Dataset.disclaimer(), because the download either has not happened, or already has finished, so then the disclaimer doesn't make sense anymore.
Instead, I would like to keep all logic in dataset/dataset_download.py.
This issue depends on #1057 (or at least the citation field), so don't worry about this issue here for now, and finish #1057 (or create a new PR just for the citation field).
Honestly I'm perfectly fine with having to explicitly opt-out (most users won't even notice that this is possible) but I also wouldn't mind much with not having the option.
Uh oh!
There was an error while loading. Please reload this page.
Description of the problem
currently when a user uses a public pymovements dataset it is assumed that they will do the proper research within our method to attribute/cite the dataset correctly.
a disclaimer should be integated
Description of a solution
Minimum acceptance criteria
Sample Code
to implement the disclaimer add to
dataset.py
add to download:
add to
dataset_definition.py
this can also be a
list
ortuple
in case one dataset spans multiple citations without changing the type annotationadd an
assert
totests/unit/datasets/datasets_test.py
checking thatcitation_keys
are not the empty string:extend disclaimer to include citation
@dkrako I self-assign, please edit this issue for anything you want differently, then you can delete this tagging and I'll implement it asap.
The text was updated successfully, but these errors were encountered: