Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Reconsider use of Figshare for the sample data #67

Open
tbooth opened this issue Jul 31, 2024 · 2 comments
Open

Reconsider use of Figshare for the sample data #67

tbooth opened this issue Jul 31, 2024 · 2 comments
Labels
reviewer Issues arising from comments on https://github.com/carpentries-lab/reviews/issues/17

Comments

@tbooth
Copy link
Collaborator

tbooth commented Jul 31, 2024

From @cmeesters:

"See this link for details about this dataset and the redistribution licence." contains the link
https://figshare.com/articles/dataset/data-for-snakemake-novice-bioinformatics_tar_xz/19733338/1.
It leads to a description on summary level, but also a "sorry, we can't preview this file" - which is slightly irritating.
Is figshare a good place for non-figure data?

@tbooth tbooth added the reviewer Issues arising from comments on https://github.com/carpentries-lab/reviews/issues/17 label Jul 31, 2024
@tbooth
Copy link
Collaborator Author

tbooth commented Jul 31, 2024

I'd agree that Figshare is problematic. I was copying Data Carpentry:
https://datacarpentry.org/image-processing/instructor/index.html#data

But I think we can do a bit better. I've also noticed that if you download the file from Figshare too many times it puts a temporary block on downloads, which could be a real problem.

I'll see about hosting the data on WorkflowHub.eu or somewhere like that.

@tbooth
Copy link
Collaborator Author

tbooth commented Jul 31, 2024

I could put a copy of the file here on GitHub, but GitHub does not allow .tar.xz files, and the .tar.gz version is just a tad too big. I guess I could shorten the FASTQ headers to shave off a bit of space, or do something horrible like this:

data-for-snakemake-novice-bioinformatics.tar.xz.gz

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
reviewer Issues arising from comments on https://github.com/carpentries-lab/reviews/issues/17
Projects
None yet
Development

No branches or pull requests

1 participant