-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
BugSigDB 1.0 release #4
Comments
Note, functionality from our bugsigdbr package will likely be helpful here, including |
I added a new organization secret |
@lgeistlinger I'd like to better understand what the release looks like. Will it be 3 csv files: one each for studies, experiments, and signatures? Or will there be different versions in content or file formats? There's also a place where I can add metadata. I'll try to write it using the website and the repository, but maybe I need some help with the following: Are there any specific keywords should we associate with the data? This isn't required, but may help users find it.
|
Thanks for checking in @jwokaty.
That's the core, yes. This is also what I think what
will get a
Currently it directly pulls the bloody edge including unreviewed content from https://bugsigdb.org/Help:Export, which would then become available via:
When preparing the three csv files for the 1.0 release, it's important to restrict to reviewed contents only. Let me know whether you have questions or whether it's easier if I provide these files. In addition, I think we want to supplement the core release with GMT files containing the actual signatures.
@lwaldron let us know if you have thoughts on this.
A couple that come to mind:
@lwaldron might have opinions. I'd be good with Creative Commons Attribution 4.0 International. |
@lgeistlinger would you write a simple .R script that dumps all the required files? I guess it'll be a few: gmt, cab, ncbi ID, names, genus, species, mixed. No need to put versioning or dates in file names, but maybe a comment line in the first line with the date, license, and reference to bugsigdb.org? |
Sure. |
That is basically done and part of bugsigdbr now: https://github.com/waldronlab/bugsigdbr/blob/main/inst/scripts/dump_release.R Call the script via: which will produce the following output files:
Pending waldronlab/BugSigDB#92 a filter by review status will be incorporated. |
@lgeistlinger I am working on this at https://github.com/jwokaty/BugSigDBExports, which I will transfer to waldronlab when it's in a good state. I've created a GitHub action that we can run manually to generate exports with @lwaldron and I had discussed setting up the action to do a daily export that would be committed to BugSigDBExports. We would do a manual release to get into Zenodo. However, the version number that is passed to |
I think that is great.
This is correct. The only place where the
If you call the script with a slight notational abuse and provide a date instead of a version number to
and the argument can then also be renamed to |
It looks great! Nice use of the |
And BTW I think the Zenodo releases can just have a date within too, releases will have a version number from the tag but the files contain dates as usual. |
We are still waiting on waldronlab/BugSigDB#92 to close before we publish because this may change the content, correct? I've transferred the BugSigDBExports to waldronlab and have scheduled it to do weekly exports on Sunday using dates. The first will be this Sunday. Does this issue really belong to the new repository? I can transfer it. |
Yes, this issue does belong in https://github.com/waldronlab/BugSigDBExports. |
Sorry for being impatient and just transferring it, was only so I could refer to it from waldronlab/BugSigDB#92 (although I just learned that those references get automatically updated with the transfer!) |
I set up the Zenodo integration and did a little debugging trying to get it to work (https://zenodo.org/account/settings/github/repository/waldronlab/BugSigDBExports#) but am stuck now with the following error on Zenodo: {
"errors": "Something went wrong when we tried to publish your release. If your release has not been published within the next hour, please contact us via our support form to resolve this issue."
} Let's see if it fixes itself within the next hour, otherwise I'll contact the Zenodo support team. It's very particular about the .zenodo.json file. |
@jwokaty @lwaldron quick update: I incorporated a filter for complete content in the dump release script, so the only thing remaining for the 1.0 release to zenodo is to clean up the ontology columns (waldronlab/BugSigDB#92 (comment)). Will work on that so that we get that through the door prior to the October Bioc release. |
Hi @lgeistlinger, there's an issue with automatically releasing to Zenodo, so we should do a manual release of the files. (The last I heard from them, they were still working on it last week.) What is the date that we will do the release, Oct. 25? And do want to modify https://github.com/waldronlab/bugsigdbr to get data from Zenodo (along with the bleeding edge)? |
And yes, that is what I think we are aiming for. Being able to pull the zenodo release (stable) as well as the continously updated version (bleeding edge) from BugSigDBExports as we do it currently, if that makes sense. |
Closed by #12. I manually did the release on Zenodo at https://zenodo.org/record/5606166 since the automatic mechanism still isn't working. (I should have removed the README, but I you can't change the files after publishing.) |
Thanks @jwokaty. That is great. I think we might have jumped the gun here a little bit though as the first release is still waiting on the fix of the ontology columns in the export. This needs to be fixed by Ike first before we can go ahead and do our first official release. |
Thanks for clarifying. |
Hi @jwokaty @lwaldron : this is finally ready for release! we finished the ontology columns in the export and everything is looking good now for upload of the stable BugSigDB 1.0 release to zenodo. @jwokaty can you go ahead and perform the upload to zenodo? (Not sure whether this will involve overwriting your previous upload under https://zenodo.org/record/5606166, or whether we bump this to 1.0.1 then). Thanks! |
The release should be accordingly based on the latest export: 1137470 |
@lgeistlinger We have to bump the version to 1.0.1. I just want to check if there should be a specific release title and any description for the release before I create the release. Also, would you like me to 'draft' the upload in Zenodo so that you can take a look before I finalize everything? |
Thanks @jwokaty! I noticed a small inconvenience in the bulk export from bugsigdb.org, with some conditions / body sites being present in upper case and lower case (eg "Feces" and "feces"). I introduced a small fix for that in 121c571. Can you trigger a manual export for that and base the 1.0.1 on this export?
Nothing specific here from my side.
That sounds like a good idea! |
@lgeistlinger I've drafted the new version at https://zenodo.org/deposit/5819260. (I am assuming that you can see it.) |
Thanks @jwokaty , logging in to zenodo via Github (lgeistlinger / [email protected]), I am seeing:
when trying to access the link you provided. |
I apologize. I thought maybe because we had access to the same thing that maybe we could all see the draft. Maybe there's no way for you to see it? I just updated all the files, except for the README, and then updated the version number to v1.0.1. |
Cool! Can you maybe share the files via google drive or dropbox with me to quickly review them on my end. Thanks! |
The files are the same as in this release: https://github.com/waldronlab/BugSigDBExports/releases/tag/v1.0.1 |
Ah very nice, somehow I didn't notice the |
Hi @jwokaty:
@lwaldron and I had started to discuss a release scheme for BugSigDB.
One idea was to follow Bioconductor's semi-annual release scheme, and have a stable release of BugSigDB signatures every half a year. We also discussed zenodo as the platform for hosting the stable release (= csv files for studies, experiments, and signatures).
A stable release is supposed to contain all reviewed content from BugSigDB up to a defined freeze date. For the BugSigDB 1.0 release this could encompass reviewed content up to the present date for simplicity, or if we wanted to synchronize with Bioconductor, up to the past 3.13 release date.
Would you like to go ahead and export the content, filter by date and review status, upload to zenodo, and include the stable release link under https://bugsigdb.org/Help:Export ?
Thanks!
The text was updated successfully, but these errors were encountered: