Use specific data formats #36
Comments
To accomplish this, we would need some kind of mapping from our formats to Galaxy formats. And since both of these frameworks allow extension, we'll probably always need ... From there it ought to be possible to observe that a Galaxy collection which contains only a single type would be compatible with our file collection, and thus constrain the collections available to import. (Do Galaxy collections have an observable format?)
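A minimal sketch of the idea in this comment, under stated assumptions: the mapping table below is purely illustrative (the pairs are not an agreed or complete mapping), and `collection_format` and `importable_as` are hypothetical helper names, not part of Galaxy or QIIME 2. It treats a collection as having an "observable format" only when every element shares one extension.

```python
from typing import Dict, Iterable, Optional

# Illustrative mapping from QIIME 2 format names to Galaxy datatype
# extensions. These pairs are assumptions made for the example only.
QIIME2_TO_GALAXY: Dict[str, str] = {
    "FastqGzFormat": "fastq.gz",
    "NewickFormat": "newick",
}


def collection_format(extensions: Iterable[str]) -> Optional[str]:
    """Return the one extension shared by all elements, else None.

    An empty or mixed collection has no single observable format.
    """
    unique = set(extensions)
    return unique.pop() if len(unique) == 1 else None


def importable_as(
    extensions: Iterable[str],
    qiime2_format: str,
    mapping: Dict[str, str] = QIIME2_TO_GALAXY,
) -> bool:
    """True if the collection is homogeneous and matches the mapped type."""
    expected = mapping.get(qiime2_format)
    return expected is not None and collection_format(extensions) == expected
```

This compares extensions by exact match, so it would not accept a more specific datatype where a general one is expected. A real implementation would probably need to consult the datatype hierarchy as well.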
Seems that the EDAM annotation is present for the Galaxy data types. Is there a list of qiime datatypes somewhere, maybe with EDAM annotations? In general, collections can contain datasets of different types. On the tool side one can use the ... I find the discussion on automatically generated tool wrappers quite enlightening, because it often sheds light on shortcomings of Galaxy (or its tool framework). As a further comment on collections: they are a nice way to generate parallelism.
ping @ebolyen: it seems that we already had the same ideas earlier :) As just discussed, I will try to produce a figure (or a hierarchical data structure such as YAML/JSON) of Galaxy's datatype hierarchy annotated with edam_format and edam_data entries. Then we can think of a mapping, maybe with some help from @matuskalas.
Created a little script over here. There are a few datatypes deriving from more than one class, so I used only the first parent in the MRO. The result can be found here. If needed, you can probably run it using
(probably some additional Python modules are needed).
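The approach described in this comment can be sketched roughly as follows. This is not the linked script: the classes are stand-ins defined here so the example runs without Galaxy installed, their EDAM values are illustrative, and `first_parent` and `build_tree` are hypothetical names. It only shows how taking the first entry after the class itself in the MRO turns a multiple-inheritance graph into a tree.

```python
import json


# Stand-in datatype classes (assumptions, not Galaxy's real definitions).
class Data:
    edam_format = "format_1915"
    edam_data = "data_0006"


class Text(Data):
    pass


class Sequence(Data):
    edam_data = "data_2044"


class Fastq(Sequence):
    edam_format = "format_1930"


class GzipMixin:
    pass


class FastqGz(Fastq, GzipMixin):  # derives from more than one class
    pass


def first_parent(cls):
    """Return the first class after cls in its MRO, or None at a root."""
    parent = cls.__mro__[1]
    return None if parent is object else parent


def build_tree(classes):
    """Nest the given classes by first MRO parent, with EDAM annotations."""
    children = {}
    for cls in classes:
        children.setdefault(first_parent(cls), []).append(cls)

    def node(cls):
        return {
            "name": cls.__name__,
            "edam_format": getattr(cls, "edam_format", None),
            "edam_data": getattr(cls, "edam_data", None),
            "children": [node(child) for child in children.get(cls, [])],
        }

    return [node(root) for root in children.get(None, [])]


TREE = build_tree([Data, Text, Sequence, Fastq, FastqGz])
print(json.dumps(TREE, indent=2))
```

Because only the first parent is kept, `FastqGz` appears under `Fastq` and its relationship to `GzipMixin` is dropped. That is the simplification the comment describes.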
I just started to explore the qiime2 Galaxy tools. Obviously starting with the import tool, I noticed that the unspecific format="data" is often used, e.g. galaxy-tools/tools/suite_qiime2_core__tools/qiime2_core__tools__import.xml, line 604 in 4456c16. This should be avoided, in particular if there are corresponding datatypes in Galaxy. In this specific example format="fastq.gz" seems appropriate. But there are also fastqsanger.gz or fastqillumina.gz if a specific phred encoding is required.
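One way to find such cases is a small check over the tool XML. The sketch below is an assumption-laden illustration: the embedded XML is a made-up fragment rather than the actual contents of the import tool, `unspecific_data_params` is a hypothetical helper, and treating a missing `format` attribute as unspecific is a choice made for this example.

```python
import xml.etree.ElementTree as ET

# Made-up tool fragment for illustration; not the real import tool.
TOOL_XML = """
<tool id="example_import" name="Example import">
  <inputs>
    <param name="reads" type="data" format="data"/>
    <param name="barcodes" type="data" format="fastq.gz"/>
    <param name="label" type="text"/>
  </inputs>
</tool>
"""


def unspecific_data_params(xml_text):
    """Return names of data params that accept the generic "data" format."""
    root = ET.fromstring(xml_text.strip())
    flagged = []
    for param in root.iter("param"):
        if param.get("type") not in ("data", "data_collection"):
            continue  # only dataset inputs carry a format
        # Assumption: a missing format attribute counts as unspecific.
        formats = [f.strip() for f in param.get("format", "data").split(",")]
        if "data" in formats:
            flagged.append(param.get("name"))
    return flagged


print(unspecific_data_params(TOOL_XML))
```

Each flagged parameter would then be reviewed by hand to pick a specific datatype, such as fastq.gz, or fastqsanger.gz when the phred encoding matters.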