You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Like empty collections, there are times you might want to have a mixed datatype collection but 99+% percent of the time I think collections should be uniformly datatyped. It would be nice if the collection creators issued a little warning if the datatypes don't match along the lines of the excellent work of @ahmedhamidawan in #19250.
There may be a few more usecases for mixed dbkey collections but it would be great if there was some sort of lighter weight indicator about that too. Maybe the thing for both dbkey and extension is that they should be highlighted in the UI if they are mixed across the collection - so right now we show the extension for every element in a list - maybe only do that if they are different with a little warning next to them and then match that for dbkey?
The text was updated successfully, but these errors were encountered:
I would use different language yeah, the history panel and dataset info call these format instead of extension or datatype. "Datasets have differing formats, generally {collectionType}s should contain datasets of all the same type."
Maybe as a second level - two helptext link overs could be added. On format that describe it as having different names in Galaxy (dataytpe, format, extension) and explain the difference between file extensions of dataset datatypes and then reuse this help text in the history panel format label and dataset info format label. And a link out at the end of your - saying maybe "Why?" that on hover explains why dataset collections are not folders. I asked ChatGPT for some language and sharpened it to something like:
Dataset collections are designed to streamline the analysis of large numbers of datasets by grouping them together into a single, manageable entity. Unlike generic folders on your computer, which can hold any mix of file types, dataset collections are specifically intended to be homogenous. This homogeneity is crucial for consistency in processing. Homogeneous datasets ensure that each dataset in the collection can be processed uniformly with the same tools and workflows. This eliminates the need for individual adjustments, which can be time-consuming and prone to error. Most tools and workflows in Galaxy are designed to operate on collections of similar data types. Homogeneous collections allow these tools to operate uniformly over the collection.
Like empty collections, there are times you might want to have a mixed datatype collection but 99+% percent of the time I think collections should be uniformly datatyped. It would be nice if the collection creators issued a little warning if the datatypes don't match along the lines of the excellent work of @ahmedhamidawan in #19250.
There may be a few more usecases for mixed dbkey collections but it would be great if there was some sort of lighter weight indicator about that too. Maybe the thing for both dbkey and extension is that they should be highlighted in the UI if they are mixed across the collection - so right now we show the extension for every element in a list - maybe only do that if they are different with a little warning next to them and then match that for dbkey?
The text was updated successfully, but these errors were encountered: