Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Incorrect MIME-type for TSV-file inputs #831

Open
ebete opened this issue Feb 13, 2025 · 4 comments
Open

Incorrect MIME-type for TSV-file inputs #831

ebete opened this issue Feb 13, 2025 · 4 comments
Labels
bug Something isn't working

Comments

@ebete
Copy link

ebete commented Feb 13, 2025

Description of the bug

The MIME-types for TSV files in nextflow_schema.json are set to text/tsv instead of text/tab-separated-values (see: IANA tsv).

This causes issues within Seqera cloud, as TSV-files uploaded using the Datasets tab will have the text/tab-separated-values MIME-type, and will thus not be selectable as an input for the pipeline as it doesn't match text/tsv.

See Seqera platform docs regarding this feature:
https://docs.seqera.io/platform/24.3/data/datasets#use-a-dataset

Command used and terminal output

Relevant files

No response

System information

  • Seqera cloud: 24.2.3_5387adb
  • nf-core/ampliseq: 2.12.0
@ebete ebete added the bug Something isn't working label Feb 13, 2025
@pinin4fjords
Copy link
Member

Actually the Platform incompatibility is more to do with the lack of pattern specification in the schema- could be fixed like #832.

@d4straub
Copy link
Collaborator

The problem is solved for --input in the dev branch. However, all params that point to files should have patterns to make them work, so I keep the issue open as reminder.

@ebete
Copy link
Author

ebete commented Feb 20, 2025

It looks like TSV datasets are still not selectable from the dropdown within Seqera Platform when using the dev branch. But I'm also not sure if the platform uses extensions for matching datasets, and instead opts for MIME-types. At least in the front-end, the extension is stripped from dataset names.

@pinin4fjords
Copy link
Member

pinin4fjords commented Feb 24, 2025

It looks like TSV datasets are still not selectable from the dropdown within Seqera Platform when using the dev branch. But I'm also not sure if the platform uses extensions for matching datasets, and instead opts for MIME-types. At least in the front-end, the extension is stripped from dataset names.

Yes, there are two issues here.

  1. Lack of a pattern means Platform doesn't respond correctly (fixed in the linked PR)
  2. Platform doesn't respond correctly to patterns with the multiple extensions required here. This is a bug in Platform itself and will need to be resolved there.

The only viable workaround right now is to reset the schema to use a single file extension in the pattern. You could override the schema yourself pending the fix to the above.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants