Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

MirTop output changes depending on sample quality #74

Open
apeltzer opened this issue Oct 11, 2022 · 2 comments
Open

MirTop output changes depending on sample quality #74

apeltzer opened this issue Oct 11, 2022 · 2 comments

Comments

@apeltzer
Copy link

Expected behavior and actual behavior.

MirTop seems to create certain summary metrics only for samples with significant quality in the output file(s).

This makes downstream tools such as MultiQC break frequently, as MirTop just skips reporting these stats in the output file - while MultiQC depends on them to be present. An example is this here: MultiQC/MultiQC#1778

Other examples: MultiQC/MultiQC#1723 and MultiQC/MultiQC#1716 where we found different ids in the JSOn to be missing.

It would be great to always report all values in order to be on the safe side and not break any downstream tools anymore. Can't we just default to report 0 in such cases where there is no data?

Steps to reproduce the problem.

a.) Take a sample of (debatable) quality, run MirTop.
b.) Run MultiQC on the output to generate a report --> MultiQC fails.

Specifications like the version of the project, operating system, or hardware.

Doing this in the nf-core/smrnaseq workflow, data is sometimes just bad quality but I have several hundreds of samples where only a minor amount <<5% are failing due to this.

@lpantano
Copy link
Contributor

It is a good idea, I have it in mind, I just need 30min to address this. I need to get all the categories from here: https://github.com/ewels/MultiQC/blob/master/multiqc/modules/mirtop/mirtop.py#L61 (+ _sum _count _mean) to here: https://github.com/miRTop/mirtop/blob/dev/mirtop/gff/stats.py#L106.

@apeltzer
Copy link
Author

Yes looks like it 👍🏻 Happy to test if you have a pre-release version to make sure it works nicely? :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants