Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support for already-qc'd fna files of reads #7

Open
shiraz-shah opened this issue Jul 12, 2023 · 0 comments
Open

Support for already-qc'd fna files of reads #7

shiraz-shah opened this issue Jul 12, 2023 · 0 comments

Comments

@shiraz-shah
Copy link

Dear Moreno,
It would be so convenient to have support for read fna files that have already been QC'd so they can be mapped directly by bowtie2. The current QC step is slow, and allowing for this would speed up viromeQC a lot!

A lot of us have MDA amplified viromes, where a read deduplication step is necessary after read filtering and trimming to remove redundant reads prior to assembly, mapping, etc. Such deduplication is already done by other tools (such as vsearch or rmdup) and dramatically reduces the size of the read files. Following qc and deduplication, the quality scores are no longer important, so the output is normally fna files of reads instead of fastq files.

Incorporating support for qc'd fna files (bowtie2 also supports this) would dramatically increase the speed of viromeQC because the length filtering step is omitted, plus fewer reads have to be mapped by bowtie2. E.g. many of my MDA amplified virome files are often 10x smaller after passing my own qc and deduplication pipeline.

Please consider this, as it would be extremely easy for you to implement!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant