Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Quality trimming reads? #44

Open
timghaly opened this issue Feb 19, 2024 · 2 comments
Open

Quality trimming reads? #44

timghaly opened this issue Feb 19, 2024 · 2 comments

Comments

@timghaly
Copy link

Dear Plass team,

I am very interested in using this tool for protein assembly of soil metagnomes. I am just curious if you would recommend to first quality filter and trim reads, e.g., using fastp. Will this improve the precision of Plass, or will the potential reduction in read length from the trimming come at too great a cost in sensitivity? What would you recommend?

Best,
Tim

@FlyinTeller
Copy link

FlyinTeller commented Apr 9, 2024

Quality Trimming is a good idea for the assembly process.
We haven't done comparative tests of assembly with and without quality trimming. But if you don't quality trim, it means that you would get a much lower sequence identity in the overlap between reads. This in turn could only be counteracted by lowering the threshold for sequence identity, which could have a negative impact on precision. There might be a small reduction in sensitivity when the overlap between reads becomes less than the length of a kmer, but this should be a small effect and the loss in precision you would get from not quality trimming probably outweighs this effect.

@timghaly
Copy link
Author

Great, thanks for your advice!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants