Question: Determining Required Percentage Similarity and Handling Database Input #321

ahfitzpa · 2023-12-11T19:33:47Z

I am planning a virus sequencing project using Readfish. Considering the ONT error rate and the adaptive sampling system, what is the necessary percentage similarity between the reference sequence (database) and the target sequence (expected on your flow cell). Given the diversity of viruses, I would like to avoid an unwieldy mmi input file and also avoid false hits.

github-actions · 2023-12-11T19:34:05Z

Thank you for your issue. Give us a little time to review it.

PS. You might want to check the FAQ if you haven't done so already.

This is an automated reply, generated by FAQtory

Adoni5 · 2023-12-12T20:15:17Z

I think you could get away with something in the range of 70% based on some work I was doing today.

How many virus's are you looking at in the sample? You could probably get away with using a generic reference for a group of species, but that said if you are looking to differentiate between two very similar species, it might require more thought. Any thoughts @mattloose ?

ahfitzpa · 2023-12-15T18:41:30Z

I will not know how many viruses are in the samples as it is an virus discovery project in a wide variety of samples types, therefore I cannot take a host depletion approach.
What I am hoping and will test from what you are saying is that AS via ReadFish is pretty permissive, so I can reduce the size of my db by clustering to a specific similarity. I will have fun at the other end of sequencing disentangling similar species anyway due to the ONT error rate, though it is much improved.
The size limits are pretty well documented are AS. Do you think that increasing the time a sequence spends in the pore would permit AS of shorter sequences?

github-actions · 2024-01-15T02:09:57Z

This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 5 days.

github-actions · 2024-01-21T02:11:14Z

This issue was closed because there has been no response for 5 days after becoming stale.

github-actions bot added the Stale label Jan 15, 2024

github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale Jan 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question: Determining Required Percentage Similarity and Handling Database Input #321

Question: Determining Required Percentage Similarity and Handling Database Input #321

ahfitzpa commented Dec 11, 2023 •

edited

Loading

github-actions bot commented Dec 11, 2023

Adoni5 commented Dec 12, 2023

ahfitzpa commented Dec 15, 2023

github-actions bot commented Jan 15, 2024

github-actions bot commented Jan 21, 2024

Question: Determining Required Percentage Similarity and Handling Database Input #321

Question: Determining Required Percentage Similarity and Handling Database Input #321

Comments

ahfitzpa commented Dec 11, 2023 • edited Loading

github-actions bot commented Dec 11, 2023

Adoni5 commented Dec 12, 2023

ahfitzpa commented Dec 15, 2023

github-actions bot commented Jan 15, 2024

github-actions bot commented Jan 21, 2024

ahfitzpa commented Dec 11, 2023 •

edited

Loading