Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

masked regions #525

Closed
askleppe opened this issue Nov 12, 2024 · 1 comment
Closed

masked regions #525

askleppe opened this issue Nov 12, 2024 · 1 comment

Comments

@askleppe
Copy link

Hi,

We have regions in the reference fasta that are masked and that we ideally would like for sniffles to ignore when scanning for SVs. I could not find anything regarding what Sniffles does or how it handles masked regions. If we want it to be ignored, how do we do go about that? Currently we get SVs that have masked regions, eg N in reference show up with an insertion. The regions are not many but they're there and they seem to dominate the SV findings at this stage. It seems a bit dubious.

Does sniffles distinguish between soft masked and hard masked regions, or is there a setting we may apply to avoid masked regions?

Thank you for your input.

Best wishes,
April

@fritzsedlazeck
Copy link
Owner

Dear April,
correct this case is currently not directly supported. What I would suggest is to call with sniffles and then use e.g. bedtools to disregard the SV entries in the regions you want to ignore.

Are you aligning two genomes? The N in the insertion might be the reference allele. you need to specify the reference sequence to avoid the N (as the reference position). If there are literally N in the inserted sequence (so the ALT column ) that would mean that these are also in your reads.

Hope that helps
Fritz

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants