Downstream filtering for haploid TRs #105

Open
Owuorgpo opened this issue Apr 5, 2021 · 4 comments

Comments

@Owuorgpo

Owuorgpo commented Apr 5, 2021

Nice tool!
I am trying to use this tool to identify TR expansions in a bacterial genome. GangSTR itself works perfectly well; the problem is the subsequent filtering with dumpSTR (from TRTools). All sites are filtered out even after lowering the thresholds, which I suspect is an issue that starts at the GangSTR stage. Are there specific considerations when --ploidy 1 is set and the genome is haploid?

@nmmsv
Collaborator

nmmsv commented Apr 5, 2021

Hi @Owuorgpo
Unfortunately, --ploidy 1 is relatively under-tested, and issues with it are prevalent in both GangSTR and TRTools. Could you share the VCF/BAM files that were used in your run to help with debugging?
Note: I'm currently only working part-time at the lab, so development is going to be a bit slow. Apologies in advance!

@Owuorgpo
Author

Owuorgpo commented Apr 6, 2021

Sample.vcf.gz
Sure, here is a VCF file from GangSTR. It is unfiltered, straight from GangSTR with the default settings plus --ploidy 1. dumpSTR is expected to drop most of these calls, since many are not supported even in the BAM file, but it ends up dropping everything, including TR expansions we have validated before.
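
Before filtering, it can help to confirm how the genotypes are actually written in the unfiltered VCF. The following is a minimal sketch using only the Python standard library, not part of GangSTR or TRTools: it counts how many alleles each sample's GT field carries, so haploid calls (`1`), diploid calls (`0/1`) and missing calls (`.`) can be told apart. The function name `tally_gt_ploidy` and the three example records are hypothetical, made up for illustration only.

```python
from collections import Counter


def tally_gt_ploidy(lines):
    """Count the number of alleles in each sample's GT field.

    Keys of the returned Counter are allele counts (1 = haploid,
    2 = diploid, ...) or the string "missing" for no-calls.
    """
    counts = Counter()
    for line in lines:
        # Skip header lines and blank lines
        if line.startswith("#") or not line.strip():
            continue
        fields = line.rstrip("\n").split("\t")
        # Columns 1-8 are fixed, column 9 is FORMAT, samples follow
        if len(fields) < 10:
            continue
        keys = fields[8].split(":")
        if "GT" not in keys:
            continue
        gt_index = keys.index("GT")
        for sample in fields[9:]:
            values = sample.split(":")
            gt = values[gt_index] if gt_index < len(values) else "."
            # Treat phased (|) and unphased (/) separators the same way
            alleles = gt.replace("|", "/").split("/")
            if all(allele == "." for allele in alleles):
                counts["missing"] += 1
            else:
                counts[len(alleles)] += 1
    return counts


# Made-up records for illustration only (not taken from Sample.vcf.gz)
example_lines = [
    "##fileformat=VCFv4.1",
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\tsample1",
    "chr1\t100\t.\tACACAC\tACACACAC\t.\t.\tEND=105;RU=ac\tGT:DP:REPCN\t1:30:4",
    "chr1\t200\t.\tGTGT\t.\t.\t.\tEND=203;RU=gt\tGT:DP:REPCN\t0/0:25:2,2",
    "chr1\t300\t.\tTATA\t.\t.\t.\tEND=303;RU=ta\tGT:DP:REPCN\t.:0:.",
]

ploidy_counts = tally_gt_ploidy(example_lines)
```

To run it on a real file, the same function could be fed the lines of `gzip.open("Sample.vcf.gz", "rt")`. If every call turns out to be missing or haploid, that at least narrows down where the filtering step might be going wrong.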

@nmmsv
Collaborator

nmmsv commented Apr 19, 2021

Hi @Owuorgpo,
Sorry for the very delayed response. My plate has been quite full with an internship that I recently started. Unfortunately, I won't be able to debug this immediately. I will keep the issue open and get to it as soon as I can set aside some time.

@Johnymcb

Johnymcb commented May 8, 2021

Hi @nmmsv,
I am getting a similar problem when filtering calls with dumpSTR after the --ploidy 1 or --samp-sex option has been used on haploid chromosomes. The process terminates with an error. I am working on human WGS. See the attached sample VCFs. This is the error: "ml = [int(item) for item in sample["REPCN"]] ; TypeError: 'int' object is not iterable"

File2_ploidy-opt.vcf.zip
File1_sample-sex-opt.vcf.zip
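
One plausible reading of this traceback, stated as an assumption rather than a confirmed diagnosis: for a haploid call the REPCN field holds a single value, so the VCF parser hands back a bare `int` instead of a sequence, and the list comprehension in the error message cannot iterate over it. The sketch below reproduces that failure with hypothetical stand-in records and shows one defensive way to normalize the value. The helper `as_int_list` and the two dictionaries are illustrative names only and are not part of TRTools.

```python
def as_int_list(value):
    """Normalize a FORMAT field value to a list of ints.

    Assumption: depending on ploidy and parser, the value may be a
    bare number (haploid call), a tuple or list (diploid call), a
    comma-separated string, or None / "." for a missing call.
    """
    if value is None or value == ".":
        return []
    if isinstance(value, (int, float)):
        # A single allele: wrap it so callers can always iterate
        return [int(value)]
    if isinstance(value, str):
        return [int(item) for item in value.split(",") if item not in ("", ".")]
    return [int(item) for item in value if item is not None]


# Hypothetical stand-ins for parsed sample records
haploid_sample = {"REPCN": 12}
diploid_sample = {"REPCN": (12, 15)}

# The pattern quoted in the error message fails on the haploid record
try:
    ml = [int(item) for item in haploid_sample["REPCN"]]
except TypeError as err:
    error_message = str(err)

# The normalized version handles both shapes
haploid_ml = as_int_list(haploid_sample["REPCN"])
diploid_ml = as_int_list(diploid_sample["REPCN"])
```

Whether this matches what actually happens inside dumpSTR would need to be checked against the attached VCFs and the TRTools source.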

3 participants