Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Key Error from REF and ALT fields when running DumpSTR on GangSTR vcf file #81

Open
BonnieCSE opened this issue Nov 13, 2019 · 5 comments
Assignees

Comments

@BonnieCSE
Copy link

DumpSTR throws KeyError: 'csf1poatct' when running on a GangSTR vcf file (Platinum Genomes pedigree). I noticed the error was due to strings such as ‘csf1poatct,’ ‘d7s820tatc,’ and ‘d8s1179tcta’ in the REF and ALT fields in chr5 pos 149455887, chr7 pos 83789542, and chr8 pos 125907115.

@nmmsv
Copy link
Collaborator

nmmsv commented Feb 13, 2020

Hi Bonnie,
Did we figure this out? I remember you mentioned it before but forgot if it was addressed or not.

@nmmsv nmmsv self-assigned this Feb 13, 2020
@BonnieCSE
Copy link
Author

BonnieCSE commented Feb 13, 2020 via email

@nmmsv
Copy link
Collaborator

nmmsv commented Feb 13, 2020

O cool I remember now.
If you have the bed file that created this issue can you add it here? That way I can reproduce the error and take it from there!

@BonnieCSE
Copy link
Author

Is it ok if I send the vcf file? I can't seem to find the bed file.

chr5_subset.vcf is a subset of chr5 (contains one of the 3 lines that cause the error), and when it is the input vcf for dumpSTR, the error is thrown.
plat_merged_errors.vcf contains all 3 of the lines that cause dumpSTR to throw the Key Error.

chr5_subset.vcf.gz
plat_merged_errors.vcf.gz

@nmmsv
Copy link
Collaborator

nmmsv commented Feb 13, 2020

awesome, thanks! I'll check them out later.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants