Replies: 5 comments 11 replies
-
The command you include here is not the recommended way to run the seqr pipeline. Please make sure you are running the latest docker image, have followed all the set up instructions for pipeline reference data, and are running one of the supported loading commands as outlined in the latest version of our instructions: https://github.com/broadinstitute/seqr/blob/master/deploy/LOCAL_INSTALL.md#annotating-and-loading-vcf-callsets |
Beta Was this translation helpful? Give feedback.
-
I see. In the future, including the command you actually run is much more helpful than putting in part of the command that is executed. Please provide the actual command you ran, including all the arguments, and the full logs. Without the full logs I can't do much more, but this might be an issue with the input VCF format. Can you confirm that you are running the pipeline on a GATK joint called VCF? |
Beta Was this translation helpful? Give feedback.
-
Please provide the actual command you ran, including all the arguments, and the full logs. Logs are available by running |
Beta Was this translation helpful? Give feedback.
-
Since there was nothing informative in the logs I reran the analyses with a newly "clean" called vcf (without existing annotations). But now we hit another error:
When I narrow this down to where in the vcf this exception gets triggered it seems to only happen for variants on chrX, chrY and chrM. Loading the vcf without those 3 contigs works perfectly (chr1 -> 22). Test VCF data:
Working vcf line:
Failing vcf lines:
Any idea why this HailException would get triggered only on chrX, chrY and chrM variants in the vcf? could this be a ploidy handling issue? |
Beta Was this translation helpful? Give feedback.
-
I've run into this error before, and it is related to ploidy - for context, the issue is that our loading pipeline uses Hail's function for splitting multi-allelic sites: https://hail.is/docs/0.2/methods/genetics.html#hail.methods.split_multi_hts - which expects three scores for PL (and might expect a specific number for other scores, I've run into the PL issue first). For variants with a haploid call, our pipeline will throw an error because it's expecting scores that don't exist for a haploid call. Currently we don't have a good fix for this other than not including haploid calls - digging into the source for the function https://hail.is/docs/0.2/_modules/hail/methods/statgen.html#split_multi_hts, you might be able to work around this by removing the PL field from the VCF - but our downstream code might be expecting a PL score. |
Beta Was this translation helpful? Give feedback.
-
When running
python3 -m seqr_loading SeqrMTToESTask --local-scheduler
I run into an error.Looking at the source code I can't figure out why this occurs. Hopefully someone here can provide some assistance on how I can further debug this.
Version info:
Error log:
Beta Was this translation helpful? Give feedback.
All reactions