4.2.3.0
Download release: gatk-4.2.3.0.zip
Docker image: https://hub.docker.com/r/broadinstitute/gatk/
Highlights of the 4.2.3.0 release:
-
Notable bug fixes for
Mutect2
andFuncotator
-
Support in
CombineGVCFs
andGenotypeGVCFs
for "reblocked" GVCFs as produced by theReblockGVCF
tool. Reblocked GVCFs have a significantly reduced storage footprint. -
More control over the Smith-Waterman parameters in
HaplotypeCaller
andMutect2
-
A new Fragment Allele Depth (
FAD
) variant annotation similar to theAD
annotation except that allele support is considered per read pair, not per individual read -
GenomicsDB bug fixes and enhancements
Full list of changes:
-
HaplotypeCaller/Mutect2
- Fixed a bug where
Mutect2
failed to filter germline variants with alternate representations (#7103)- This caused variants with alternative representations in gnomAD to not be recognized as being the same as called variants in some cases. This resulted in variants that were called and not filtered, but they should have been filtered by "germline".
- Exposed Smith-Waterman parameters as tool arguments in
HaplotypeCaller
,Mutect2
, andFilterAlignmentArtifacts
. (#6885)- Enables use of alternative parameters for different event representation (e.g. three consecutive SNPs instead of two small indels)
- Can now specify the Smith-Waterman implementation in
FilterAlignmentArtifacts
(#7105) - Added a
--debug-assembly-variants-out
diagnostic option to output a side VCF with variants detected by assembly forHaplotypeCaller
andMutect2
(#7384) Mutect2
: the--genotype-germline-sites
argument is no longer marked as experimental (#7533)
- Fixed a bug where
-
GenotypeGVCFs / CombineGVCFs
- Updated
CombineGVCFs
andGenotypeGVCFs
to handle "reblocked" GVCFs with diploid data that are potentially missing hom-ref genotype PLs (#7223) - Homozygous reference genotypes with no PLs and zero depth are now output as no-calls by
GenotypeGVCFs
(#7471) - Bug fixes for
GenotypeGVCFs
/GnarlyGenotyper
when allele-specific annotations have empty values due to lack of informative reads or no depth (#7491) (#7186)
- Updated
-
GenomicsDB
- Added a new
--call-genotypes
GenomicsDB argument, enabling output of called genotypes (i.e. not ./.) when tools likeCombineGVCFs
andSelectVariants
read from a GenomicsDB workspace (#7223) - Added a
--bypass-feature-reader
argument toGenomicsDBImport
to allow the C-based htslib VCF reader implementation to be used instead of the Java implementation (#7393)- Using this option will reduce memory usage and potentially speed up the import process
- Updated to GenomicsDB 1.4.2 (#7520)
- This release fixes a commonly-encountered bookkeeping issue with GenomicsDB array fragments. Should fix errors of the type: "Error: Cannot read from buffer; Error: cannot load book-keeping" as reported in #7012
- Full release notes are here: https://github.com/GenomicsDB/GenomicsDB/releases/tag/v1.4.2
- Added a new
-
Funcotator
-
CNV Calling
- CNV WDLs now handle BAM/CRAM index paths explicitly, as for cases where the index is not in the same path as its file (#7518)
- gCNV in the CASE mode now fills in all hidden DenoisingModelConfig and CopyNumberCallingConfig arguments from the input model configuration (#7464)
- Exposed number of samples used for estimating denoised copy ratios in gCNV via a new
--num-samples-copy-ratio-approx
argument (#7450)
-
SV Calling
JointGermlineCNVSegmentation
: bug fixes and refactoring (#7243)- A number of bugs, particularly with max-clique clustering, have been fixed, as well as a parameter swap bug in
JointGermlineCNVSegmentation
- Reworks classes used by
JointGermlineCNVSegmentation
for SV clustering and defragmentation. The design ofSVClusterEngine
has been overhauled to enable the implementation ofCNVDefragmenter
andBinnedCNVDefragmenter
subclasses. Logic for producing representative records from a collection of clustered SVs has been separated into anSVCollapser
class, which provides enhanced functionality for handling genotypes for SVs more generally.
- A number of bugs, particularly with max-clique clustering, have been fixed, as well as a parameter swap bug in
-
Notable Enhancements
- Added a new Fragment Allele Depth (
FAD
) variant annotation (#7511)- This annotation is identical to the
AD
annotation except that allele support is considered per read pair, not per individual read
- This annotation is identical to the
- Added a new Fragment Allele Depth (
-
Miscellaneous Changes
-
Documentation
-
Dependencies