-
Notifications
You must be signed in to change notification settings - Fork 188
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add GATK ValidateVariants #390
Open
G-kodes
wants to merge
5
commits into
snakemake:master
Choose a base branch
from
G-kodes:ValidateVariants
base: master
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Open
Changes from all commits
Commits
Show all changes
5 commits
Select commit
Hold shift + click to select a range
9376252
Init ValidateVariants GATK
G-kodes 7c2131b
House cleaning
G-kodes e523a03
Removed PreLoader in favor of shell().prefix()
G-kodes 65be8f5
Merge branch 'snakemake:master' into ValidateVariants
G-kodes 0819f4f
Merge branch 'master' into ValidateVariants
johanneskoester File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,7 @@ | ||
channels: | ||
- bioconda | ||
- conda-forge | ||
- defaults | ||
dependencies: | ||
- gatk4 ==4.1.4.1 | ||
- snakemake-wrapper-utils ==0.1.3 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,12 @@ | ||
name: GATK ValidateVariants | ||
description: Interleave two paired-end FASTA/Q files | ||
url: https://gatk.broadinstitute.org/hc/en-us/articles/360037057272-ValidateVariants | ||
authors: | ||
- Graeme Ford | ||
input: | ||
vcf: VCF file to be validated | ||
output: | ||
- VCF output file | ||
params: | ||
extra: any extra commands as a string | ||
notes: Multiple threads can be used during compression of the output file with ``pigz``. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change | ||||
---|---|---|---|---|---|---|
@@ -0,0 +1,13 @@ | ||||||
rule vcf_spec_validation: | ||||||
input: | ||||||
vcf="sample.vcf", | ||||||
output: | ||||||
"results/sample_VALID.vcf", | ||||||
log: | ||||||
"results/sample_VALID.log", | ||||||
params: | ||||||
R="genome.fasta", | ||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I cannot find this to be used in the wrapper. On the other hand, extra is missing. Maybe you meant to write
Suggested change
|
||||||
resources: | ||||||
mem_mb=1024, | ||||||
wrapper: | ||||||
"master/bio/gatk/validatevariants" |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,3 @@ | ||
@HD VN:1.5 | ||
@SQ SN:ref LN:45 M5:7a66cae8ab14aef8d635bc80649e730b UR:file:/home/johannes/scms/snakemake-wrappers/bio/picard/createsequencedictionary/test/genome.fasta | ||
@SQ SN:ref2 LN:40 M5:1636753510ec27476fdd109a6684680e UR:file:/home/johannes/scms/snakemake-wrappers/bio/picard/createsequencedictionary/test/genome.fasta |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,4 @@ | ||
>ref | ||
AGCATGTTAGATAAGATAGCTGTGCTAGTAGGCAGTCAGCGCCAT | ||
>ref2 | ||
aggttttataaaacaattaagtctacagagcaactacgcg |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,2 @@ | ||
ref 45 5 45 46 | ||
ref2 40 57 40 41 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,18 @@ | ||
##fileformat=VCFv4.0 | ||
##fileDate=20170110 | ||
##source=pindel | ||
##reference=hg38 | ||
##INFO=<ID=END,Number=1,Type=Integer,Description="End position of the variant described in this record"> | ||
##INFO=<ID=HOMLEN,Number=1,Type=Integer,Description="Length of base pair identical micro-homology at event breakpoints"> | ||
##INFO=<ID=PF,Number=1,Type=Integer,Description="The number of samples carry the variant"> | ||
##INFO=<ID=HOMSEQ,Number=.,Type=String,Description="Sequence of base pair identical micro-homology at event breakpoints"> | ||
##INFO=<ID=SVLEN,Number=1,Type=Integer,Description="Difference in length between REF and ALT alleles"> | ||
##INFO=<ID=SVTYPE,Number=1,Type=String,Description="Type of structural variant"> | ||
##INFO=<ID=NTLEN,Number=.,Type=Integer,Description="Number of bases inserted in place of deleted code"> | ||
##FORMAT=<ID=PL,Number=3,Type=Integer,Description="Normalized, Phred-scaled likelihoods for genotypes as defined in the VCF specification"> | ||
##FORMAT=<ID=GT,Number=1,Type=String,Description="Genotype"> | ||
##FORMAT=<ID=RD,Number=1,Type=Integer,Description="Reference depth, how many reads support the reference"> | ||
##FORMAT=<ID=AD,Number=2,Type=Integer,Description="Allele depth, how many reads support this allele"> | ||
#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT a | ||
ref 8 . A G . PASS END=140434574;HOMLEN=5;HOMSEQ=AAAAA;SVLEN=35;SVTYPE=INS GT:AD 0/0:317,6 | ||
ref2 10 . A AGTTA . PASS END=55238278;HOMLEN=7;HOMSEQ=CTGCCAC;SVLEN=-51;SVTYPE=DEL GT:AD 0/0:40789,1734 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,25 @@ | ||
__author__ = "Graeme Ford" | ||
__copyright__ = "Copyright 2021, Graeme Ford" | ||
__email__ = "[email protected]" | ||
__license__ = "MIT" | ||
|
||
from snakemake.shell import shell | ||
from snakemake_wrapper_utils.java import get_java_opts | ||
|
||
extra = snakemake.params.get("extra", "") | ||
java_opts = get_java_opts(snakemake) | ||
|
||
log = snakemake.log_fmt_shell(stdout=True, stderr=True) | ||
|
||
extra = snakemake.params.get("extra", "") | ||
|
||
|
||
shell( | ||
"gatk " | ||
"--java-options '{java_opts}' " | ||
"ValidateVariants " | ||
"-V {snakemake.input.vcf} " | ||
"{extra} " | ||
"{log} " | ||
"> {output}" | ||
) |
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think the output should just have a .txt suffix or so, since this tool will not create a vcf file, right?