-
Notifications
You must be signed in to change notification settings - Fork 18
iva options
You must give paired reads either as two separate files with
-f reads_1.fastq -r reads_2.fastq
or in a single interleaved file with
--fr reads.fastq
and you must give an output directory. So the minimal usage is either:
iva -f reads_1.fastq -r reads_2.fastq Ouptut_dir
or:
iva --fr reads.fastq Ouptut_dir
Keep intermediate files, which could be many!
By default, everything is deleted and the only
output files are the contigs FASTA file contigs.fasta
and a file info.txt
that has how IVA was run and
versions of all third-party programs it used.
You can give a file of contigs that IVA will try to extend,
instead of generating a de novo assembly.
This option is incompatible with --reference
.
See also --make_new_seeds
and --ctg_first_trim
.
You can provide a reference genome, in which case
IVA will try to assemble one contig per sequence in
this file. For each reference sequence it will
generate a seed sequence from the reads mapped to the
middle 500bp of the reference sequence.
This option is incompatible with --contigs
.
Maximum number of contigs allowed in the assembly. No more seeds generated if the cutoff is reached. Default: 50.
Be verbose by printing messages to stdout. Use up to four times for increasing verbosity. You probably don't want to go above 2. 4 is meant for debugging.
Maximum insert size (includes read length). Reads with inferred insert size more than the maximum will not be used to extend contigs. Default: 800.
Number of threads to use. More than one thread can be used when preprocessing the reads and when running SMALT mapping. Default: 1.
Show program's version number and exit.
Provide the location of trimmomatic.jar file to enable
read trimming before assembly. This option is
required if --adapters
is used.
FASTA file of adapter sequences to be trimmed off
reads. If used, you must also use --trimmomatic
.
Default: use adapters file bundled with IVA.
Minimum length of read after trimming. Only applies
if --trimmomatic
is used.
Default: 50.
FASTA file of primers. The first perfect match found to a sequence in the primers file will be trimmed off the start of each read. This is run after Trimmomatic (if --trimmomatic used).
The kmer hash length used when mapping reads
with SMALT (the -k
option in smalt index
.
Higher numbers will increase speed at the cost of reduced
sensitivity.
Default: 19.
The kmer hash step length when mapping reads
with SMALT (the -s
option in smalt index
).
Higher numbers will increase speed at the cost of reduced
sensitivity.
Default: 11.
The minimum identity threshold for mapping to be reported
by SMALT (the -y
option in smalt map
).
This must be between 0 and 1. The default of 0.5 means that
half of each read must map. This means that up to half of
each read could hang off the end of a contig, so half of the
read can be used to extend a contig.
Default: 0.5.
Number of bases to trim off the end of every contig
before extending for the first time. Only relevant
if --contigs
is used. Default: 25.
During iterative extension, number of bases to trim off the end of a contig when extension fails (then try extending again before generating a new seed). Default: 10.
Minimum kmer depth needed to use that kmer to extend a contig. Default: 5.
Sets N, where kmer for extension must be at least N times more abundant than next most common kmer. Default: 2.
Maximum number of bases to try to extend on each iteration. Default: 100.
Set minimum number of bases of a read hanging off a contig end for those bases to be used for extension. Default: 3.
When no more contigs can be extended, generate a new seed. This is forced to be true when --contigs is not used.
When making a seed sequence, use the most common kmer of this length. Warning: it is not recommended to set this higher than 95. Default: min(median read length, 95).
Stop extending seed using perfect matches from reads when this length is reached. Future extensions are then made by treating the seed as a contig. Default: 0.9*max_insert.
Minimum frequency of kmer needed to be used as a seed. Default:25.
Maximum frequency of kmer needed to be used as a seed. Default: 1000000.
Maximum number of bases to try to extend on each iteration. Default: 50.
Number of overlapping bases needed between read and seed to use that read to extend. Default: seed_start_length.
Minimum kmer depth needed to use that kmer to extend a seed. Default: 5.
Sets N, where kmer for extension must be at least N times more abundant than next most common kmer. Default: 2.