Skip to content

Commit

Permalink
Only keep GFA file and rename it contigs.gfa
Browse files Browse the repository at this point in the history
  • Loading branch information
tseemann committed Sep 3, 2017
1 parent 1f863e3 commit 360dad1
Show file tree
Hide file tree
Showing 2 changed files with 17 additions and 11 deletions.
21 changes: 11 additions & 10 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -71,12 +71,11 @@ You will need to install all the dependencies manually:
* FLASH
* SAMtools >= 1.3
* BWA MEM
* PILON
* KMC
* KMC >= 2
* seqtk
* pigz
* Java
* Trimmomatic
* Pilon (Java)
* Trimmomatic (Java)

## Output files

Expand All @@ -89,12 +88,17 @@ The FASTA description of each sequence in `contigs.fa` have space-separated
`name=value` pairs with the length in bases (`len`), the average coverage
(`cov`), the number of post-assembly SNP/indel corrections made (`corr`),
and the original contig name from Spades (`spades`). Two examples are:

```
>contig00001 len=263154 cov=8.9 corr=1 spades=NODE_1_length_263154_cov_8.86703_pilon
>contig00041 len=339 cov=8.8 corr=0 spades=NODE_41_length_339_cov_8.77027_pilon
```

The (uncorrected) assembly graph file for viewing in
[Bandage](https://rrwick.github.io/Bandage/) is available too:
```
contigs.gfa
```

There is a log file for each of the tools used to generate the assembly:
```
00-shovill.log
Expand All @@ -112,8 +116,6 @@ As Spades is the most important tool used, some useful output files are
kept.<br>
&#9888; Do not confuse the final `contigs.fa` with these files!
```
assembly_graph.fastg
assembly_graph.gfa
before_rr.fasta
contigs.fasta
scaffolds.fasta
Expand All @@ -136,7 +138,7 @@ pilon.changes
--force Force overwite of existing output folder (default: OFF)
--R1 XXX Read 1 FASTQ (default: '')
--R2 XXX Read 2 FASTQ (default: '')
--depth N Sub-sample --R1/--R2 to this depth. Disable with --depth 0 (default: 50)
--depth N Sub-sample --R1/--R2 to this depth. Disable with --depth 0 (default: 100)
--gsize XXX Estimated genome size <blank=AUTODETECT> (default: '')
--kmers XXX K-mers to use <blank=AUTO> (default: '')
--opts XXX Extra SPAdes options eg. --plasmid --sc ... (default: '')
Expand Down Expand Up @@ -166,5 +168,4 @@ Not published yet.
## Authors

* **Torsten Seemann**
* Jason Kwong
* Anders Goncalves da Silva
* Jason Kwong, Simon Gladman, Anders Goncalves da Silva
7 changes: 6 additions & 1 deletion bin/shovill
Original file line number Diff line number Diff line change
Expand Up @@ -229,13 +229,17 @@ for my $id (sort { $len{$b} <=> $len{$a} } keys %{$seq}) {
}
write_fasta("contigs.fa", $seq);

move("assembly_graph.gfa", "contigs.gfa");

msg("Writing $EXE log file.");
move("spades.log", "60-spades.log");

# Cleanup time!
unless ($keepfiles) {
# corrected, overlapped and symlinked original reads
unlink glob("*q.gz");
# keep the .gfa and delete the .fastg
unlink "assembly_graph.fastg";
# BWA indices
unlink glob("$asm.fasta.*");
unlink $BAM, "$BAM.bai";
Expand All @@ -259,7 +263,8 @@ msg("Walltime used: $pretty");
#msg("Type '$EXE --citation' for more details.");

msg("Results in: $outdir");
msg("Final assembly in: $outdir/contigs.fa");
msg("Final assembly graph: $outdir/contigs.gfa");
msg("Final assembly contigs: $outdir/contigs.fa");
msg("It contains $ncontigs (min=$minlen) contigs totalling $nbases bp.");

# Inspiration
Expand Down

0 comments on commit 360dad1

Please sign in to comment.