Releases · labgem/PPanGGOLiN

22 Jan 10:32

axbazin

1.0.13

9f77620

PPanGGOLiN release 1.0.13

Better handling of input files: errors are raised before any processing starts if a name is duplicated or an input file does not exist
Added the '--defrag' option to the 'workflow' command, so the defragmentation pipeline can be used without running each subcommand separately
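
As an illustration of the up-front input check described above, here is a minimal Python sketch. It is not PPanGGOLiN's actual code: the function name `check_input_entries` and the (name, path) pair layout are assumptions made for the example.

```python
from collections import Counter
from pathlib import Path


def check_input_entries(entries):
    """Validate (name, path) pairs before any work starts.

    Hypothetical helper: raises ValueError if a name appears more than
    once, and FileNotFoundError if a listed file does not exist.
    """
    # Count how often each name occurs and report every duplicate at once.
    names = Counter(name for name, _ in entries)
    duplicated = sorted(name for name, count in names.items() if count > 1)
    if duplicated:
        raise ValueError(f"duplicated names: {', '.join(duplicated)}")

    # Check every path so the user sees all missing files in one error.
    missing = [str(path) for _, path in entries if not Path(path).is_file()]
    if missing:
        raise FileNotFoundError(f"missing input files: {', '.join(missing)}")
```

Running all checks before doing any work means a typo in the last line of a long input list fails immediately rather than after hours of computation.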

Bug fixes:

The --soft-core option of the 'evolution' and 'write' commands now actually takes effect when you change it
Cope with .gbk/.gbff files with:
- duplicate 'locus_tag' fields within assemblies and between different assemblies (happens when genomes are downloaded from GenBank) (fix for #25)
- no contig identifier in the VERSION field (happens with Prokka-annotated genomes)
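
One generic way to cope with duplicate 'locus_tag' values is to derive a unique identifier for each gene as it is read. The Python sketch below shows the idea only; the function name `make_unique_ids` and the numeric-suffix scheme are assumptions, not a description of how PPanGGOLiN resolves duplicates.

```python
def make_unique_ids(locus_tags):
    """Return identifiers in input order, with suffixes added to duplicates.

    Hypothetical helper: the first occurrence keeps its tag, later
    occurrences get "_1", "_2", ... until the identifier is unused.
    """
    used = set()
    result = []
    for tag in locus_tags:
        candidate = tag
        suffix = 1
        # Keep trying new suffixes so we never collide with an existing tag.
        while candidate in used:
            candidate = f"{tag}_{suffix}"
            suffix += 1
        used.add(candidate)
        result.append(candidate)
    return result
```

Tracking the set of identifiers already handed out, rather than a per-tag counter, avoids a collision when a file already contains a tag that looks like a generated one.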


16 Nov 14:35

axbazin

1.0.1

c0a4535

PPanGGOLiN release 1.0.1

Bug fixes:

Handle lowercase FASTA sequences (fix for #24)
Better checking of input files, with hopefully clearer errors (fix for #22)
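
Lowercase FASTA input is commonly handled by normalizing sequences to uppercase while reading. The following Python sketch shows that approach under stated assumptions: the function name `read_fasta_upper` is hypothetical, and the code is not taken from PPanGGOLiN.

```python
def read_fasta_upper(lines):
    """Parse FASTA lines into {sequence_id: SEQUENCE}, uppercasing residues.

    Hypothetical helper: the sequence ID is the first whitespace-delimited
    word of the header line; sequence lines before any header are ignored.
    """
    sequences = {}
    header = None
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            fields = line[1:].split()
            header = fields[0] if fields else ""
            sequences[header] = []
        elif header is not None:
            # Normalize case so soft-masked (lowercase) input is accepted.
            sequences[header].append(line.upper())
    return {name: "".join(parts) for name, parts in sequences.items()}
```

For example, `read_fasta_upper([">c1 desc", "acgt", "NNac"])` yields a single entry `c1` whose sequence is `ACGTNNAC`.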


07 Nov 12:17

axbazin

1.0.0

003c4bf

PPanGGOLiN release 1.0.0

New features:

Can choose the number of partitions in the 'workflow' subcommand
Can customize identity and coverage thresholds in the 'cluster' subcommand
Added 4 new possible outputs:
- protein FASTA of the representative sequences of the gene families
- nucleotide FASTA of the representative sequences of the gene families
- nucleotide FASTA of all the CDS
- a list of the gene family IDs and the gene IDs, in a format similar to the .tsv files of MMseqs2
Added unit tests for the different classes, thanks to @sletort
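
To illustrate the family/gene list output, the Python sketch below reads a two-column, tab-separated file into a mapping from family ID to gene IDs. It assumes the column order used by MMseqs2 cluster .tsv files (first column the family or representative ID, second column the member gene ID); the exact layout of PPanGGOLiN's file is not stated here, and `read_families_tsv` is a hypothetical name.

```python
from collections import defaultdict


def read_families_tsv(lines):
    """Group gene IDs by family ID from tab-separated "family<TAB>gene" lines.

    Assumption: column 1 is the gene family ID and column 2 the gene ID.
    Lines with fewer than two columns are skipped.
    """
    families = defaultdict(list)
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 2:
            continue
        family_id, gene_id = fields[0], fields[1]
        families[family_id].append(gene_id)
    return dict(families)
```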

Bug fixes:

The Markov Random Field is no longer taken into account if its criterion reaches infinity (a high-dimensionality problem in statistics); PPanGGOLiN should crash less on VERY fragmented datasets
.gbff/.gbk files are now read properly
Improved compatibility for the .gexf files


25 Sep 13:17

ggautreau

v0.3.88

5bfb77c

Pre-release version

PPanGGOLiN can annotate and build gene families by itself for an easier use, or use annotated genomes and formerly built gene families directly.
PPanGGOLiN can have more than 3 partitions and can estimate the optimal number of partitions.
PPanGGOLiN can run parts of its pipeline separately for better parameter tuning.
PPanGGOLiN can provide a number of output files that will illustrate or describe your pangenome.
PPanGGOLiN uses an HDF5 file to store all the information related to a pangenome, and can reuse or regenerate any of this data for further analysis.
PPanGGOLiN makes a better use of CPUs in a multithreaded run.
PPanGGOLiN can project the pangenome's partitions on a given protein set.

PPanGGOLiN is compatible with macOS

And a lot of bug fixes (and maybe some new bugs).


15 Jul 10:17

ggautreau

0.1.4

65eac3d

0.1.4

Fix a bug in the multithreading of the layout computation


10 Jul 16:21

ggautreau

0.1.3

d3bca76

0.1.3

Fix unexpected behavior when the dispersion around the centroid vector of the shell genome is higher than that of the cloud genome
Fix a bug in the deletion of temporary files
