Skip to content

Commit

Permalink
docs update
Browse files Browse the repository at this point in the history
  • Loading branch information
mikolmogorov committed Oct 25, 2020
1 parent 357c971 commit 72f504a
Show file tree
Hide file tree
Showing 2 changed files with 82 additions and 27 deletions.
26 changes: 26 additions & 0 deletions LICENSE
Original file line number Diff line number Diff line change
@@ -0,0 +1,26 @@
maf2synteny is distributed with the BSD license.

Copyright (c) 2013-2014, Mikhail Kolmogorov and coauthors
All rights reserved.

Redistribution and use in source and binary forms, with or without
modification, are permitted provided that the following conditions are met:
* Redistributions of source code must retain the above copyright
notice, this list of conditions and the following disclaimer.
* Redistributions in binary form must reproduce the above copyright
notice, this list of conditions and the following disclaimer in the
documentation and/or other materials provided with the distribution.
* Neither the name of the authors nor the
names of contributors may be used to endorse or promote products
derived from this software without specific prior written permission.

THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND
ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED
WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE
DISCLAIMED. IN NO EVENT SHALL AUTHORS BE LIABLE FOR ANY
DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES
(INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES;
LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND
ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT
(INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS
SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
83 changes: 56 additions & 27 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,37 +1,66 @@
maf2synteny
===========

A tool for converting for recovering synteny blocks
from multiple alignment (in MAF fromat)
A tool that postprocesses whole genome alignment (for two or more genomes)
and produces coarse-grained synteny blocks. Currently, either
[Cactus](https://github.com/ComparativeGenomicsToolkit/cactus)
or [SibeliaZ](https://github.com/medvedevgroup/SibeliaZ) alignments are supported.

This tool is a standalone version of Ragout module [http://fenderglass.github./Ragout]
This tool is originally a part of [Ragout package](http://fenderglass.github.io/Ragout).

Install
-------
Building
--------

`make`

Building requires C++11 compiler and development environent installed in your system

make

Usage
-----

Usage: maf2synteny [-o out_dir] [-s simpl_params] [-m block_sizes] maf_file

positional arguments:
maf_file path to maf file

optional arguments:
-o out_dir path to the output directory [default = .]
-s simpl_params path to a file with simplifcation parameters [default = not set]
-b block_sizes comma-separated list of synteny block scales [default = 5000]

Default simplification parameters:
----------------------------------
10, 10
30, 100
50, 5000
100, 7000
500, 10000
1500, 50000
5000, 100000
10000, 500000
50000, 1000000
```
Usage: maf2synteny [-o out_dir] [-s simpl_params] [-m block_sizes] alignment_file
positional arguments:
alignment_file path to alignment file in maf or gff format
optional arguments:
-o out_dir path to the output directory [default = .]
-s simpl_params path to a file with simplifcation parameters [default = not set]
-b block_sizes comma-separated list of synteny block scales [default = 5000]
```

Input
-----

maf2synteny takes as input whole genome alignment in `.maf` or `.gff` formats.
The tools is optimized to work with Cactus or SibeliaZ alignments. Other
tools that generate alignments in similar format might not be supported.

The main reason for that is that maf2synteny expect all local alignments to be
non-overlapping, which is a very specific kind of whole genome alignmet.
Such alignment characterizes all repeated elements in the genome(s)
in the alphabet of maximum non-overlapping units. If the input alignmet
does not have this property, maf2synteny will likely not be able to produce good-quality
synteny blocks.

Simplification parameters
---------------------------------

One can specify custom graph simplification parameters to control
the granularity of produced synteny blocks. The paramers
could be supplied in a text file, where each line contains
two numbers: `min_block max_gap`. Please refer to the paper below
for the explanation how these parameters control the block
generation rpcedure.

Default parameters: {30, 10}, {100, 100}, {500, 1000}, {1000, 5000}, {5000, 15000}


Citation
--------

* Kolmogorov M, Armstrong J, Raney BJ, Streeter I, Dunn M, Yang F, Odom D, Flicek P, Keane TM,
Thybert D, Paten B. "Chromosome assembly of large and complex genomes using multiple references"
Genome research. 2018 [doi:10.1101/gr.236273.118](https://genome.cshlp.org/content/28/11/1720.short)

0 comments on commit 72f504a

Please sign in to comment.