Skip to content

Commit

Permalink
updated readme with figures and tutorials
Browse files Browse the repository at this point in the history
  • Loading branch information
SimonDMurray authored Feb 19, 2024
1 parent e9342d4 commit 4d38bd3
Showing 1 changed file with 10 additions and 11 deletions.
21 changes: 10 additions & 11 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -10,22 +10,21 @@ The main pipeline logic is as follows:

![image info](./docs/img/pipeline.png)

In full:
* Downloads the genome and gene annotation files `[DOWNLOAD]`.
* Extract gene fasta sequences `[GFFREAD]`.
* Finds orthologous genes using last `[JCVI]`.
* Find syntenic block using MCScanX `[SYNTENY]`.
* Plot figures and create summary output tables `[PLOT_SCORE]`, `[PLOT_TREE]` and `[SUMMARISE_PLOTS]`.

The pipeline downloads the genome and gene annotation [DOWNLOAD], extracts gene fasta sequences [GFFREAD] -> Finds orthologous genes (Last, within JCVI program) [JCVI]-> Find syntenic block (MSCanX, within JCVI program) [SYNTENY]-> Plot figures and create summary output tables [SCORE] [SCORE_TREE] [PLOT_SCORE].
## Tutorial

## nf-synteny tutorials

We have a few short tutorials to help you test and explore the pipeline:

1. Running a basic pipeline run ([click here](https://github.com/Eco-Flow/synteny/blob/nf-test-dev/docs/Gitpod_tutorial.md))
2. Running with gene ontology (In development)
We have a short [tutorial](https://github.com/Eco-Flow/synteny/blob/nf-test-dev/docs/Gitpod_tutorial.md) to help you test and explore the pipeline.

## Installation

Nextflow pipelines require a few prerequisites. There is further documentation on the nf-core webpage [here](https://nf-co.re/docs/usage/installation), about how to install Nextflow.

### Prerequistites
### Prerequisites

- [Docker](https://docs.docker.com/engine/install/) or [Singularity](https://docs.sylabs.io/guides/3.11/admin-guide/installation.html).
- [Java](https://www.java.com/en/download/help/download_options.html) and [openJDK](https://openjdk.org/install/) >= 8 (**Please Note:** When installing Java versions are `1.VERSION` so `Java 8` is `Java 1.8`).
Expand Down Expand Up @@ -77,7 +76,7 @@ Drosophila_santomea,data/Drosophila_santomea/genome.fna.gz,data/Drosophila_santo
* `--tree /path/to/tree/file` - A path to a file containing a phylogenetic tree for all species in Newick format i.e. `data/score_tree_input/tree.txt`.
* `--clean` - A true or false value assigned to this parameter will determine whether the work directory is automatically deleted or not if the pipeline is successful. Deleting the work directory saves space however it will not be possible to use this work directory in future for caching (**Default:** `true`).
* `--architecture` - An `amd` or `arm` value assigned to this parameter determines whether containers built for the amd or arm CPU architecture are used (**Default:** `amd`).
* `--help` - A true value assinged to this parameter will cause the help message to be displayed instead of pipeline running (**Default:** `false`).
* `--help` - A true value assigned to this parameter will cause the help message to be displayed instead of pipeline running (**Default:** `false`).
* `--custom_config` - A path or URL to a custom configuration file.

## Profiles
Expand Down Expand Up @@ -141,7 +140,7 @@ Subdirectories:
3. `My_scores.tsv` - A table (pairwise) of number of syntenic gene pairs, as well as the max and average syntenic block length (in numbers of genes)
4. `Synteny_matrix.tsv` - A Matrix of syntenic gene pair totals (pairwise).
5. `Trans_location_version.out.txt` - A Table of scores (pairwise), documenting numbers of scaffolds, syntenic block, genes, as well as a variety of scores.
6. `Synt_gene_scores` - A folder with pairwise gene scores. Scores are based on the distance to nearest syntenic break. Where '1' means a gene in on the edge of a sytenic block.
6. `Synt_gene_scores` - A folder with pairwise gene scores. Scores are based on the distance to nearest syntenic break. Where '1' means a gene in on the edge of a syntenic block.
7. `My_sim_cores.tsv` - A Matrix containing nucleotide percentage similarities.
8. `My_comp_synteny_similarity.tsv` - A Matrix containing pairwise nucleotide percentages and total number of syntenic genes.

Expand Down

0 comments on commit 4d38bd3

Please sign in to comment.