Skip to content

Latest commit

 

History

History
50 lines (33 loc) · 1.89 KB

README.md

File metadata and controls

50 lines (33 loc) · 1.89 KB

DecoyPYrat

DecoyPYrat - Fast Hybrid Decoy Sequence Database Creation for Proteomic Mass Spectrometery Analyses

Accurate statistical evaluation of sequence database peptide identifications from tandem mass spectra is essential in mass spectrometry based proteomics experiments. These statistics are dependent on accurately modelling random identifications.

The target-decoy approach has risen to become the de-facto approach to calculating false discovery rates (FDR) in proteomic datasets. The main principle of this approach is to search a set of decoy protein sequences that emulate the size and composition of the target protein sequences searched whilst not matching real proteins in the sample.

DecoyPYrat creates decoy protein sequences by following these steps: each protein is reversed and the cleavage sites switched with preceding amino acid. Peptides are checked for existence in target sequences and if found the tool will attempt to shuffle them iterativly until they are unique.

Download and installation

Bioconda

DecoyPYrat is available in the bioconda bioinformatics software repository. To access it, first install Miniconda, and then run the command:

conda install -c bioconda decoypyrat

After this, you can invoke the software like this:

decoypyrat

Direct script usage

You can clone this repository and invoke the software like this:

python decoypyrat/decoyPYrat.py

Getting help

You can see the full usage instructions by specifying the "-h" argument:

If installing with Bioconda:

decoypyrat -h

If using the script directly:

python decoypyrat/decoyPYrat.py -h

Citation:

DecoyPyrat: Fast Non-redundant Hybrid Decoy Sequence Generation for Large Scale Proteomics. J Proteomics Bioinform. 2016 Jun 27;9(6):176-180.