-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathREADME.txt
55 lines (39 loc) · 1.98 KB
/
README.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
Programming project
From ADN to formation of proteins : how to align sequences ?
========
The programming project aims to fins optimal alignments between DNA and protein sequences by using dynamic programming.
Algorithms are stored in /src. One can find some our results in /results and the report in /report.
Launch tool
--------
One can find the main script in /bin/main.py. One can change default parameters at the beginig of the script.
The input files are in /bin/example. The script will run over these files. If you don't want to spend a lot of time,
please, delete SubSeq_Res(>320).txt from this folder.
To run script with default parameters please run (your working directory should be the main project directory)
python -m bin.main
Requirements
--------
>300 Mb GPU, Python >=3.6, used packages: numpy, matplolib, os, time
Features
--------
- LongestSubSeq.py - computes the length of one longest commun subsequence between two sequences
given in an input file
- OptimalAlignment.py - Computes and displays (nicely) one optimal alignment between two sequences
given in an input file
- OptimalAlignmentProtein.py - Computes one optimal alignment between two sequences of amino acids
obtained from two sequences of nucleotides given in an input file using Blosum50
- BasicLocalAlignmentSearchTool.py - Tool for searching local alignments between two sequences
Parameters
--------
input_folder - folder with input .txt files
result_folder - folder where results will be stored
save_images = False - if True save images for Tasks 4-8 (memory consuming).
###Tasks 5-8###
open_gap = 10 - penalty for opening gap
increase_gap = 1 #penalty for increasing gap
###Tasks 7-8##
g = 'ADCRGHC' - first string of toy example
t = 'EDADCRGNRADACRGHC' - second string of toy example
k = 4 - number of characters for perfect matches search
th = 0.9 - upper border to accept a perfect match
th_l = 0.1 - lower border to accept alignment
th_protein = 0.6 - upper border to accept a perfect match in test examples