Skip to content

jnktsj/trim-suppl_BIB100115

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Read quality trimming scripts

The trimming scripts in this repository are used for "Evaluation of pre-processing, mapping, and post-processing algorithms for analyzing whole genome bisulfite sequencing data".

Read quality triming methods can be grouped into two main classes, running-sum and window-based. To test the performance of read trimming methods, we implemented the following trimming algorithms:

Mott trimming (running-sum): mott-trim.py

The method starts from the 3´-end of each read, subtracts a preset cutoff quality score from the quality score at each position and adds the remainder to a cumulative score at the position. The 3´ portion of the read starting from the position with the minimum cumulative score is trimmed.

Dynamic trimming (window-based): dynamic-trim.py

The method searches for the longest stretch of positions (window) in each read such that the quality scores of each position in the window exceed a preset threshold.

Simple trimming: simple-trim.py

In addition to testing the two main trimming algorithm, we also teseted simple trimming algorithm. The method scans from the 5´ end of each read. As soon as it detects a position with quality scores below a preset threshold, it discards this position and the remaining positions at the 3´-end of the read.

About

Three read trimming scripts

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages