Department of Computer Science
 Rutgers University

Home page

Home page  Contact us  Site map 

 

 

 

 

 

TreQ: Indel-tolerant read mapping

TreQ is a read mapper for high-throughput DNA sequencing reads, in particular reads from 100 nt to hundreds of nucleotides, and for large edit distance between sequencing read and match in the reference genome. In contrast to existing read mappers, TreQ can cope particularly well with indels, either one long indel; see the figure giving the percentage of accurate matches as a function of indel length for 200 nt reads. TreQ performs best at a time comparable to BWA at large edit distance settings, SSAHA2 is the second best but is five times slower than tree. This makes TreQ an excellent choice for analyzing genetic variants in low-coverage situations and without the need for paired-end sequencing. TreQ will be released under the GPL upon publication.

Publications

Mahmud, Md and Wiedenhoeft, John and Schliep, Alexander . Indel-tolerant Read Mapping with Trinucleotide Frequencies using Cache-Oblivious kd-Trees (2012) [details]

Mahmud, Md. Reduced representations for efficient analysis of genomic data; from microarray to high throughput sequencing (2014) [details]