MOM: maximum oligonucleotide mapping

被引:27
作者
Eaves, Hugh L. [1 ]
Gao, Yuan [1 ,2 ]
机构
[1] Ctr Study Biol Complex, Richmond, VA USA
[2] Virginia Commonwealth Univ, Dept Comp Sci, Richmond, VA USA
关键词
D O I
10.1093/bioinformatics/btp092
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Current short read mapping programs are based on the reasonable premise that most sequencing errors occur near the 3 end of the read. These programs map reads with either a small number of mismatches in the entire read, or a small number of mismatches in the segment remaining after trimming bases from the 3 end or a single base from the 5 end. Though multiple sequencing errors most likely occur near the 3 end of the reads, they can still occur at the 5 end of the reads. Trimming from the 3 end will not be able to map these reads. We have developed a program, Maximum Oligonucleotide Mapping (MOM), based on the concept of query matching that is designed to capture a maximal length match within the short read satisfying the user defined error parameters. This query matching approach thus accommodates multiple sequencing errors at both ends. We demonstrate that this technique achieves greater sensitivity and a higher percentage of uniquely mapped reads when compared to existing programs such as SOAP, MAQ and SHRiMP.
引用
收藏
页码:969 / 970
页数:2
相关论文
共 4 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   Substantial biases in ultra-short read data sets from high-throughput DNA sequencing [J].
Dohm, Juliane C. ;
Lottaz, Claudio ;
Borodina, Tatiana ;
Himmelbauer, Heinz .
NUCLEIC ACIDS RESEARCH, 2008, 36 (16)
[3]  
Kent WJ, 2002, GENOME RES, V12, P656, DOI [10.1101/gr.229202. Article published online before March 2002, 10.1101/gr.229202]
[4]   SOAP: short oligonucleotide alignment program [J].
Li, Ruiqiang ;
Li, Yingrui ;
Kristiansen, Karsten ;
Wang, Jun .
BIOINFORMATICS, 2008, 24 (05) :713-714