ProbeMatch: rapid alignment of oligonucleotides to genome allowing both gaps and mismatches

被引:17
作者
Kim, You Jung [2 ]
Teletia, Nikhil [1 ]
Ruotti, Victor [5 ]
Maher, Christopher A. [3 ]
Chinnaiyan, Arul M. [3 ]
Stewart, Ron [5 ]
Thomson, James A. [4 ,5 ]
Patel, Jignesh M. [1 ]
机构
[1] Univ Wisconsin, Dept Comp Sci, Madison, WI 53706 USA
[2] Univ Michigan, Dept EECS, Ann Arbor, MI 48109 USA
[3] Univ Michigan, Dept Pathol, Ann Arbor, MI 48109 USA
[4] Univ Wisconsin, Sch Med & Publ Hlth, Madison, WI USA
[5] Morgridge Inst Res, Madison, WI USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
D O I
10.1093/bioinformatics/btp178
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
We have developed a tool, called ProbeMatch, for matching a large set of oligonucleotide sequences against a genome database using gapped alignments. Unlike most of the existing tools such as ELAND which only perform ungapped alignments allowing at most two mismatches, ProbeMatch generates both ungapped and gapped alignments allowing up to three errors including insertion, deletion and mismatch. To speedup sequence alignment, ProbeMatch uses gapped q-grams and q-grams of various patterns to identify target hits to a query sequence. This approach results in fewer initial sequences to examine with no loss in sensitivity. ProbeMatch has been used to align 169 095 Illumina GAII reads against the human genome, which could not be mapped by ELAND, and found alignments for 28 625 reads of the 169 095 reads in less than 3 h.
引用
收藏
页码:1424 / 1425
页数:2
相关论文
共 6 条
[1]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[2]  
Amdahl, 1967, AFIPS C P, P483, DOI DOI 10.1145/1465482.1465560
[3]  
Burkhardt S, 2003, FUND INFORM, V56, P51
[4]   Whole-genome sequencing and variant discovery in C-elegans [J].
Hillier, LaDeana W. ;
Marth, Gabor T. ;
Quinlan, Aaron R. ;
Dooling, David ;
Fewell, Ginger ;
Barnett, Derek ;
Fox, Paul ;
Glasscock, Jarret I. ;
Hickenbotham, Matthew ;
Huang, Weichun ;
Magrini, Vincent J. ;
Richt, Ryan J. ;
Sander, Sacha N. ;
Stewart, Donald A. ;
Stromberg, Michael ;
Tsung, Eric F. ;
Wylie, Todd ;
Schedl, Tim ;
Wilson, Richard K. ;
Mardis, Elaine R. .
NATURE METHODS, 2008, 5 (02) :183-188
[5]   SeqMap: mapping massive amount of oligonucleotides to the genome [J].
Jiang, Hui ;
Wong, Wing Hung .
BIOINFORMATICS, 2008, 24 (20) :2395-2396
[6]   SOAP: short oligonucleotide alignment program [J].
Li, Ruiqiang ;
Li, Yingrui ;
Kristiansen, Karsten ;
Wang, Jun .
BIOINFORMATICS, 2008, 24 (05) :713-714