Improved base calling for the Illumina Genome Analyzer using machine learning strategies

被引:169
作者
Kircher, Martin [1 ]
Stenzel, Udo [1 ]
Kelso, Janet [1 ]
机构
[1] Max Planck Inst Evolutionary Anthropol, Dept Evolutionary Genet, D-04103 Leipzig, Germany
来源
GENOME BIOLOGY | 2009年 / 10卷 / 08期
关键词
Quality Score; Additional Data File; Base Calling; Base Caller; Sequencing Chemistry;
D O I
10.1186/gb-2009-10-8-r83
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
The Illumina Genome Analyzer generates millions of short sequencing reads. We present Ibis (Improved base identification system), an accurate, fast and easy-to-use base caller that significantly reduces the error rate and increases the output of usable reads. Ibis is faster and more robust with respect to chemistry and technology than other publicly available packages. Ibis is freely available under the GPL from http://bioinf.eva.mpg.de/Ibis/.
引用
收藏
页数:9
相关论文
共 10 条
[1]   Accurate whole human genome sequencing using reversible terminator chemistry [J].
Bentley, David R. ;
Balasubramanian, Shankar ;
Swerdlow, Harold P. ;
Smith, Geoffrey P. ;
Milton, John ;
Brown, Clive G. ;
Hall, Kevin P. ;
Evers, Dirk J. ;
Barnes, Colin L. ;
Bignell, Helen R. ;
Boutell, Jonathan M. ;
Bryant, Jason ;
Carter, Richard J. ;
Cheetham, R. Keira ;
Cox, Anthony J. ;
Ellis, Darren J. ;
Flatbush, Michael R. ;
Gormley, Niall A. ;
Humphray, Sean J. ;
Irving, Leslie J. ;
Karbelashvili, Mirian S. ;
Kirk, Scott M. ;
Li, Heng ;
Liu, Xiaohai ;
Maisinger, Klaus S. ;
Murray, Lisa J. ;
Obradovic, Bojan ;
Ost, Tobias ;
Parkinson, Michael L. ;
Pratt, Mark R. ;
Rasolonjatovo, Isabelle M. J. ;
Reed, Mark T. ;
Rigatti, Roberto ;
Rodighiero, Chiara ;
Ross, Mark T. ;
Sabot, Andrea ;
Sankar, Subramanian V. ;
Scally, Aylwyn ;
Schroth, Gary P. ;
Smith, Mark E. ;
Smith, Vincent P. ;
Spiridou, Anastassia ;
Torrance, Peta E. ;
Tzonev, Svilen S. ;
Vermaas, Eric H. ;
Walter, Klaudia ;
Wu, Xiaolin ;
Zhang, Lu ;
Alam, Mohammed D. ;
Anastasi, Carole .
NATURE, 2008, 456 (7218) :53-59
[2]   Shotgun bisulphite sequencing of the Arabidopsis genome reveals DNA methylation patterning [J].
Cokus, Shawn J. ;
Feng, Suhua ;
Zhang, Xiaoyu ;
Chen, Zugen ;
Merriman, Barry ;
Haudenschild, Christian D. ;
Pradhan, Sriharsa ;
Nelson, Stanley F. ;
Pellegrini, Matteo ;
Jacobsen, Steven E. .
NATURE, 2008, 452 (7184) :215-219
[3]   On the algorithmic implementation of multiclass kernel-based vector machines [J].
Crammer, K ;
Singer, Y .
JOURNAL OF MACHINE LEARNING RESEARCH, 2002, 2 (02) :265-292
[4]   Alta-Cyclic: a selfoptimizing base caller for next-generation sequencing [J].
Erlich, Yaniv ;
Mitra, Partha P. ;
delaBastide, Melissa ;
McCombie, W. Richard ;
Hannon, Gregory J. .
NATURE METHODS, 2008, 5 (08) :679-682
[5]   Base-calling of automated sequencer traces using phred.: II.: Error probabilities [J].
Ewing, B ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :186-194
[6]   SOAP: short oligonucleotide alignment program [J].
Li, Ruiqiang ;
Li, Yingrui ;
Kristiansen, Karsten ;
Wang, Jun .
BIOINFORMATICS, 2008, 24 (05) :713-714
[7]  
Pournelle G. H., 1953, Journal of Mammalogy, V34, P133, DOI 10.1890/0012-9658(2002)083[1421:SDEOLC]2.0.CO
[8]  
2
[9]   Probabilistic base calling of Solexa sequencing data [J].
Rougemont, Jacques ;
Amzallag, Arnaud ;
Iseli, Christian ;
Farinelli, Laurent ;
Xenarios, Ioannis ;
Naef, Felix .
BMC BIOINFORMATICS, 2008, 9 (1)
[10]  
Tsochantaridis I, 2005, J MACH LEARN RES, V6, P1453