Swift: primary data analysis for the Illumina Solexa sequencing platform

被引:61
作者
Whiteford, Nava [1 ]
Skelly, Tom [1 ]
Curtis, Christina [2 ]
Ritchie, Matt E. [3 ]
Loehr, Andrea [4 ]
Zaranek, Alexander Wait [5 ]
Abnizova, Irina [1 ]
Brown, Clive [1 ]
机构
[1] Wellcome Trust Sanger Inst, Cambridge CB10 1SA, England
[2] Univ Cambridge, CRUK Cambridge Res Inst, Li Ka Shing Ctr, Dept Oncol, Cambridge CB2 0RE, England
[3] Walter & Eliza Hall Inst Med Res, Bioinformat Div, Parkville, Vic 3052, Australia
[4] Harvard Smithsonian Ctr Astrophys, Cambridge, MA 02138 USA
[5] Harvard Univ, Sch Med, Boston, MA 02115 USA
基金
英国惠康基金;
关键词
D O I
10.1093/bioinformatics/btp383
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Primary data analysis methods are of critical importance in second generation DNA sequencing. Improved methods have the potential to increase yield and reduce the error rates. Openly documented analysis tools enable the user to understand the primary data, this is important for the optimization and validity of their scientific work. Results: In this article, we describe Swift, a new tool for performing primary data analysis on the Illumina Solexa Sequencing Platform. Swift is the first tool, outside of the vendors own software, which completes the full analysis process, from raw images through to base calls. As such it provides an alternative to, and independent validation of, the vendor supplied tool. Our results show that Swift is able to increase yield by 13.8%, at comparable error rate.
引用
收藏
页码:2194 / 2199
页数:6
相关论文
共 16 条
[1]  
[Anonymous], 1982, IMAGE ANAL MATH MORP
[2]   Accurate whole human genome sequencing using reversible terminator chemistry [J].
Bentley, David R. ;
Balasubramanian, Shankar ;
Swerdlow, Harold P. ;
Smith, Geoffrey P. ;
Milton, John ;
Brown, Clive G. ;
Hall, Kevin P. ;
Evers, Dirk J. ;
Barnes, Colin L. ;
Bignell, Helen R. ;
Boutell, Jonathan M. ;
Bryant, Jason ;
Carter, Richard J. ;
Cheetham, R. Keira ;
Cox, Anthony J. ;
Ellis, Darren J. ;
Flatbush, Michael R. ;
Gormley, Niall A. ;
Humphray, Sean J. ;
Irving, Leslie J. ;
Karbelashvili, Mirian S. ;
Kirk, Scott M. ;
Li, Heng ;
Liu, Xiaohai ;
Maisinger, Klaus S. ;
Murray, Lisa J. ;
Obradovic, Bojan ;
Ost, Tobias ;
Parkinson, Michael L. ;
Pratt, Mark R. ;
Rasolonjatovo, Isabelle M. J. ;
Reed, Mark T. ;
Rigatti, Roberto ;
Rodighiero, Chiara ;
Ross, Mark T. ;
Sabot, Andrea ;
Sankar, Subramanian V. ;
Scally, Aylwyn ;
Schroth, Gary P. ;
Smith, Mark E. ;
Smith, Vincent P. ;
Spiridou, Anastassia ;
Torrance, Peta E. ;
Tzonev, Svilen S. ;
Vermaas, Eric H. ;
Walter, Klaudia ;
Wu, Xiaolin ;
Zhang, Lu ;
Alam, Mohammed D. ;
Anastasi, Carole .
NATURE, 2008, 456 (7218) :53-59
[3]  
BROWN CG, 2006, SOLEXA ILLUMINA GAPI
[4]  
CASTRO ED, 1987, IEEE T PATTERN ANAL, V9, P700, DOI DOI 10.1109/TPAMI.1987.4767966
[5]   A benchmark for affymetrix GeneChip expression measures [J].
Cope, LM ;
Irizarry, RA ;
Jaffee, HA ;
Wu, ZJ ;
Speed, TP .
BIOINFORMATICS, 2004, 20 (03) :323-331
[6]   Alta-Cyclic: a selfoptimizing base caller for next-generation sequencing [J].
Erlich, Yaniv ;
Mitra, Partha P. ;
delaBastide, Melissa ;
McCombie, W. Richard ;
Hannon, Gregory J. .
NATURE METHODS, 2008, 5 (08) :679-682
[7]   Base-calling of automated sequencer traces using phred.: II.: Error probabilities [J].
Ewing, B ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :186-194
[8]  
Frigo M, 1998, INT CONF ACOUST SPEE, P1381, DOI 10.1109/ICASSP.1998.681704
[9]   Statistical analysis of an RNA titration series evaluates microarray precision and sensitivity on a whole-array basis [J].
Holloway, Andrew J. ;
Oshlack, Alicia ;
Diyagama, Dileepa S. ;
Bowtell, David D. L. ;
Smyth, Gordon K. .
BMC BIOINFORMATICS, 2006, 7 (1)
[10]  
Li L, 1999, ELECTROPHORESIS, V20, P1433, DOI 10.1002/(SICI)1522-2683(19990601)20:7<1433::AID-ELPS1433>3.3.CO