Pacific biosciences sequencing technology for genotyping and variation discovery in human data

被引:162
作者
Carneiro, Mauricio O. [1 ]
Russ, Carsten [2 ]
Ross, Michael G. [2 ]
Gabriel, Stacey B. [1 ]
Nusbaum, Chad [2 ]
DePristo, Mark A. [1 ]
机构
[1] Broad Inst MIT & Harvard, Med & Populat Genet Program, Cambridge, MA 02141 USA
[2] Broad Inst MIT & Harvard, Genome Sequencing & Anal Program, Cambridge, MA 02141 USA
来源
BMC GENOMICS | 2012年 / 13卷
关键词
CANCER GENOMES; FRAMEWORK;
D O I
10.1186/1471-2164-13-375
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Pacific Biosciences technology provides a fundamentally new data type that provides the potential to overcome some limitations of current next generation sequencing platforms by providing significantly longer reads, single molecule sequencing, low composition bias and an error profile that is orthogonal to other platforms. With these potential advantages in mind, we here evaluate the utility of the Pacific Biosciences RS platform for human medical amplicon resequencing projects. Results: We evaluated the Pacific Biosciences technology for SNP discovery in medical resequencing projects using the Genome Analysis Toolkit, observing high sensitivity and specificity for calling differences in amplicons containing known true or false SNPs. We assessed data quality: most errors were indels (similar to 14%) with few apparent miscalls (similar to 1%). In this work, we define a custom data processing pipeline for Pacific Biosciences data for human data analysis. Conclusion: Critically, the error properties were largely free of the context-specific effects that affect other sequencing technologies. These data show excellent utility for follow-up validation and extension studies in human data and medical genetics projects, but can be extended to other organisms with a reference genome.
引用
收藏
页数:7
相关论文
共 19 条
[1]   A map of human genome variation from population-scale sequencing [J].
Altshuler, David ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Collins, Francis S. ;
De la Vega, Francisco M. ;
Donnelly, Peter ;
Egholm, Michael ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Knoppers, Bartha M. ;
Lander, Eric S. ;
Lehrach, Hans ;
Mardis, Elaine R. ;
McVean, Gil A. ;
Nickerson, DebbieA. ;
Peltonen, Leena ;
Schafer, Alan J. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Deiros, David ;
Metzker, Mike ;
Muzny, Donna ;
Reid, Jeff ;
Wheeler, David ;
Wang, Jun ;
Li, Jingxiang ;
Jian, Min ;
Li, Guoqing ;
Li, Ruiqiang ;
Liang, Huiqing ;
Tian, Geng ;
Wang, Bo ;
Wang, Jian ;
Wang, Wei ;
Yang, Huanming ;
Zhang, Xiuqing ;
Zheng, Huisong ;
Lander, Eric S. ;
Altshuler, David L. ;
Ambrogio, Lauren ;
Bloom, Toby ;
Cibulskis, Kristian ;
Fennell, Tim J. ;
Gabriel, Stacey B. .
NATURE, 2010, 467 (7319) :1061-1073
[2]  
[Anonymous], BRIEF BIOINFORM
[3]   Towards systematic functional characterization of cancer genomes [J].
Boehm, Jesse S. ;
Hahn, William C. .
NATURE REVIEWS GENETICS, 2011, 12 (07) :487-498
[4]   Mutation spectrum revealed by breakpoint sequencing of human germline CNVs [J].
Conrad, Donald F. ;
Bird, Christine ;
Blackburne, Ben ;
Lindsay, Sarah ;
Mamanova, Lira ;
Lee, Charles ;
Turner, Daniel J. ;
Hurles, Matthew E. .
NATURE GENETICS, 2010, 42 (05) :385-U43
[5]   A framework for variation discovery and genotyping using next-generation DNA sequencing data [J].
DePristo, Mark A. ;
Banks, Eric ;
Poplin, Ryan ;
Garimella, Kiran V. ;
Maguire, Jared R. ;
Hartl, Christopher ;
Philippakis, Anthony A. ;
del Angel, Guillermo ;
Rivas, Manuel A. ;
Hanna, Matt ;
McKenna, Aaron ;
Fennell, Tim J. ;
Kernytsky, Andrew M. ;
Sivachenko, Andrey Y. ;
Cibulskis, Kristian ;
Gabriel, Stacey B. ;
Altshuler, David ;
Daly, Mark J. .
NATURE GENETICS, 2011, 43 (05) :491-+
[6]   Analysis of next-generation genomic data in cancer: accomplishments and challenges [J].
Ding, Li ;
Wendl, Michael C. ;
Koboldt, Daniel C. ;
Mardis, Elaine R. .
HUMAN MOLECULAR GENETICS, 2010, 19 :R188-R196
[7]   High-quality draft assemblies of mammalian genomes from massively parallel sequence data [J].
Gnerre, Sante ;
MacCallum, Iain ;
Przybylski, Dariusz ;
Ribeiro, Filipe J. ;
Burton, Joshua N. ;
Walker, Bruce J. ;
Sharpe, Ted ;
Hall, Giles ;
Shea, Terrance P. ;
Sykes, Sean ;
Berlin, Aaron M. ;
Aird, Daniel ;
Costello, Maura ;
Daza, Riza ;
Williams, Louise ;
Nicol, Robert ;
Gnirke, Andreas ;
Nusbaum, Chad ;
Lander, Eric S. ;
Jaffe, David B. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2011, 108 (04) :1513-1518
[8]   Solution hybrid selection with ultra-long oligonucleotides for massively parallel targeted sequencing [J].
Gnirke, Andreas ;
Melnikov, Alexandre ;
Maguire, Jared ;
Rogov, Peter ;
LeProust, Emily M. ;
Brockman, William ;
Fennell, Timothy ;
Giannoukos, Georgia ;
Fisher, Sheila ;
Russ, Carsten ;
Gabriel, Stacey ;
Jaffe, David B. ;
Lander, Eric S. ;
Nusbaum, Chad .
NATURE BIOTECHNOLOGY, 2009, 27 (02) :182-189
[9]   Genomic epidemiology of the Escherichia coli O104:H4 outbreaks in Europe, 2011 [J].
Grad, Yonatan H. ;
Lipsitch, Marc ;
Feldgarden, Michael ;
Arachchi, Harindra M. ;
Cerqueira, Gustavo C. ;
FitzGerald, Michael ;
Godfrey, Paul ;
Haas, Brian J. ;
Murphy, Cheryl I. ;
Russ, Carsten ;
Sykes, Sean ;
Walker, Bruce J. ;
Wortman, Jennifer R. ;
Young, Sarah ;
Zeng, Qiandong ;
Abouelleil, Amr ;
Bochicchio, James ;
Chauvin, Sara ;
DeSmet, Timothy ;
Gujja, Sharvari ;
McCowan, Caryn ;
Montmayeur, Anna ;
Steelman, Scott ;
Frimodt-Moller, Jakob ;
Petersen, Andreas M. ;
Struve, Carsten ;
Krogfelt, Karen A. ;
Bingen, Edouard ;
Weill, Francois-Xavier ;
Lander, Eric S. ;
Nusbaum, Chad ;
Birren, Bruce W. ;
Hung, Deborah T. ;
Hanage, William P. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2012, 109 (08) :3065-3070
[10]  
Li H, 2011, BIOINFOMATICS