Efficient and Cost Effective Population Resequencing by Pooling and In-Solution Hybridization

被引:42
作者
Bansal, Vikas [1 ]
Tewhey, Ryan [1 ,2 ]
LeProust, Emily M. [3 ]
Schork, Nicholas J. [1 ,4 ]
机构
[1] Scripps Translat Sci Inst, La Jolla, CA 92037 USA
[2] Univ Calif San Diego, Div Biol Sci, La Jolla, CA 92093 USA
[3] Agilent Technol LLSU, Genom, Santa Clara, CA USA
[4] Scripps Res Inst, Dept Mol & Expt Med, La Jolla, CA 92037 USA
关键词
RARE VARIANTS; GENETIC-VARIATION; COMPLEX DISEASES; DNA; ASSOCIATION;
D O I
10.1371/journal.pone.0018353
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
070301 [无机化学]; 070403 [天体物理学]; 070507 [自然资源与国土空间规划学]; 090105 [作物生产系统与生态工程];
摘要
High-throughput sequencing of targeted genomic loci in large populations is an effective approach for evaluating the contribution of rare variants to disease risk. We evaluated the feasibility of using in-solution hybridization-based target capture on pooled DNA samples to enable cost-efficient population sequencing studies. For this, we performed pooled ;sequencing of 100 HapMap samples across similar to 600 kb of DNA sequence using the Illumina GAIIx. Using our accurate variant calling method for pooled sequence data, we were able to not only identify single nucleotide variants with a low false discovery rate (<1%) but also accurately detect short insertion/deletion variants. In addition, with sufficient coverage per individual in each pool (30-fold) we detected 97.2% of the total variants and 93.6% of variants below 5% in frequency. Finally, allele frequencies for single nucleotide variants (SNVs) estimated from the pooled data and the HapMap genotype data were tightly correlated (correlation coefficient >= 0.995).
引用
收藏
页数:6
相关论文
共 18 条
[1]
A map of human genome variation from population-scale sequencing [J].
Altshuler, David ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Collins, Francis S. ;
De la Vega, Francisco M. ;
Donnelly, Peter ;
Egholm, Michael ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Knoppers, Bartha M. ;
Lander, Eric S. ;
Lehrach, Hans ;
Mardis, Elaine R. ;
McVean, Gil A. ;
Nickerson, DebbieA. ;
Peltonen, Leena ;
Schafer, Alan J. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Deiros, David ;
Metzker, Mike ;
Muzny, Donna ;
Reid, Jeff ;
Wheeler, David ;
Wang, Jun ;
Li, Jingxiang ;
Jian, Min ;
Li, Guoqing ;
Li, Ruiqiang ;
Liang, Huiqing ;
Tian, Geng ;
Wang, Bo ;
Wang, Jian ;
Wang, Wei ;
Yang, Huanming ;
Zhang, Xiuqing ;
Zheng, Huisong ;
Lander, Eric S. ;
Altshuler, David L. ;
Ambrogio, Lauren ;
Bloom, Toby ;
Cibulskis, Kristian ;
Fennell, Tim J. ;
Gabriel, Stacey B. .
NATURE, 2010, 467 (7319) :1061-1073
[2]
Integrating common and rare genetic variation in diverse human populations [J].
Altshuler, David M. ;
Gibbs, Richard A. ;
Peltonen, Leena ;
Dermitzakis, Emmanouil ;
Schaffner, Stephen F. ;
Yu, Fuli ;
Bonnen, Penelope E. ;
de Bakker, Paul I. W. ;
Deloukas, Panos ;
Gabriel, Stacey B. ;
Gwilliam, Rhian ;
Hunt, Sarah ;
Inouye, Michael ;
Jia, Xiaoming ;
Palotie, Aarno ;
Parkin, Melissa ;
Whittaker, Pamela ;
Chang, Kyle ;
Hawes, Alicia ;
Lewis, Lora R. ;
Ren, Yanru ;
Wheeler, David ;
Muzny, Donna Marie ;
Barnes, Chris ;
Darvishi, Katayoon ;
Hurles, Matthew ;
Korn, Joshua M. ;
Kristiansson, Kati ;
Lee, Charles ;
McCarroll, Steven A. ;
Nemesh, James ;
Keinan, Alon ;
Montgomery, Stephen B. ;
Pollack, Samuela ;
Price, Alkes L. ;
Soranzo, Nicole ;
Gonzaga-Jauregui, Claudia ;
Anttila, Verneri ;
Brodeur, Wendy ;
Daly, Mark J. ;
Leslie, Stephen ;
McVean, Gil ;
Moutsianas, Loukas ;
Nguyen, Huy ;
Zhang, Qingrun ;
Ghori, Mohammed J. R. ;
McGinnis, Ralph ;
McLaren, William ;
Takeuchi, Fumihiko ;
Grossman, Sharon R. .
NATURE, 2010, 467 (7311) :52-58
[3]
A statistical method for the detection of variants from next-generation resequencing of DNA pools [J].
Bansal, Vikas .
BIOINFORMATICS, 2010, 26 (12) :i318-i324
[4]
High-throughput, pooled sequencing identifies mutations in NUBPL and FOXRED1 in human complex I deficiency [J].
Calvo, Sarah E. ;
Tucker, Elena J. ;
Compton, Alison G. ;
Kirby, Denise M. ;
Crawford, Gabriel ;
Burtt, Noel P. ;
Rivas, Manuel ;
Guiducci, Candace ;
Bruno, Damien L. ;
Goldberger, Olga A. ;
Redman, Michelle C. ;
Wiltshire, Esko ;
Wilson, Callum J. ;
Altshuler, David ;
Gabriel, Stacey B. ;
Daly, Mark J. ;
Thorburn, David R. ;
Mootha, Vamsi K. .
NATURE GENETICS, 2010, 42 (10) :851-+
[5]
Multiple rare Alleles contribute to low plasma levels of HDL cholesterol [J].
Cohen, JC ;
Kiss, RS ;
Pertsemlidis, A ;
Marcel, YL ;
McPherson, R ;
Hobbs, HH .
SCIENCE, 2004, 305 (5685) :869-872
[6]
Druley TE, 2009, NAT METHODS, V6, P263, DOI [10.1038/NMETH.1307, 10.1038/nmeth.1307]
[7]
Human genetic variation and its contribution to complex traits [J].
Frazer, Kelly A. ;
Murray, Sarah S. ;
Schork, Nicholas J. ;
Topol, Eric J. .
NATURE REVIEWS GENETICS, 2009, 10 (04) :241-251
[8]
Solution hybrid selection with ultra-long oligonucleotides for massively parallel targeted sequencing [J].
Gnirke, Andreas ;
Melnikov, Alexandre ;
Maguire, Jared ;
Rogov, Peter ;
LeProust, Emily M. ;
Brockman, William ;
Fennell, Timothy ;
Giannoukos, Georgia ;
Fisher, Sheila ;
Russ, Carsten ;
Gabriel, Stacey ;
Jaffe, David B. ;
Lander, Eric S. ;
Nusbaum, Chad .
NATURE BIOTECHNOLOGY, 2009, 27 (02) :182-189
[9]
Design of Association Studies with Pooled or Un-pooled Next-Generation Sequencing Data [J].
Kim, Su Yeon ;
Li, Yingrui ;
Guo, Yiran ;
Li, Ruiqiang ;
Holmkvist, Johan ;
Hansen, Torben ;
Pedersen, Oluf ;
Wang, Jun ;
Nielsen, Rasmus .
GENETIC EPIDEMIOLOGY, 2010, 34 (05) :479-491
[10]
Fast and accurate short read alignment with Burrows-Wheeler transform [J].
Li, Heng ;
Durbin, Richard .
BIOINFORMATICS, 2009, 25 (14) :1754-1760