Efficiency and Power as a Function of Sequence Coverage, SNP Array Density, and Imputation

被引:15
作者
Flannick, Jason [1 ,2 ,3 ]
Korn, Joshua M. [1 ,2 ,3 ,4 ,5 ]
Fontanillas, Pierre [1 ]
Grant, George B. [1 ]
Banks, Eric [1 ]
Depristo, Mark A. [1 ]
Altshuler, David [1 ,2 ,3 ,6 ]
机构
[1] Broad Inst Harvard & MIT, Cambridge, MA USA
[2] Massachusetts Gen Hosp, Dept Mol Biol, Boston, MA 02114 USA
[3] Massachusetts Gen Hosp, Diabet Unit, Boston, MA 02114 USA
[4] MIT, Harvard Mit Div Hlth Sci & Technol, Cambridge, MA 02139 USA
[5] Harvard Univ, Grad Program Biophys, Cambridge, MA 02138 USA
[6] Harvard Univ, Sch Med, Dept Genet & Med, Boston, MA USA
关键词
WHOLE-GENOME ASSOCIATION; WIDE ASSOCIATION; RARE VARIANTS; GENOTYPE; DISCOVERY; COMMON; LOCI; SUSCEPTIBILITY; INFERENCE; ACCURACY;
D O I
10.1371/journal.pcbi.1002604
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
High coverage whole genome sequencing provides near complete information about genetic variation. However, other technologies can be more efficient in some settings by (a) reducing redundant coverage within samples and (b) exploiting patterns of genetic variation across samples. To characterize as many samples as possible, many genetic studies therefore employ lower coverage sequencing or SNP array genotyping coupled to statistical imputation. To compare these approaches individually and in conjunction, we developed a statistical framework to estimate genotypes jointly from sequence reads, array intensities, and imputation. In European samples, we find similar sensitivity (89%) and specificity (99.6%) from imputation with either 1 x sequencing or 1 M SNP arrays. Sensitivity is increased, particularly for low-frequency polymorphisms (MAF <5%), when low coverage sequence reads are added to dense genome-wide SNP arrays - the converse, however, is not true. At sites where sequence reads and array intensities produce different sample genotypes, joint analysis reduces genotype errors and identifies novel error modes. Our joint framework informs the use of next-generation sequencing in genome wide association studies and supports development of improved methods for genotype calling.
引用
收藏
页数:13
相关论文
共 54 条
[31]   Low-coverage sequencing: Implications for design of complex trait association studies [J].
Li, Yun ;
Sidore, Carlo ;
Kang, Hyun Min ;
Boehnke, Michael ;
Abecasis, Goncalo R. .
GENOME RESEARCH, 2011, 21 (06) :940-951
[32]   Finding the missing heritability of complex diseases [J].
Manolio, Teri A. ;
Collins, Francis S. ;
Cox, Nancy J. ;
Goldstein, David B. ;
Hindorff, Lucia A. ;
Hunter, David J. ;
McCarthy, Mark I. ;
Ramos, Erin M. ;
Cardon, Lon R. ;
Chakravarti, Aravinda ;
Cho, Judy H. ;
Guttmacher, Alan E. ;
Kong, Augustine ;
Kruglyak, Leonid ;
Mardis, Elaine ;
Rotimi, Charles N. ;
Slatkin, Montgomery ;
Valle, David ;
Whittemore, Alice S. ;
Boehnke, Michael ;
Clark, Andrew G. ;
Eichler, Evan E. ;
Gibson, Greg ;
Haines, Jonathan L. ;
Mackay, Trudy F. C. ;
McCarroll, Steven A. ;
Visscher, Peter M. .
NATURE, 2009, 461 (7265) :747-753
[33]   Genotype imputation for genome-wide association studies [J].
Marchini, Jonathan ;
Howie, Bryan .
NATURE REVIEWS GENETICS, 2010, 11 (07) :499-511
[34]   Genome-wide association studies for complex traits: consensus, uncertainty and challenges [J].
McCarthy, Mark I. ;
Abecasis, Goncalo R. ;
Cardon, Lon R. ;
Goldstein, David B. ;
Little, Julian ;
Ioannidis, John P. A. ;
Hirschhorn, Joel N. .
NATURE REVIEWS GENETICS, 2008, 9 (05) :356-369
[35]   Genetic Heterogeneity in Human Disease [J].
McClellan, Jon ;
King, Mary-Claire .
CELL, 2010, 141 (02) :210-217
[36]   APPLICATIONS OF NEXT-GENERATION SEQUENCING Sequencing technologies - the next generation [J].
Metzker, Michael L. .
NATURE REVIEWS GENETICS, 2010, 11 (01) :31-46
[37]   Mapping copy number variation by population-scale genome sequencing [J].
Mills, Ryan E. ;
Walter, Klaudia ;
Stewart, Chip ;
Handsaker, Robert E. ;
Chen, Ken ;
Alkan, Can ;
Abyzov, Alexej ;
Yoon, Seungtai Chris ;
Ye, Kai ;
Cheetham, R. Keira ;
Chinwalla, Asif ;
Conrad, Donald F. ;
Fu, Yutao ;
Grubert, Fabian ;
Hajirasouliha, Iman ;
Hormozdiari, Fereydoun ;
Iakoucheva, Lilia M. ;
Iqbal, Zamin ;
Kang, Shuli ;
Kidd, Jeffrey M. ;
Konkel, Miriam K. ;
Korn, Joshua ;
Khurana, Ekta ;
Kural, Deniz ;
Lam, Hugo Y. K. ;
Leng, Jing ;
Li, Ruiqiang ;
Li, Yingrui ;
Lin, Chang-Yun ;
Luo, Ruibang ;
Mu, Xinmeng Jasmine ;
Nemesh, James ;
Peckham, Heather E. ;
Rausch, Tobias ;
Scally, Aylwyn ;
Shi, Xinghua ;
Stromberg, Michael P. ;
Stuetz, Adrian M. ;
Urban, Alexander Eckehart ;
Walker, Jerilyn A. ;
Wu, Jiantao ;
Zhang, Yujun ;
Zhang, Zhengdong D. ;
Batzer, Mark A. ;
Ding, Li ;
Marth, Gabor T. ;
McVean, Gil ;
Sebat, Jonathan ;
Snyder, Michael ;
Wang, Jun .
NATURE, 2011, 470 (7332) :59-65
[38]   Deep sequencing reveals 50 novel genes for recessive cognitive disorders [J].
Najmabadi, Hossein ;
Hu, Hao ;
Garshasbi, Masoud ;
Zemojtel, Tomasz ;
Abedini, Seyedeh Sedigheh ;
Chen, Wei ;
Hosseini, Masoumeh ;
Behjati, Farkhondeh ;
Haas, Stefan ;
Jamali, Payman ;
Zecha, Agnes ;
Mohseni, Marzieh ;
Puettmann, Lucia ;
Vahid, Leyla Nouri ;
Jensen, Corinna ;
Moheb, Lia Abbasi ;
Bienek, Melanie ;
Larti, Farzaneh ;
Mueller, Ines ;
Weissmann, Robert ;
Darvish, Hossein ;
Wrogemann, Klaus ;
Hadavi, Valeh ;
Lipkowitz, Bettina ;
Esmaeeli-Nieh, Sahar ;
Wieczorek, Dagmar ;
Kariminejad, Roxana ;
Firouzabadi, Saghar Ghasemi ;
Cohen, Monika ;
Fattahi, Zohreh ;
Rost, Imma ;
Mojahedi, Faezeh ;
Hertzberg, Christoph ;
Dehghan, Atefeh ;
Rajab, Anna ;
Banavandi, Mohammad Javad Soltani ;
Hoffer, Julia ;
Falah, Masoumeh ;
Musante, Luciana ;
Kalscheuer, Vera ;
Ullmann, Reinhard ;
Kuss, Andreas Walter ;
Tzschach, Andreas ;
Kahrizi, Kimia ;
Ropers, H. Hilger .
NATURE, 2011, 478 (7367) :57-63
[39]   Targeted capture and massively parallel sequencing of 12 human exomes [J].
Ng, Sarah B. ;
Turner, Emily H. ;
Robertson, Peggy D. ;
Flygare, Steven D. ;
Bigham, Abigail W. ;
Lee, Choli ;
Shaffer, Tristan ;
Wong, Michelle ;
Bhattacharjee, Arindam ;
Eichler, Evan E. ;
Bamshad, Michael ;
Nickerson, Deborah A. ;
Shendure, Jay .
NATURE, 2009, 461 (7261) :272-U153
[40]   Genotype and SNP calling from next-generation sequencing data [J].
Nielsen, Rasmus ;
Paul, Joshua S. ;
Albrechtsen, Anders ;
Song, Yun S. .
NATURE REVIEWS GENETICS, 2011, 12 (06) :443-451