Fast Principal-Component Analysis Reveals Convergent Evolution of ADH1B in Europe and East Asia

Cited by: 270
Authors
Galinsky, Kevin J. [1, 2]
Bhatia, Gaurav [2, 3]
Loh, Po-Ru [2, 3]
Georgiev, Stoyan [4]
Mukherjee, Sayan [5, 6]
Patterson, Nick J. [2]
Price, Alkes L. [1, 2, 3]
Affiliations
[1] Harvard TH Chan Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
[2] Broad Inst MIT & Harvard, Program Med & Populat Genet, Cambridge, MA 02142 USA
[3] Harvard TH Chan Sch Publ Hlth, Dept Epidemiol, Boston, MA 02115 USA
[4] Google, Palo Alto, CA 94043 USA
[5] Duke Univ, Dept Stat Sci, Dept Comp Sci, Durham, NC 27708 USA
[6] Duke Univ, Dept Math, Durham, NC 27708 USA
Funding
US National Science Foundation;
Keywords
RECENT POSITIVE SELECTION; WHOLE-GENOME ASSOCIATION; NATURAL-SELECTION; POPULATION-STRUCTURE; ALCOHOL DEPENDENCE; GENETIC SIGNATURES; WIDE ASSOCIATION; LOCAL ADAPTATION; ANCESTRY; VARIANTS;
DOI
10.1016/j.ajhg.2015.12.022
Chinese Library Classification number
Q3 [Genetics];
Discipline classification code
071007 [Genetics];
Abstract
Searching for genetic variants with unusual differentiation between subpopulations is an established approach for identifying signals of natural selection. However, existing methods generally require discrete subpopulations. We introduce a method that infers selection using principal components (PCs) by identifying variants whose differentiation along top PCs is significantly greater than the null distribution of genetic drift. To enable the application of this method to large datasets, we developed the FastPCA software, which employs recent advances in random matrix theory to accurately approximate top PCs while reducing time and memory cost from quadratic to linear in the number of individuals, a computational improvement of many orders of magnitude. We apply FastPCA to a cohort of 54,734 European Americans, identifying 5 distinct subpopulations spanning the top 4 PCs. Using the PC-based test for natural selection, we replicate previously known selected loci and identify three new genome-wide significant signals of selection, including selection in Europeans at ADH1B. The coding variant rs1229984*T has previously been associated with a decreased risk of alcoholism and shown to be under selection in East Asians; we show that it is a rare example of independent evolution on two continents. We also detect selection signals at IGFBP3 and IGH, which have also previously been associated with human disease.
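The two ideas in the abstract (approximating top PCs at a cost linear in the number of individuals, then scoring each variant by its differentiation along a PC) can be illustrated with a small sketch. This is not the FastPCA implementation. It assumes a generic randomized range finder with power iterations as the approximation scheme, and it assumes a per-SNP statistic equal to the squared SNP loading times the number of SNPs, which one would compare against a chi-square distribution with 1 degree of freedom. The abstract specifies neither detail. All function names, the simulated data, and the parameters (`oversample`, `n_iter`, `fst`) are hypothetical.

```python
import numpy as np


def simulate_genotypes(n_snps=2000, n_per_pop=100, fst=0.01, seed=0):
    """Toy data (assumption): two populations that drifted from a common ancestor."""
    rng = np.random.default_rng(seed)
    p_anc = rng.uniform(0.1, 0.9, n_snps)
    # Balding-Nichols model: population frequencies are Beta-distributed around p_anc.
    a = p_anc * (1 - fst) / fst
    b = (1 - p_anc) * (1 - fst) / fst
    p_pops = rng.beta(a[:, None], b[:, None], size=(n_snps, 2))
    # Plant one strongly differentiated SNP at index 0 to mimic selection.
    p_pops[0] = [0.1, 0.9]
    blocks = [rng.binomial(2, p_pops[:, [j]], size=(n_snps, n_per_pop))
              for j in range(2)]
    return np.hstack(blocks).astype(float)


def normalize(G):
    """Center each SNP and scale by its binomial standard deviation."""
    p = G.mean(axis=1) / 2
    keep = (p > 0) & (p < 1)  # drop monomorphic SNPs
    G, p = G[keep], p[keep]
    return (G - 2 * p[:, None]) / np.sqrt(2 * p * (1 - p))[:, None]


def randomized_top_pcs(X, k, oversample=10, n_iter=10, seed=0):
    """Approximate the top-k singular triplets of X (SNPs x individuals).

    Only products X @ Q and X.T @ Q with a thin matrix Q are used, so the
    cost grows linearly with the number of individuals.
    """
    rng = np.random.default_rng(seed)
    n_ind = X.shape[1]
    Q = rng.standard_normal((n_ind, k + oversample))
    for _ in range(n_iter):
        # Power iteration; QR keeps the basis numerically stable.
        Q, _ = np.linalg.qr(X @ Q)
        Q, _ = np.linalg.qr(X.T @ Q)
    B = X @ Q  # X projected onto the approximate row space
    U, s, Wt = np.linalg.svd(B, full_matrices=False)
    V = Q @ Wt.T  # approximate PCs in individual space
    return U[:, :k], s[:k], V[:, :k]


def pc_selection_stats(U):
    """Assumed statistic: n_snps * (SNP loading)^2, one column per PC."""
    return U.shape[0] * U ** 2


X = normalize(simulate_genotypes())
U, s, V = randomized_top_pcs(X, k=2)
stats = pc_selection_stats(U)
print("statistic for the planted SNP on PC1:", stats[0, 0])
```

Because each column of `U` has unit norm, the statistic averages exactly 1 across SNPs for every PC, matching the mean of a chi-square variable with 1 degree of freedom. Whether the full null distribution holds depends on the drift model, so treat the calibration as an assumption to check rather than a guarantee.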
Pages: 456-472
Page count: 17