Tracing Sub-Structure in the European American Population with PCA-Informative Markers

被引:53
作者
Paschou, Peristera [1 ]
Drineas, Petros [2 ]
Lewis, Jamey [2 ]
Nievergelt, Caroline M. [3 ,4 ]
Nickerson, Deborah A. [5 ]
Smith, Joshua D. [5 ]
Ridker, Paul M. [6 ,7 ]
Chasman, Daniel I. [7 ]
Krauss, Ronald M. [8 ]
Ziv, Elad [9 ,10 ]
机构
[1] Democritus Univ Thrace, Dept Mol Biol & Genet, Alexandroupolis, Greece
[2] Rensselaer Polytech Inst, Dept Comp Sci, Troy, NY 12180 USA
[3] Scripps Res Inst, Dept Mol & Expt Med, La Jolla, CA USA
[4] Univ Calif San Diego, Dept Psychiat, La Jolla, CA 92093 USA
[5] Univ Washington, Dept Genome Sci, Seattle, WA USA
[6] Brigham & Womens Hosp, Div Cardiovasc Dis, Ctr Cardiovasc Dis Prevent, Boston, MA 02115 USA
[7] Brigham & Womens Hosp, Div Prevent Med, Boston, MA 02115 USA
[8] Childrens Hosp Oakland, Res Inst, Oakland, CA 94609 USA
[9] Univ Calif San Francisco, Inst Human Genet, Div Gen Internal Med, San Francisco, CA 94143 USA
[10] Univ Calif San Francisco, Ctr Comprehens Canc, San Francisco, CA 94143 USA
来源
PLOS GENETICS | 2008年 / 4卷 / 07期
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
D O I
10.1371/journal.pgen.1000114
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Genetic structure in the European American population reflects waves of migration and recent gene flow among different populations. This complex structure can introduce bias in genetic association studies. Using Principal Components Analysis (PCA), we analyze the structure of two independent European American datasets (1,521 individuals-307,315 autosomal SNPs). Individual variation lies across a continuum with some individuals showing high degrees of admixture with non-European populations, as demonstrated through joint analysis with HapMap data. The CEPH Europeans only represent a small fraction of the variation encountered in the larger European American datasets we studied. We interpret the first eigenvector of this data as correlated with ancestry, and we apply an algorithm that we have previously described to select PCA-informative markers (PCAIMs) that can reproduce this structure. Importantly, we develop a novel method that can remove redundancy from the selected SNP panels and show that we can effectively remove correlated markers, thus increasing genotyping savings. Only 150-200 PCAIMs suffice to accurately predict fine structure in European American datasets, as identified by PCA. Simulating association studies, we couple our method with a PCA-based stratification correction tool and demonstrate that a small number of PCAIMs can efficiently remove false correlations with almost no loss in power. The structure informative SNPs that we propose are an important resource for genetic association studies of European Americans. Furthermore, our redundancy removal algorithm can be applied on sets of ancestry informative markers selected with any method in order to select the most uncorrelated SNPs, and significantly decreases genotyping costs.
引用
收藏
页数:13
相关论文
共 60 条
[41]   Association mapping in structured populations [J].
Pritchard, JK ;
Stephens, M ;
Rosenberg, NA ;
Donnelly, P .
AMERICAN JOURNAL OF HUMAN GENETICS, 2000, 67 (01) :170-181
[42]  
REICHENSTEIN W, 2001, J WEALTH MANAGEMENT, V4, P16
[43]   Polymorphisms of the HNF1A gene encoding hepatocyte nuclear factor-1α are associated with C-reactive protein [J].
Reiner, Alexander P. ;
Barber, Mathew J. ;
Guan, Yongtao ;
Ridker, Paul M. ;
Lange, Leslie A. ;
Chasman, Daniel I. ;
Walston, Jeremy D. ;
Cooper, Gregory M. ;
Jenny, Nancy S. ;
Rieder, Mark J. ;
Durda, J. Peter ;
Smith, Joshua D. ;
Novembre, John ;
Tracy, Russell P. ;
Rotter, Jerome I. ;
Stephens, Matthew ;
Nickerson, Deborah A. ;
Krauss, Ronald M. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2008, 82 (05) :1193-1201
[44]   In search of geographical patterns in European mitochondrial DNA [J].
Richards, M ;
Macaulay, V ;
Torroni, A ;
Bandelt, HJ .
AMERICAN JOURNAL OF HUMAN GENETICS, 2002, 71 (05) :1168-1174
[45]   Clines, clusters, and the effect of study design on the inference of human population structure [J].
Rosenberg, NA ;
Mahajan, S ;
Ramachandran, S ;
Zhao, CF ;
Pritchard, JK ;
Feldman, MW .
PLOS GENETICS, 2005, 1 (06) :660-671
[46]   Informativeness of genetic markers for inference of ancestry [J].
Rosenberg, NA ;
Li, LM ;
Ward, R ;
Pritchard, JK .
AMERICAN JOURNAL OF HUMAN GENETICS, 2003, 73 (06) :1402-1422
[47]   Accounting for unmeasured population substructure in case-control studies of genetic association using a novel latent-class model [J].
Satten, GA ;
Flanders, WD ;
Yang, QH .
AMERICAN JOURNAL OF HUMAN GENETICS, 2001, 68 (02) :466-477
[48]   Genome-wide association analysis identifies loci for type 2 diabetes and triglyceride levels [J].
Saxena, Richa ;
Voight, Benjamin F. ;
Lyssenko, Valeriya ;
Burtt, Noel P. ;
de Bakker, Paul I. W. ;
Chen, Hong ;
Roix, Jeffrey J. ;
Kathiresan, Sekar ;
Hirschhorn, Joel N. ;
Daly, Mark J. ;
Hughes, Thomas E. ;
Groop, Leif ;
Altshuler, David ;
Almgren, Peter ;
Florez, Jose C. ;
Meyer, Joanne ;
Ardlie, Kristin ;
Bostroem, Kristina Bengtsson ;
Isomaa, Bo ;
Lettre, Guillaume ;
Lindblad, Ulf ;
Lyon, Helen N. ;
Melander, Olle ;
Newton-Cheh, Christopher ;
Nilsson, Peter ;
Orho-Melander, Marju ;
Rastam, Lennart ;
Speliotes, Elizabeth K. ;
Taskinen, Marja-Riitta ;
Tuomi, Tiinamaija ;
Guiducci, Candace ;
Berglund, Anna ;
Carlson, Joyce ;
Gianniny, Lauren ;
Hackett, Rachel ;
Hall, Liselotte ;
Holmkvist, Johan ;
Laurila, Esa ;
Sjoegren, Marketa ;
Sterner, Maria ;
Surti, Aarti ;
Svensson, Margareta ;
Svensson, Malin ;
Tewhey, Ryan ;
Blumenstiel, Brendan ;
Parkin, Melissa ;
DeFelice, Matthew ;
Barry, Rachel ;
Brodeur, Wendy ;
Camarata, Jody .
SCIENCE, 2007, 316 (5829) :1331-1336
[49]   A genome-wide association study of type 2 diabetes in Finns detects multiple susceptibility variants [J].
Scott, Laura J. ;
Mohlke, Karen L. ;
Bonnycastle, Lori L. ;
Willer, Cristen J. ;
Li, Yun ;
Duren, William L. ;
Erdos, Michael R. ;
Stringham, Heather M. ;
Chines, Peter S. ;
Jackson, Anne U. ;
Prokunina-Olsson, Ludmila ;
Ding, Chia-Jen ;
Swift, Amy J. ;
Narisu, Narisu ;
Hu, Tianle ;
Pruim, Randall ;
Xiao, Rui ;
Li, Xiao-Yi ;
Conneely, Karen N. ;
Riebow, Nancy L. ;
Sprau, Andrew G. ;
Tong, Maurine ;
White, Peggy P. ;
Hetrick, Kurt N. ;
Barnhart, Michael W. ;
Bark, Craig W. ;
Goldstein, Janet L. ;
Watkins, Lee ;
Xiang, Fang ;
Saramies, Jouko ;
Buchanan, Thomas A. ;
Watanabe, Richard M. ;
Valle, Timo T. ;
Kinnunen, Leena ;
Abecasis, Gonalo R. ;
Pugh, Elizabeth W. ;
Doheny, Kimberly F. ;
Bergman, Richard N. ;
Tuomilehto, Jaakko ;
Collins, Francis S. ;
Boehnke, Michael .
SCIENCE, 2007, 316 (5829) :1341-1345
[50]  
SCOTT LJ, 2007, NATURE, V447, P661