Inferring Geographic Coordinates of Origin for Europeans Using Small Panels of Ancestry Informative Markers

Cited by: 31
Authors
Drineas, Petros [1]
Lewis, Jamey [1]
Paschou, Peristera [2]
Affiliations
[1] Rensselaer Polytech Inst, Dept Comp Sci, Troy, NY 12180 USA
[2] Democritus Univ Thrace, Dept Mol Biol & Genet, Alexandroupolis, Greece
Source
PLOS ONE | 2010 / Volume 5 / Issue 8
Funding
National Science Foundation (USA)
Keywords
GENETIC SUBSTRUCTURE; POPULATION-STRUCTURE; ADMIXTURE; DISEASE; GENOME; ASSOCIATION; STRATIFICATION; SELECTION; PATTERNS; LINKAGE
DOI
10.1371/journal.pone.0011892
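Chinese Library Classification (CLC) numbers
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences]
Subject classification codes
07; 0710; 09
Abstract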
Recent large-scale studies of European populations have demonstrated the existence of population genetic structure within Europe and the potential to accurately infer individual ancestry when information from hundreds of thousands of genetic markers is used. In fact, when genomewide genetic variation of European populations is projected down to a two-dimensional Principal Components Analysis plot, a surprising correlation with actual geographic coordinates of self-reported ancestry has been reported. This substructure can hamper the search of susceptibility genes for common complex disorders leading to spurious correlations. The identification of genetic markers that can correct for population stratification becomes therefore of paramount importance. Analyzing 1,200 individuals from 11 populations genotyped for more than 500,000 SNPs (Population Reference Sample), we present a systematic exploration of the extent to which geographic coordinates of origin within Europe can be predicted, with small panels of SNPs. Markers are selected to correlate with the top principal components of the dataset, as we have previously demonstrated. Performing thorough cross-validation experiments we show that it is indeed possible to predict individual ancestry within Europe down to a few hundred kilometers from actual individual origin, using information from carefully selected panels of 500 or 1,000 SNPs. Furthermore, we show that these panels can be used to correctly assign the HapMap Phase 3 European populations to their geographic origin. The SNPs that we propose can prove extremely useful in a variety of different settings, such as stratification correction or genetic ancestry testing, and the study of the history of European populations.
Pages: 6