Population structure and eigenanalysis

被引:3575
作者
Patterson, Nick [1 ]
Price, Alkes L.
Reich, David
机构
[1] Broad Inst Harvard & MIT, Cambridge, MA USA
[2] Harvard Univ, Sch Med, Dept Genet, Boston, MA USA
来源
PLOS GENETICS | 2006年 / 2卷 / 12期
基金
英国惠康基金;
关键词
D O I
10.1371/journal.pgen.0020190
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Current methods for inferring population structure from genetic data do not provide formal significance tests for population differentiation. We discuss an approach to studying population structure ( principal components analysis) that was first applied to genetic data by Cavalli-Sforza and colleagues. We place the method on a solid statistical footing, using results from modern statistics to develop formal significance tests. We also uncover a general "phase change'' phenomenon about the ability to detect structure in genetic data, which emerges from the statistical theory we use, and has an important implication for the ability to discover structure in genetic data: for a fixed but large dataset size, divergence between two populations (as measured, for example, by a statistic like F-ST) below a threshold is essentially undetectable, but a little above threshold, detection will be easy. This means that we can predict the dataset size needed to detect structure.
引用
收藏
页码:2074 / 2093
页数:20
相关论文
共 46 条
  • [21] Design and analysis of admixture mapping studies
    Hoggart, CJ
    Shriver, MD
    Kittles, RA
    Clayton, DG
    McKeigue, PM
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2004, 74 (05) : 965 - 978
  • [22] On the distribution of the largest eigenvalue in principal components analysis
    Johnstone, IM
    [J]. ANNALS OF STATISTICS, 2001, 29 (02) : 295 - 327
  • [23] Ethiopia: between Sub-Saharan Africa and Western Eurasia
    Lovell, A
    Moreau, C
    Yotova, V
    Xiao, F
    Bourgeois, S
    Gehl, D
    Bertranpetit, J
    Schurr, E
    Labuda, D
    [J]. ANNALS OF HUMAN GENETICS, 2005, 69 : 275 - 287
  • [25] The effects of human population structure on large genetic association studies
    Marchini, J
    Cardon, LR
    Phillips, MS
    Donnelly, P
    [J]. NATURE GENETICS, 2004, 36 (05) : 512 - 517
  • [26] SYNTHETIC MAPS OF HUMAN GENE-FREQUENCIES IN EUROPEANS
    MENOZZI, P
    PIAZZA, A
    CAVALLISFORZA, L
    [J]. SCIENCE, 1978, 201 (4358) : 786 - 792
  • [27] Statistical tests for admixture mapping with case-control and cases-only data
    Montana, G
    Pritchard, JK
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2004, 75 (05) : 771 - 789
  • [28] Assessing population differentiation and isolation from single-nucleotide polymorphism data
    Nicholson, G
    Smith, AV
    Jónsson, F
    Gústafsson, O
    Stefánsson, K
    Donnelly, P
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2002, 64 : 695 - 715
  • [29] Blocks of limited haplotype diversity revealed by high-resolution scanning of human chromosome 21
    Patil, N
    Berno, AJ
    Hinds, DA
    Barrett, WA
    Doshi, JM
    Hacker, CR
    Kautzer, CR
    Lee, DH
    Marjoribanks, C
    McDonough, DP
    Nguyen, BTN
    Norris, MC
    Sheehan, JB
    Shen, NP
    Stern, D
    Stokowski, RP
    Thomas, DJ
    Trulson, MO
    Vyas, KR
    Frazer, KA
    Fodor, SPA
    Cox, DR
    [J]. SCIENCE, 2001, 294 (5547) : 1719 - 1723
  • [30] Methods for high-density admixture mapping of disease genes
    Patterson, N
    Hattangadi, N
    Lane, B
    Lohmueller, KE
    Hafler, DA
    Oksenberg, JR
    Hauser, SL
    Smith, MW
    O'Brien, SJ
    Altshuler, D
    Daly, MJ
    Reich, D
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2004, 74 (05) : 979 - 1000