Population Structure and Cryptic Relatedness in Genetic Association Studies

被引:304
作者
Astle, William [1 ]
Balding, David J. [1 ]
机构
[1] Univ London Imperial Coll Sci Technol & Med, Ctr Biostat, Dept Epidemiol & Publ Hlth, London W2 1PG, England
基金
英国医学研究理事会;
关键词
Cryptic relatedness; genomic control; kinship; mixed model; complex disease genetics; ascertainment; GENOMIC CONTROL; LINKAGE DISEQUILIBRIUM; DIABETES-MELLITUS; MIXED-MODEL; STRATIFICATION; INFERENCE; LOCI; BIAS; REGRESSION; IDENTITY;
D O I
10.1214/09-STS307
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We review the problem of confounding in genetic association studies, which arises principally because of population structure and cryptic relatedness. Many treatments of the problem consider only a simple "island" model of population structure. We take a broader approach, which views population structure and cryptic relatedness as different aspects of a single confounder: the unobserved pedigree defining the (often distant) relationships among the study subjects. Kinship is therefore a central concept, and we review methods of defining and estimating kinship coefficients, both pedigree-based and marker-based. In this unified framework we review solutions to the problem of population structure, including family-based study designs, genomic control, structured association, regression control, principal components adjustment and linear mixed models. The last solution makes the most explicit use of the kinships among the study subjects, and has an established role in the analysis of animal and plant breeding studies. Recent computational developments mean that analyses of human genetic association data are beginning to benefit from its powerful tests for association, which protect against population structure and cryptic kinship, as well as intermediate levels of confounding by the pedigree.
引用
收藏
页码:451 / 471
页数:21
相关论文
共 81 条
  • [1] Agresti A., 2013, Categorical data analysis, V341, P384
  • [2] Genetic Mapping in Human Disease
    Altshuler, David
    Daly, Mark J.
    Lander, Eric S.
    [J]. SCIENCE, 2008, 322 (5903) : 881 - 888
  • [3] Genomewide rapid association using mixed model and regression: A fast and simple method for genomewide pedigree-based quantitative trait loci association analysis
    Aulchenko, Yurii S.
    de Koning, Dirk-Jan
    Haley, Chris
    [J]. GENETICS, 2007, 177 (01) : 577 - 585
  • [4] The power of genomic control
    Bacanu, SA
    Devlin, B
    Roeder, K
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2000, 66 (06) : 1933 - 1944
  • [5] Likelihood-based inference for genetic correlation coefficients
    Balding, DJ
    [J]. THEORETICAL POPULATION BIOLOGY, 2003, 63 (03) : 221 - 230
  • [6] A METHOD FOR QUANTIFYING DIFFERENTIATION BETWEEN POPULATIONS AT MULTI-ALLELIC LOCI AND ITS IMPLICATIONS FOR INVESTIGATING IDENTITY AND PATERNITY
    BALDING, DJ
    NICHOLS, RA
    [J]. GENETICA, 1995, 96 (1-2) : 3 - 12
  • [7] Accurate inference of relationships in sib-pair linkage studies
    Boehnke, M
    Cox, NJ
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 1997, 61 (02) : 423 - 429
  • [8] Novel case-control test in a founder population identifies P-selectin as an atopy-susceptibility locus
    Bourgain, C
    Hoffjan, S
    Nicolae, R
    Newman, D
    Steiner, L
    Walker, K
    Reynolds, R
    Ober, C
    McPeek, MS
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2003, 73 (03) : 612 - 626
  • [9] Estimation of pairwise identity by descent from dense genetic marker data in a population sample of haplotypes
    Browning, Sharon R.
    [J]. GENETICS, 2008, 178 (04) : 2123 - 2132
  • [10] Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls
    Burton, Paul R.
    Clayton, David G.
    Cardon, Lon R.
    Craddock, Nick
    Deloukas, Panos
    Duncanson, Audrey
    Kwiatkowski, Dominic P.
    McCarthy, Mark I.
    Ouwehand, Willem H.
    Samani, Nilesh J.
    Todd, John A.
    Donnelly, Peter
    Barrett, Jeffrey C.
    Davison, Dan
    Easton, Doug
    Evans, David
    Leung, Hin-Tak
    Marchini, Jonathan L.
    Morris, Andrew P.
    Spencer, Chris C. A.
    Tobin, Martin D.
    Attwood, Antony P.
    Boorman, James P.
    Cant, Barbara
    Everson, Ursula
    Hussey, Judith M.
    Jolley, Jennifer D.
    Knight, Alexandra S.
    Koch, Kerstin
    Meech, Elizabeth
    Nutland, Sarah
    Prowse, Christopher V.
    Stevens, Helen E.
    Taylor, Niall C.
    Walters, Graham R.
    Walker, Neil M.
    Watkins, Nicholas A.
    Winzer, Thilo
    Jones, Richard W.
    McArdle, Wendy L.
    Ring, Susan M.
    Strachan, David P.
    Pembrey, Marcus
    Breen, Gerome
    St Clair, David
    Caesar, Sian
    Gordon-Smith, Katherine
    Jones, Lisa
    Fraser, Christine
    Green, Elain K.
    [J]. NATURE, 2007, 447 (7145) : 661 - 678