Design and analysis of admixture mapping studies

被引:244
作者
Hoggart, CJ
Shriver, MD
Kittles, RA
Clayton, DG
McKeigue, PM
机构
[1] Univ London London Sch Hyg & Trop Med, Noncommunicable Dis Epidemiol Unit, London WC1E 7HT, England
[2] Penn State Univ, Dept Anthropol, University Pk, PA 16802 USA
[3] Howard Univ, Natl Human Genome Ctr, Washington, DC 20059 USA
[4] Univ Cambridge, Cambridge Inst Med REs, Juvenile Diavet Res Fdn, Wellcome Trust Diabet & Inflammat Lab, Cambridge, England
[5] Univ Coll Dublin, Conway Inst, Dublin 2, Ireland
关键词
D O I
10.1086/420855
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Admixture between populations originating on different continents can be exploited to detect disease susceptibility loci at which risk alleles are distributed differentially between these populations. We first examine the statistical power and mapping resolution of this approach in the limiting situation in which gamete admixture and locus ancestry are measured without uncertainty. We show that, for a rare disease, the most efficient design is to study affected individuals only. In a typical African American population (two-way admixture proportions 0.8/0.2, ancestry crossover rate 2 per 100 cM), a study of 800 affected individuals has 90% power to detect at P values <10(-5) a locus that generates a risk ratio of 2 between populations, with an expected mapping resolution (size of 95% confidence region for the position of the locus) of 4 cM. In practice, to infer locus ancestry from marker data requires Bayesian computationally intensive methods, as implemented in the program ADMIXMAP. Affected-only study designs require strong prior information on the frequencies of each allele given locus ancestry. We show how data from unadmixed and admixed populations can be combined to estimate these ancestry-specific allele frequencies within the admixed population under study, allowing for variation between allele frequencies in unadmixed and admixed populations. Using simulated data based on the genetic structure of the African American population, we show that 60% of information can be extracted in a test for linkage using markers with an ancestry information content of 36% at 3-cM spacing. As in classic linkage studies, the most efficient strategy is to use markers at a moderate density for an initial genome search and then to saturate regions of putative linkage with additional markers, to extract nearly all information about locus ancestry.
引用
收藏
页码:965 / 978
页数:14
相关论文
共 28 条
  • [1] [Anonymous], APPL STAT, DOI DOI 10.2307/2347565
  • [2] [Anonymous], ANN EUGENICS
  • [3] Cavalli-Sforza L. L., 1994, HIST GEOGRAPHY HUMAN
  • [4] ADMIXTURE AS A TOOL FOR FINDING LINKED GENES AND DETECTING THAT DIFFERENCE FROM ALLELIC ASSOCIATION BETWEEN LOCI
    CHAKRABORTY, R
    WEISS, KM
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1988, 85 (23) : 9119 - 9123
  • [5] Analysis of multilocus models of association
    Devlin, B
    Roeder, K
    Wasserman, L
    [J]. GENETIC EPIDEMIOLOGY, 2003, 25 (01) : 36 - 47
  • [6] Falush D, 2003, GENETICS, V164, P1567
  • [7] Control of confounding of genetic associations in stratified populations
    Hoggart, CJ
    Parra, EJ
    Shriver, MD
    Bonilla, C
    Kittles, RA
    Clayton, DG
    McKeigue, PM
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2003, 72 (06) : 1492 - 1504
  • [8] Jeffreys H., 1998, The Theory of Probability
  • [9] Bayesian analysis of multilocus association in quantitative and qualitative traits
    Kilpikari, R
    Sillanpää, MJ
    [J]. GENETIC EPIDEMIOLOGY, 2003, 25 (02) : 122 - 135
  • [10] CYP3A4-V and prostate cancer in African Americans: causal or confounding association because of population stratification?
    Kittles, RA
    Chen, WD
    Panguluri, RK
    Ahaghotu, C
    Jackson, A
    Adebamowo, CA
    Griffin, R
    Williams, T
    Ukoli, F
    Adams-Campbell, L
    Kwagyan, J
    Isaacs, W
    Freeman, V
    Dunston, GM
    [J]. HUMAN GENETICS, 2002, 110 (06) : 553 - 560