Estimating local ancestry in admixed populations

被引:242
作者
Sankararaman, Sriram [3 ]
Sridhar, Srinath [2 ]
Kimmel, Gad [3 ]
Halperin, Eran [1 ]
机构
[1] Int Comp Sci Inst, Berkeley, CA 94704 USA
[2] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA
[3] Univ Calif Berkeley, Dept Comp Sci, Berkeley, CA 94720 USA
关键词
D O I
10.1016/j.ajhg.2007.09.022
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Large-scale genotyping of SNPs has shown a great promise in identifying markers that could be linked to diseases. One of the major obstacles involved in performing these studies is that the underlying population substructure could produce spurious associations. Population substructure can be caused by the presence of two distinct subpopulations or a single pool of admixed individuals. In this work, we focus on the latter, which is significantly harder to detect in practice. New advances in this research direction are expected to play a key role in identifying loci that are different among different populations and are still associated with a disease. We evaluated current methods for inference of population substructure in such cases and show that they might be quite inaccurate even in relatively simple scenarios. We therefore introduce a new method, LAMP (Local Ancestry in adMixed Populations), which infers the ancestry of each individual at every single-nucleotide polymorphism (SNP). LAMP computes the ancestry structure for overlapping windows of contiguous SNPs and combines the results with a majority vote. Our empirical results show that LAMP is significantly more accurate and more efficient than existing methods for inferrring locus-specific ancestries, enabling it to handle large-scale datasets. We further show that LAMP can be used to estimate the individual admixture of each individual. Our experimental evaluation indicates that this extension yields a considerably more accurate estimate of individual admixture than state-of-the-art methods such as STRUCTURE or EIGENSTRAT, which are frequently used for the correction of population stratification in association studies.
引用
收藏
页码:290 / 303
页数:14
相关论文
共 37 条
[1]  
BESAG J, 1986, J R STAT SOC B, V48, P259
[2]   Evaluating potential for whole-genome studies in Kosrae, an isolated population in Micronesia [J].
Bonnen, PE ;
Pe'er, I ;
Plenge, RM ;
Salit, J ;
Lowe, JK ;
Shapero, MH ;
Lifton, RP ;
Breslow, JL ;
Daly, MJ ;
Reich, DE ;
Jones, KW ;
Stoffel, M ;
Altshuler, D ;
Friedman, JM .
NATURE GENETICS, 2006, 38 (02) :214-217
[3]   Demonstrating stratification in a European American population [J].
Campbell, CD ;
Ogburn, EL ;
Lunetta, KL ;
Lyon, HN ;
Freedman, ML ;
Groop, LC ;
Altshuler, D ;
Ardlie, KG ;
Hirschhorn, JN .
NATURE GENETICS, 2005, 37 (08) :868-872
[4]   A MEASURE OF ASYMPTOTIC EFFICIENCY FOR TESTS OF A HYPOTHESIS BASED ON THE SUM OF OBSERVATIONS [J].
CHERNOFF, H .
ANNALS OF MATHEMATICAL STATISTICS, 1952, 23 (04) :493-507
[5]   Population structure, differential bias and genomic control in a large-scale, case-control association study [J].
Clayton, DG ;
Walker, NM ;
Smyth, DJ ;
Pask, R ;
Cooper, JD ;
Maier, LM ;
Smink, LJ ;
Lam, AC ;
Ovington, NR ;
Stevens, HE ;
Nutland, S ;
Howson, JMM ;
Faham, M ;
Moorhead, M ;
Jones, HB ;
Falkowski, M ;
Hardenbol, P ;
Willis, TD ;
Todd, JA .
NATURE GENETICS, 2005, 37 (11) :1243-1246
[6]   Markers informative for ancestry demonstrate consistent megabase-length linkage disequilibrium in the African American population [J].
Collins-Schramm, HE ;
Chima, B ;
Operario, DJ ;
Criswell, LA ;
Seldin, MF .
HUMAN GENETICS, 2003, 113 (03) :211-219
[7]   Genomic control for association studies [J].
Devlin, B ;
Roeder, K .
BIOMETRICS, 1999, 55 (04) :997-1004
[8]  
Falush D, 2003, GENETICS, V164, P1567
[9]   Assessing the impact of population stratification on genetic association studies [J].
Freedman, ML ;
Reich, D ;
Penney, KL ;
McDonald, GJ ;
Mignault, AA ;
Patterson, N ;
Gabriel, SB ;
Topol, EJ ;
Smoller, JW ;
Pato, CN ;
Pato, MT ;
Petryshen, TYL ;
Kolonel, LN ;
Lander, ES ;
Sklar, P ;
Henderson, B ;
Hirschhorn, JN ;
Altshuler, D .
NATURE GENETICS, 2004, 36 (04) :388-393
[10]  
Haldane JBS, 1919, J GENET, V8, P299