AWclust: point-and-click software for non-parametric population structure analysis

Cited by: 66
Authors
Gao, Xiaoyi [1]
Starmer, Joshua D. [2, 3]
Affiliations
[1] Univ Miami, Miller Sch Med, Miami Inst Human Genom, Miami, FL 33136 USA
[2] Univ N Carolina, Dept Genet, Chapel Hill, NC 27599 USA
[3] Univ N Carolina, Curriculum Toxicol, Chapel Hill, NC 27599 USA
Keywords
DOI
10.1186/1471-2105-9-77
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification codes
071010; 081704;
Abstract
Background: Population structure analysis is important to genetic association studies and evolutionary investigations. Parametric approaches, e.g. STRUCTURE and L-POP, usually assume Hardy-Weinberg equilibrium (HWE) and linkage equilibrium among loci in the sampled individuals. However, these assumptions may not hold, and allele frequency estimates may be inaccurate in some data sets. The improved version of STRUCTURE (version 2.1) can incorporate linkage information among loci but is still sensitive to high background linkage disequilibrium. Large-scale single nucleotide polymorphism (SNP) data sets are now becoming common in genetic studies. It is therefore important to have software that makes full use of these data to draw inferences even when model assumptions do not hold or allele frequency estimation suffers from high variation.

Results: We have developed point-and-click software for non-parametric population structure analysis, distributed as an R package. The software takes advantage of the large number of SNPs available to categorize individuals into ethnically similar clusters. It does not require assumptions about population models, nor does it estimate allele frequencies. The software can also infer the optimal number of populations.

Conclusion: Our software tool employs non-parametric approaches to assign individuals to clusters using SNPs. It provides efficient computation and an intuitive way for researchers to explore ethnic relationships among individuals. It can complement parametric approaches in population structure analysis.
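
To make the idea of frequency-free, distance-based assignment concrete, here is a minimal sketch in Python. It is not the AWclust R package or its API. The abstract does not name the distance measure or the linkage rule, so both are assumptions: the sketch uses an allele-sharing style distance on genotypes coded 0/1/2 and plain average-linkage agglomerative clustering as a stand-in. The function names `allele_sharing_distance` and `cluster_individuals` and the toy genotypes are hypothetical.

```python
from itertools import combinations


def allele_sharing_distance(g1, g2):
    """Distance in [0, 1] between two genotype vectors coded 0/1/2.

    Assumption: each entry counts copies of a reference allele at one SNP,
    and None marks a missing call. Loci missing in either individual are skipped.
    """
    pairs = [(a, b) for a, b in zip(g1, g2) if a is not None and b is not None]
    if not pairs:
        raise ValueError("no shared non-missing loci")
    # |a - b| is the number of alleles NOT shared at a locus (0, 1 or 2).
    return sum(abs(a - b) for a, b in pairs) / (2 * len(pairs))


def cluster_individuals(genotypes, k):
    """Merge individuals bottom-up until k clusters remain.

    Assumption: average linkage is used here only for brevity; no allele
    frequencies or population model are estimated at any point.
    """
    n = len(genotypes)
    dist = {
        (i, j): allele_sharing_distance(genotypes[i], genotypes[j])
        for i, j in combinations(range(n), 2)
    }
    clusters = [[i] for i in range(n)]

    def linkage(c1, c2):
        # Mean pairwise distance between members of the two clusters.
        total = sum(dist[min(i, j), max(i, j)] for i in c1 for j in c2)
        return total / (len(c1) * len(c2))

    while len(clusters) > k:
        # Find the closest pair of clusters (a < b by construction).
        a, b = min(
            combinations(range(len(clusters)), 2),
            key=lambda p: linkage(clusters[p[0]], clusters[p[1]]),
        )
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return [sorted(c) for c in clusters]


# Toy data: six individuals, six SNPs (purely illustrative, not real genotypes).
toy_genotypes = [
    [0, 0, 1, 0, 0, 1],
    [0, 1, 0, 0, 0, 0],
    [0, 0, 0, 1, 0, 0],
    [2, 2, 1, 2, 2, 2],
    [2, 1, 2, 2, 2, 1],
    [2, 2, 2, 2, 1, 2],
]

if __name__ == "__main__":
    print(cluster_individuals(toy_genotypes, k=2))
```

The sketch takes the number of clusters `k` as an input. Inferring the optimal number of populations, which the abstract says the software can do, is not covered here.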
Pages: 6