Analysis of East Asia Genetic Substructure Using Genome-Wide SNP Arrays

被引:96
作者
Tian, Chao [1 ]
Kosoy, Roman [1 ]
Lee, Annette [2 ]
Ransom, Michael [1 ]
Belmont, John W. [3 ]
Gregersen, Peter K. [2 ]
Seldin, Michael F. [1 ]
机构
[1] Univ Calif Davis, Dept Biochem, Rowe Program Human Genet, Davis, CA 95616 USA
[2] N Shore LIJ Hlth Syst, Feinstein Inst Med Res, Robert S Boas Ctr Genom & Human Genet, Manhasset, NY USA
[3] Baylor Coll Med, Dept Mol & Human Genet, Houston, TX USA
来源
PLOS ONE | 2008年 / 3卷 / 12期
关键词
D O I
10.1371/journal.pone.0003862
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Accounting for population genetic substructure is important in reducing type 1 errors in genetic studies of complex disease. As efforts to understand complex genetic disease are expanded to different continental populations the understanding of genetic substructure within these continents will be useful in design and execution of association tests. In this study, population differentiation (Fst) and Principal Components Analyses (PCA) are examined using >200 K genotypes from multiple populations of East Asian ancestry. The population groups included those from the Human Genome Diversity Panel [Cambodian, Yi, Daur, Mongolian, Lahu, Dai, Hezhen, Miaozu, Naxi, Oroqen, She, Tu, Tujia, Naxi, Xibo, and Yakut], HapMap [ Han Chinese (CHB) and Japanese (JPT)], and East Asian or East Asian American subjects of Vietnamese, Korean, Filipino and Chinese ancestry. Paired Fst (Wei and Cockerham) showed close relationships between CHB and several large East Asian population groups (CHB/Korean, 0.0019; CHB/JPT, 00651; CHB/Vietnamese, 0.0065) with larger separation with Filipino (CHB/Filipino, 0.014). Low levels of differentiation were also observed between Dai and Vietnamese (0.0045) and between Vietnamese and Cambodian (0.0062). Similarly, small Fst's were observed among different presumed Han Chinese populations originating in different regions of mainland of China and Taiwan (Fst's <0.0025 with CHB). For PCA, the first two PC's showed a pattern of relationships that closely followed the geographic distribution of the different East Asian populations. PCA showed substructure both between different East Asian groups and within the Han Chinese population. These studies have also identified a subset of East Asian substructure ancestry informative markers (EASTASAIMS) that may be useful for future complex genetic disease association studies in reducing type 1 errors and in identifying homogeneous groups that may increase the power of such studies.
引用
收藏
页数:10
相关论文
共 39 条
[21]  
NEI M, 1993, MOL BIOL EVOL, V10, P927
[22]   Discerning the ancestry of European Americans in genetic association studies [J].
Price, Alkes L. ;
Butler, Johannah ;
Patterson, Nick ;
Capelli, Cristian ;
Pascali, Vincenzo L. ;
Scarnicci, Francesca ;
Ruiz-Linares, Andres ;
Groop, Leif ;
Saetta, Angelica A. ;
Korkolopoulou, Penelope ;
Seligsohn, Uri ;
Waliszewska, Alicja ;
Schirmer, Christine ;
Ardlie, Kristin ;
Ramos, Alexis ;
Nemesh, James ;
Arbeitman, Lori ;
Goldstein, David B. ;
Reich, David ;
Hirschhorn, Joel N. .
PLOS GENETICS, 2008, 4 (01) :0009-0017
[23]   Principal components analysis corrects for stratification in genome-wide association studies [J].
Price, Alkes L. ;
Patterson, Nick J. ;
Plenge, Robert M. ;
Weinblatt, Michael E. ;
Shadick, Nancy A. ;
Reich, David .
NATURE GENETICS, 2006, 38 (08) :904-909
[24]  
Pritchard JK, 2000, GENETICS, V155, P945
[25]   PLINK: A tool set for whole-genome association and population-based linkage analyses [J].
Purcell, Shaun ;
Neale, Benjamin ;
Todd-Brown, Kathe ;
Thomas, Lori ;
Ferreira, Manuel A. R. ;
Bender, David ;
Maller, Julian ;
Sklar, Pamela ;
de Bakker, Paul I. W. ;
Daly, Mark J. ;
Sham, Pak C. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2007, 81 (03) :559-575
[26]   Informativeness of genetic markers for inference of ancestry [J].
Rosenberg, NA ;
Li, LM ;
Ward, R ;
Pritchard, JK .
AMERICAN JOURNAL OF HUMAN GENETICS, 2003, 73 (06) :1402-1422
[27]   Application of ancestry informative markers to association studies in European Americans [J].
Seldin, Michael F. ;
Price, Alkes L. .
PLOS GENETICS, 2008, 4 (01)
[28]   European population substructure: Clustering of northern and southern populations [J].
Seldin, Michael F. ;
Shigeta, Russell ;
Villoslada, Pablo ;
Selmi, Carlo ;
Tuomilehto, Jaakko ;
Silva, Gabriel ;
Belmont, John W. ;
Klareskog, Lars ;
Gregersen, Peter K. .
PLOS GENETICS, 2006, 2 (09) :1339-1351
[29]   Y-chromosome evidence of southern origin of the East Asian - Specific haplogroup O3-M122 [J].
Shi, H ;
Dong, YL ;
Wen, B ;
Xiao, CJ ;
Underhill, PA ;
Shen, PD ;
Chakraborty, R ;
Jin, L ;
Su, B .
AMERICAN JOURNAL OF HUMAN GENETICS, 2005, 77 (03) :408-419
[30]   Replication of the genetic effects of IFN regulatory factor 5 (IRF5) on systemic lupus erythematosus in a Korean population [J].
Shin, Hyoung Doo ;
Sung, Yoon-Kyoung ;
Choi, Chan-Bum ;
Lee, Soo Ok ;
Lee, Hye Won ;
Bae, Sang-Cheol .
ARTHRITIS RESEARCH & THERAPY, 2007, 9 (02)