Genetic Structure of the Han Chinese Population Revealed by Genome-wide SNP Variation

Cited by: 281
Authors
Chen, Jieming [1]
Zheng, Houfeng [3, 4, 5, 6]
Bei, Jin-Xin [7, 8]
Sun, Liangdan [3, 4, 5, 6]
Jia, Wei-hua [7, 8]
Li, Tao [9, 10, 11]
Zhang, Furen [12]
Seielstad, Mark [1, 2, 13]
Zeng, Yi-Xin [7, 8]
Zhang, Xuejun [3, 4, 5, 6]
Liu, Jianjun [1, 2, 3, 4, 6]
Affiliations
[1] Genome Inst Singapore, Singapore 138672, Singapore
[2] Natl Univ Singapore, Ctr Mol Epidemiol, Yong Loo Lin Sch Med, Singapore 117597, Singapore
[3] Anhui Med Univ, Inst Dermatol, Hefei 230032, Anhui, Peoples R China
[4] Anhui Med Univ, Dept Dermatol, Hosp 1, Hefei 230032, Anhui, Peoples R China
[5] Anhui Med Univ, Dept Dermatol & Venereol, Hefei 230032, Anhui, Peoples R China
[6] Minist Educ & Anhui Prov, Key Lab Gene Resource Utilizat Severe Dis, Hefei 230032, Anhui, Peoples R China
[7] State Key Lab Oncol So China, Guangzhou 510060, Guangdong, Peoples R China
[8] Sun Yat Sen Univ, Ctr Canc, Dept Expt Res, Guangzhou 510060, Guangdong, Peoples R China
[9] Sichuan Univ, W China Hosp, Dept Psychiat, Chengdu 610041, Sichuan, Peoples R China
[10] Sichuan Univ, W China Hosp, State Key Lab Biotherapy, Psychiat Lab, Chengdu 610041, Sichuan, Peoples R China
[11] Kings Coll London, Dept Psychol Med & Psychiat, Inst Psychiat, London SE5 8AF, England
[12] Shandong Acad Med Sci, Shandong Prov Inst Dermatol & Venereol, Jinan 250022, Shandong, Peoples R China
[13] Harvard Univ, Sch Publ Hlth, Dept Epidemiol, Boston, MA 02115 USA
Funding
National Natural Science Foundation of China
Keywords
MULTILOCUS GENOTYPE DATA; EAST-ASIA; ASSOCIATION; HISTORY; SUBSTRUCTURE; HAPLOTYPE; INFERENCE; PROGRAM; CLUSTER; DNA;
DOI
10.1016/j.ajhg.2009.10.016
Chinese Library Classification number
Q3 [Genetics]
Discipline classification code
071007; 090102
Abstract
Population stratification is a potential problem for genome-wide association studies (GWAS), confounding results and causing spurious associations. Hence, understanding how allele frequencies vary across geographic regions or among subpopulations is an important prelude to analyzing GWAS data. Using over 350,000 genome-wide autosomal SNPs in over 6000 Han Chinese samples from ten provinces of China, our study revealed a one-dimensional "north-south" population structure and a close correlation between geography and the genetic structure of the Han Chinese. The north-south population structure is consistent with the historical migration pattern of the Han Chinese population. Metropolitan cities in China were, however, more diffuse "outliers," probably because of the impact of modern migration. At a very local scale within Guangdong province, we observed evidence of population structure among dialect groups, probably on account of endogamy within these groups. Via simulation, we show that empirical levels of population structure observed across modern China can cause spurious associations in GWAS if not properly handled. In the Han Chinese, geographic matching is a good proxy for genetic matching, particularly in validation and candidate-gene studies in which population stratification cannot be directly assessed and accounted for because of the lack of genome-wide data. The exception is the metropolitan cities, where geographical location is no longer a good indicator of ancestral origin. Our findings are important for designing GWAS in the Chinese population, an activity that is expected to intensify greatly in the near future.
Pages: 775-785
Page count: 11