Demographic history and rare allele sharing among human populations

被引:432
作者
Gravel, Simon [1 ]
Henn, Brenna M. [1 ]
Gutenkunst, Ryan N. [2 ]
Indap, Amit R. [3 ]
Marth, Gabor T. [3 ]
Clark, Andrew G. [4 ]
Yu, Fuli [5 ]
Gibbs, Richard A. [5 ]
Bustamante, Carlos D. [1 ]
机构
[1] Stanford Univ, Sch Med, Dept Genet, Stanford, CA 94305 USA
[2] Univ Arizona, Dept Mol & Cellular Biol, Tucson, AZ 85721 USA
[3] Boston Coll, Dept Biol, Chestnut Hill, MA 02467 USA
[4] Cornell Univ, Dept Mol Biol & Genet, Ithaca, NY 14853 USA
[5] Baylor Coll Med, Human Genome Sequencing Ctr, Houston, TX 77030 USA
基金
英国医学研究理事会; 英国惠康基金;
关键词
demographic inference; genetic drift; population genetics; human evolution; CAPTURE PROBABILITIES VARY; FREQUENCY-SPECTRUM; HUMAN GENOME; DIVERGENCE; SELECTION; VARIANTS; ANIMALS; NUMBER; SIZES; RATES;
D O I
10.1073/pnas.1019276108
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
High-throughput sequencing technology enables population-level surveys of human genomic variation. Here, we examine the joint allele frequency distributions across continental human populations and present an approach for combining complementary aspects of whole-genome, low-coverage data and targeted high-coverage data. We apply this approach to data generated by the pilot phase of the Thousand Genomes Project, including whole-genome 2-4x coverage data for 179 samples from HapMap European, Asian, and African panels as well as high-coverage target sequencing of the exons of 800 genes from 697 individuals in seven populations. We use the site frequency spectra obtained from these data to infer demographic parameters for an Out-of-Africa model for populations of African, European, and Asian descent and to predict, by a jackknife-based approach, the amount of genetic diversity that will be discovered as sample sizes are increased. We predict that the number of discovered nonsynonymous coding variants will reach 100,000 in each population after similar to 1,000 sequenced chromosomes per population, whereas similar to 2,500 chromosomes will be needed for the same number of synonymous variants. Beyond this point, the number of segregating sites in the European and Asian panel populations is expected to overcome that of the African panel because of faster recent population growth. Overall, we find that the majority of human genomic variable sites are rare and exhibit little sharing among diverged populations. Our results emphasize that replication of disease association for specific rare genetic variants across diverged populations must overcome both reduced statistical power because of rarity and higher population divergence.
引用
收藏
页码:11983 / 11988
页数:6
相关论文
共 22 条
[1]   Maximum-likelihood estimation of demographic parameters using the frequency spectrum of unlinked single-nucleotide polymorphisms [J].
Adams, AM ;
Hudson, RR .
GENETICS, 2004, 168 (03) :1699-1712
[2]   A map of human genome variation from population-scale sequencing [J].
Altshuler, David ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Collins, Francis S. ;
De la Vega, Francisco M. ;
Donnelly, Peter ;
Egholm, Michael ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Knoppers, Bartha M. ;
Lander, Eric S. ;
Lehrach, Hans ;
Mardis, Elaine R. ;
McVean, Gil A. ;
Nickerson, DebbieA. ;
Peltonen, Leena ;
Schafer, Alan J. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Deiros, David ;
Metzker, Mike ;
Muzny, Donna ;
Reid, Jeff ;
Wheeler, David ;
Wang, Jun ;
Li, Jingxiang ;
Jian, Min ;
Li, Guoqing ;
Li, Ruiqiang ;
Liang, Huiqing ;
Tian, Geng ;
Wang, Bo ;
Wang, Jian ;
Wang, Wei ;
Yang, Huanming ;
Zhang, Xiuqing ;
Zheng, Huisong ;
Lander, Eric S. ;
Altshuler, David L. ;
Ambrogio, Lauren ;
Bloom, Toby ;
Cibulskis, Kristian ;
Fennell, Tim J. ;
Gabriel, Stacey B. .
NATURE, 2010, 467 (7319) :1061-1073
[3]  
[Anonymous], 1999, The Human Career: Human Biological and Cultural Origins
[5]   Assessing the evolutionary impact of amino acid mutations in the human genome [J].
Boyko, Adam R. ;
Williamson, Scott H. ;
Indap, Amit R. ;
Degenhardt, Jeremiah D. ;
Hernandez, Ryan D. ;
Lohmueller, Kirk E. ;
Adams, Mark D. ;
Schmidt, Steffen ;
Sninsky, John J. ;
Sunyaev, Shamil R. ;
White, Thomas J. ;
Nielsen, Rasmus ;
Clark, Andrew G. ;
Bustamante, Carlos D. .
PLOS GENETICS, 2008, 4 (05)
[6]   ESTIMATING THE NUMBER OF SPECIES - A REVIEW [J].
BUNGE, J ;
FITZPATRICK, M .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1993, 88 (421) :364-373
[7]  
BURNHAM KP, 1978, BIOMETRIKA, V65, P625, DOI 10.1093/biomet/65.3.625
[8]   ROBUST ESTIMATION OF POPULATION-SIZE WHEN CAPTURE PROBABILITIES VARY AMONG ANIMALS [J].
BURNHAM, KP ;
OVERTON, WS .
ECOLOGY, 1979, 60 (05) :927-936
[9]  
Bustamante CD, 2001, GENETICS, V159, P1779
[10]   Deep resequencing reveals excess rare recent variants consistent with explosive population growth [J].
Coventry, Alex ;
Bull-Otterson, Lara M. ;
Liu, Xiaoming ;
Clark, Andrew G. ;
Maxwell, Taylor J. ;
Crosby, Jacy ;
Hixson, James E. ;
Rea, Thomas J. ;
Muzny, Donna M. ;
Lewis, Lora R. ;
Wheeler, David A. ;
Sabo, Aniko ;
Lusk, Christine ;
Weiss, Kenneth G. ;
Akbar, Humeira ;
Cree, Andrew ;
Hawes, Alicia C. ;
Newsham, Irene ;
Varghese, Robin T. ;
Villasana, Donna ;
Gross, Shannon ;
Joshi, Vandita ;
Santibanez, Jireh ;
Morgan, Margaret ;
Chang, Kyle ;
Hale, Walker ;
Templeton, Alan R. ;
Boerwinkle, Eric ;
Gibbs, Richard ;
Sing, Charles F. .
NATURE COMMUNICATIONS, 2010, 1