An integrated map of genetic variation from 1,092 human genomes

Cited by: 4874
Authors
Altshuler, David M. [3 ]
Durbin, Richard M. [5 ]
Abecasis, Goncalo R. [6 ]
Bentley, David R. [7 ]
Chakravarti, Aravinda [8 ]
Clark, Andrew G. [9 ]
Donnelly, Peter [1 ,2 ]
Eichler, Evan E. [10 ,11 ]
Flicek, Paul [12 ]
Gabriel, Stacey B. [3 ]
Gibbs, Richard A. [13 ]
Green, Eric D.
Hurles, Matthew E. [5 ]
Knoppers, Bartha M. [14 ]
Korbel, Jan O. [15 ]
Lander, Eric S.
Lee, Charles [16 ]
Lehrach, Hans [17 ]
Mardis, Elaine R. [18 ]
Marth, Gabor T. [19 ]
McVean, Gil A. [1 ]
Nickerson, Deborah A. [20 ]
Schmidt, Jeanette P. [21 ]
Sherry, Stephen T. [22 ]
Wang, Jun [23 ]
Wilson, Richard K. [18 ]
Gibbs, Richard A. [13 ]
Dinh, Huyen [13 ]
Kovar, Christie [13 ]
Lee, Sandra [13 ]
Lewis, Lora [13 ]
Muzny, Donna [13 ]
Reid, Jeff [13 ]
Wang, Min [13 ]
Wang, Jun [23 ]
Fang, Xiaodong [23 ]
Guo, Xiaosen [23 ]
Jian, Min [23 ]
Jiang, Hui [23 ]
Jin, Xin [23 ]
Li, Guoqing [23 ]
Li, Jingxiang [23 ]
Li, Yingrui [23 ]
Li, Zhuo [23 ]
Liu, Xiao [23 ]
Lu, Yao [23 ]
Ma, Xuedi [23 ]
Su, Zhe [23 ]
Tai, Shuaishuai [23 ]
Tang, Meifang [23 ]
Affiliations
[1] Univ Oxford, Wellcome Trust Ctr Human Genet, Oxford OX3 7BN, England
[2] Univ Oxford, Dept Stat, Oxford OX1 3TG, England
[3] Broad Inst MIT & Harvard, Cambridge, MA 02142 USA
[4] Harvard Univ, Sch Med, Dept Genet, Cambridge, MA 02142 USA
[5] Wellcome Trust Sanger Inst, Cambridge CB10 1SA, England
[6] Univ Michigan, Ctr Stat Genet, Ann Arbor, MI 48109 USA
[7] Illumina United Kingdom, Near Saffron Walden CB10 1XL, Essex, England
[8] Johns Hopkins Univ, Sch Med, McKusick Nathans Inst Genet Med, Baltimore, MD 21205 USA
[9] Cornell Univ, Ctr Comparat & Populat Genom, Ithaca, NY 14850 USA
[10] Univ Washington, Sch Med, Dept Genome Sci, Seattle, WA 98195 USA
[11] Howard Hughes Med Inst, Seattle, WA 98195 USA
[12] European Bioinformat Inst, Cambridge CB10 1SD, England
[13] Baylor Coll Med, Human Genome Sequencing Ctr, Houston, TX 77030 USA
[14] McGill Univ, Ctr Genom & Policy, Montreal, PQ H3A 1A4, Canada
[15] European Mol Biol Lab, Genome Biol Res Unit, D-69117 Heidelberg, Germany
[16] Brigham & Womens Hosp, Dept Pathol, Boston, MA 02115 USA
[17] Max Planck Inst Mol Genet, D-14195 Berlin, Germany
[18] Washington Univ, Sch Med, Genome Ctr, St Louis, MO 63108 USA
[19] Boston Coll, Dept Biol, Chestnut Hill, MA 02467 USA
[20] Univ Washington, Sch Med, Dept Genome Sci, Seattle, WA 98195 USA
[21] Affymetrix Inc, Santa Clara, CA 95051 USA
[22] US Natl Inst Hlth, Natl Ctr Biotechnol Informat, Bethesda, MD 20892 USA
[23] BGI Shenzhen, Shenzhen 518083, Peoples R China
[24] Alacris Theranost GmbH, D-14195 Berlin, Germany
[25] Albert Einstein Coll Med, Dept Genet, Bronx, NY 10461 USA
[26] Cold Spring Harbor Lab, Cold Spring Harbor, NY 11724 USA
[27] Mt Sinai Sch Med, Seaver Autism Ctr, New York, NY 10029 USA
[28] Dankook Univ, Dept Nanobiomed Sci, Cheonan 330714, South Korea
[29] Dankook Univ, Dept Biol Sci, Cheonan 330714, South Korea
[30] Cornell Univ, Dept Biol Stat & Computat Biol, Ithaca, NY 14853 USA
[31] Harvard Univ, Ctr Syst Biol, Cambridge, MA 02138 USA
[32] Harvard Univ, Dept Organism & Evolutionary Biol, Cambridge, MA 02138 USA
[33] Cardiff Univ, Sch Med, Inst Med Genet, Cardiff CF14 4XN, S Glam, Wales
[34] Illumina Inc, San Diego, CA 92122 USA
[35] Leiden Univ, Med Ctr, Dept Med Stat & Bioinformat, Mol Epidemiol Sect, NL-2333 ZA Leiden, Netherlands
[36] Louisiana State Univ, Dept Biol Sci, Baton Rouge, LA 70803 USA
[37] Massachusetts Gen Hosp, Analyt & Translat Genet Unit, Boston, MA 02114 USA
[38] Penn State Univ, Dept Anthropol, University Pk, PA 16802 USA
[39] Stanford Univ, Dept Genet, Stanford, CA 94305 USA
[40] Ancestry Com, San Francisco, CA 94107 USA
[41] Tel Aviv Univ, Blavatnik Sch Comp Sci, IL-69978 Tel Aviv, Israel
[42] Tel Aviv Univ, Dept Microbiol, IL-69978 Tel Aviv, Israel
[43] Int Comp Sci Inst, Berkeley, CA 94704 USA
[44] Translat Genom Res Inst, Phoenix, AZ 85004 USA
[45] Life Technol, Beverly, MA 01915 USA
[46] Univ Calif Los Angeles, David Geffen Sch of Medicine, Dept Human Genet, Los Angeles, CA 90024 USA
[47] Univ Calif San Diego, Dept Psychiat, La Jolla, CA 92093 USA
[48] Univ Calif San Diego, Dept Cellular & Mol Med, La Jolla, CA 92093 USA
[49] Univ Calif San Diego, Dept Comp Sci, La Jolla, CA 92093 USA
[50] Albert Einstein Coll Med, Dept Epidemiol & Populat Hlth, Bronx, NY 10461 USA
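Funding
Swiss National Science Foundation; UK Biotechnology and Biological Sciences Research Council; Wellcome Trust (UK); UK Medical Research Council; National Natural Science Foundation of China; US National Institutes of Health;
Keywords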
COPY NUMBER VARIATION; WIDE ASSOCIATION; POPULATION-STRUCTURE; RARE; VARIANTS; LOCI; MUTATION; RISK;
DOI
10.1038/nature11632
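Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract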
By characterizing the geographic and functional spectrum of human genetic variation, the 1000 Genomes Project aims to build a resource to help to understand the genetic contribution to disease. Here we describe the genomes of 1,092 individuals from 14 populations, constructed using a combination of low-coverage whole-genome and exome sequencing. By developing methods to integrate information across several algorithms and diverse data sources, we provide a validated haplotype map of 38 million single nucleotide polymorphisms, 1.4 million short insertions and deletions, and more than 14,000 larger deletions. We show that individuals from different populations carry different profiles of rare and common variants, and that low-frequency variants show substantial geographic differentiation, which is further increased by the action of purifying selection. We show that evolutionary conservation and coding consequence are key determinants of the strength of purifying selection, that rare-variant load varies substantially across biological pathways, and that each individual contains hundreds of rare non-coding variants at conserved sites, such as motif-disrupting changes in transcription-factor-binding sites. This resource, which captures up to 98% of accessible single nucleotide polymorphisms at a frequency of 1% in related populations, enables analysis of common and low-frequency variants in individuals from diverse, including admixed, populations.
Pages: 56-65
Page count: 10
Related papers
47 entries in total
[1]   A map of human genome variation from population-scale sequencing [J].
Altshuler, David ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Collins, Francis S. ;
De la Vega, Francisco M. ;
Donnelly, Peter ;
Egholm, Michael ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Knoppers, Bartha M. ;
Lander, Eric S. ;
Lehrach, Hans ;
Mardis, Elaine R. ;
McVean, Gil A. ;
Nickerson, Debbie A. ;
Peltonen, Leena ;
Schafer, Alan J. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Deiros, David ;
Metzker, Mike ;
Muzny, Donna ;
Reid, Jeff ;
Wheeler, David ;
Wang, Jun ;
Li, Jingxiang ;
Jian, Min ;
Li, Guoqing ;
Li, Ruiqiang ;
Liang, Huiqing ;
Tian, Geng ;
Wang, Bo ;
Wang, Jian ;
Wang, Wei ;
Yang, Huanming ;
Zhang, Xiuqing ;
Zheng, Huisong ;
Lander, Eric S. ;
Altshuler, David L. ;
Ambrogio, Lauren ;
Bloom, Toby ;
Cibulskis, Kristian ;
Fennell, Tim J. ;
Gabriel, Stacey B. .
NATURE, 2010, 467 (7319) :1061-1073
[2]  
[Anonymous], AM J MED GENET A
[3]  
[Anonymous], NATURE GENE IN PRESS
[4]   Exome sequencing as a tool for Mendelian disease gene discovery [J].
Bamshad, Michael J. ;
Ng, Sarah B. ;
Bigham, Abigail W. ;
Tabor, Holly K. ;
Emond, Mary J. ;
Nickerson, Deborah A. ;
Shendure, Jay .
NATURE REVIEWS GENETICS, 2011, 12 (11) :745-755
[5]   Integrated genomic analyses of ovarian carcinoma [J].
Bell, D. ;
Berchuck, A. ;
Birrer, M. ;
Chien, J. ;
Cramer, D. W. ;
Dao, F. ;
Dhir, R. ;
DiSaia, P. ;
Gabra, H. ;
Glenn, P. ;
Godwin, A. K. ;
Gross, J. ;
Hartmann, L. ;
Huang, M. ;
Huntsman, D. G. ;
Iacocca, M. ;
Imielinski, M. ;
Kalloger, S. ;
Karlan, B. Y. ;
Levine, D. A. ;
Mills, G. B. ;
Morrison, C. ;
Mutch, D. ;
Olvera, N. ;
Orsulic, S. ;
Park, K. ;
Petrelli, N. ;
Rabeno, B. ;
Rader, J. S. ;
Sikic, B. I. ;
Smith-McCune, K. ;
Sood, A. K. ;
Bowtell, D. ;
Penny, R. ;
Testa, J. R. ;
Chang, K. ;
Dinh, H. H. ;
Drummond, J. A. ;
Fowler, G. ;
Gunaratne, P. ;
Hawes, A. C. ;
Kovar, C. L. ;
Lewis, L. R. ;
Morgan, M. B. ;
Newsham, I. F. ;
Santibanez, J. ;
Reid, J. G. ;
Trevino, L. R. ;
Wu, Y. -Q. ;
Wang, M. .
NATURE, 2011, 474 (7353) :609-615
[6]   Genome-wide patterns of population structure and admixture in West Africans and African Americans [J].
Bryc, Katarzyna ;
Auton, Adam ;
Nelson, Matthew R. ;
Oksenberg, Jorge R. ;
Hauser, Stephen L. ;
Williams, Scott ;
Froment, Alain ;
Bodo, Jean-Marie ;
Wambebe, Charles ;
Tishkoff, Sarah A. ;
Bustamante, Carlos D. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2010, 107 (02) :786-791
[7]   Genetic loci influencing kidney function and chronic kidney disease [J].
Chambers, John C. ;
Zhang, Weihua ;
Lord, Graham M. ;
van der Harst, Pim ;
Lawlor, Debbie A. ;
Sehmi, Joban S. ;
Gale, Daniel P. ;
Wass, Mark N. ;
Ahmadi, Kourosh R. ;
Bakker, Stephan J. L. ;
Beckmann, Jacqui ;
Bilo, Henk J. G. ;
Bochud, Murielle ;
Brown, Morris J. ;
Caulfield, Mark J. ;
Connell, John M. C. ;
Cook, H. Terence ;
Cotlarciuc, Ioana ;
Smith, George Davey ;
de Silva, Ranil ;
Deng, Guohong ;
Devuyst, Olivier ;
Dikkeschei, Lambert D. ;
Dimkovic, Nada ;
Dockrell, Mark ;
Dominiczak, Anna ;
Ebrahim, Shah ;
Eggermann, Thomas ;
Farrall, Martin ;
Ferrucci, Luigi ;
Floege, Jurgen ;
Forouhi, Nita G. ;
Gansevoort, Ron T. ;
Han, Xijin ;
Hedblad, Bo ;
van der Heide, Jaap J. Homan ;
Hepkema, Bouke G. ;
Hernandez-Fuentes, Maria ;
Hypponen, Elina ;
Johnson, Toby ;
de Jong, Paul E. ;
Kleefstra, Nanne ;
Lagou, Vasiliki ;
Lapsley, Marta ;
Li, Yun ;
Loos, Ruth J. F. ;
Luan, Jian'an ;
Luttropp, Karin ;
Marechal, Celine ;
Melander, Olle .
NATURE GENETICS, 2010, 42 (05) :373-375
[8]   Origins and functional impact of copy number variation in the human genome [J].
Conrad, Donald F. ;
Pinto, Dalila ;
Redon, Richard ;
Feuk, Lars ;
Gokcumen, Omer ;
Zhang, Yujun ;
Aerts, Jan ;
Andrews, T. Daniel ;
Barnes, Chris ;
Campbell, Peter ;
Fitzgerald, Tomas ;
Hu, Min ;
Ihm, Chun Hwa ;
Kristiansson, Kati ;
MacArthur, Daniel G. ;
MacDonald, Jeffrey R. ;
Onyiah, Ifejinelo ;
Pang, Andy Wing Chun ;
Robson, Sam ;
Stirrups, Kathy ;
Valsesia, Armand ;
Walter, Klaudia ;
Wei, John ;
Tyler-Smith, Chris ;
Carter, Nigel P. ;
Lee, Charles ;
Scherer, Stephen W. ;
Hurles, Matthew E. .
NATURE, 2010, 464 (7289) :704-712
[9]   Identifying a High Fraction of the Human Genome to be under Selective Constraint Using GERP++ [J].
Davydov, Eugene V. ;
Goode, David L. ;
Sirota, Marina ;
Cooper, Gregory M. ;
Sidow, Arend ;
Batzoglou, Serafim .
PLOS COMPUTATIONAL BIOLOGY, 2010, 6 (12)
[10]   DNase I sensitivity QTLs are a major determinant of human expression variation [J].
Degner, Jacob F. ;
Pai, Athma A. ;
Pique-Regi, Roger ;
Veyrieras, Jean-Baptiste ;
Gaffney, Daniel J. ;
Pickrell, Joseph K. ;
De Leon, Sherryl ;
Michelini, Katelyn ;
Lewellen, Noah ;
Crawford, Gregory E. ;
Stephens, Matthew ;
Gilad, Yoav ;
Pritchard, Jonathan K. .
NATURE, 2012, 482 (7385) :390-394