Evolutionary constraint facilitates interpretation of genetic variation in resequenced human genomes

被引:66
作者
Goode, David L.
Cooper, Gregory M. [2 ]
Schmutz, Jeremy [1 ,3 ]
Dickson, Mark [1 ]
Gonzales, Eidelyn [1 ]
Tsai, Ming [1 ,3 ]
Karra, Kalpana [4 ]
Davydov, Eugene [5 ]
Batzoglou, Serafim [5 ]
Myers, Richard M. [1 ,3 ]
Sidow, Arend [1 ,4 ]
机构
[1] Stanford Univ, Sch Med, Dept Genet, Stanford Human Genome Ctr, Stanford, CA 94305 USA
[2] Univ Washington, Dept Genome Sci, Seattle, WA 98195 USA
[3] HudsonAlpha Inst Biotechnol, Huntsville, AL 35806 USA
[4] Stanford Univ, Sch Med, Dept Pathol, Stanford, CA 94305 USA
[5] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
关键词
ULTRACONSERVED ELEMENTS; STRUCTURAL VARIATION; WIDE ASSOCIATION; SEQUENCE; MUTATIONS; SELECTION; SUSCEPTIBILITY; 1-PERCENT; SIGNALS; POWER;
D O I
10.1101/gr.102210.109
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
070307 [化学生物学]; 071010 [生物化学与分子生物学];
摘要
Here, we demonstrate how comparative sequence analysis facilitates genome-wide base-pair-level interpretation of individual genetic variation and address two questions of importance for human personal genomics: first, whether an individual's functional variation comes mostly from noncoding or coding polymorphisms; and, second, whether population-specific or globally-present polymorphisms contribute more to functional variation in any given individual. Neither has been definitively answered by analyses of existing variation data because of a focus on coding polymorphisms, ascertainment biases in favor of common variation, and a lack of base-pair-level resolution for identifying functional variants. We resequenced 575 amplicons within 432 individuals at genomic sites enriched for evolutionary constraint and also analyzed variation within three published human genomes. We find that single-site measures of evolutionary constraint derived from mammalian multiple sequence alignments are strongly predictive of reductions in modern-day genetic diversity across a range of annotation categories and across the allele frequency spectrum from rare (<1%) to high frequency (>10% minor allele frequency). Furthermore, we show that putatively functional variation in an individual genome is dominated by polymorphisms that do not change protein sequence and that originate from our shared ancestral population and commonly segregate in human populations. These observations show that common, noncoding alleles contribute substantially to human phenotypes and that constraint-based analyses will be of value to identify phenotypically relevant variants in individual genomes.
引用
收藏
页码:301 / 310
页数:10
相关论文
共 64 条
[1]
Medical sequencing at the extremes of human body mass [J].
Ahituv, Nadav ;
Kavaslar, Nihan ;
Schackwitz, Wendy ;
Ustaszewska, Anna ;
Martin, Joel ;
Hebert, Sybil ;
Doelle, Heather ;
Ersoy, Baran ;
Kryukov, Gregory ;
Schmidt, Steffen ;
Yosef, Nir ;
Ruppin, Eytan ;
Sharan, Roded ;
Vaisse, Christian ;
Sunyaev, Shamil ;
Dent, Robert ;
Cohen, Jonathan ;
McPherson, Ruth ;
Pennacchio, Len A. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2007, 80 (04) :779-791
[2]
Population history and natural selection shape patterns of genetic variation in 132 genes [J].
Akey, JM ;
Eberle, MA ;
Rieder, MJ ;
Carlson, CS ;
Shriver, MD ;
Nickerson, DA ;
Kruglyak, L .
PLOS BIOLOGY, 2004, 2 (10) :1591-1599
[3]
A haplotype map of the human genome [J].
Altshuler, D ;
Brooks, LD ;
Chakravarti, A ;
Collins, FS ;
Daly, MJ ;
Donnelly, P ;
Gibbs, RA ;
Belmont, JW ;
Boudreau, A ;
Leal, SM ;
Hardenbol, P ;
Pasternak, S ;
Wheeler, DA ;
Willis, TD ;
Yu, FL ;
Yang, HM ;
Zeng, CQ ;
Gao, Y ;
Hu, HR ;
Hu, WT ;
Li, CH ;
Lin, W ;
Liu, SQ ;
Pan, H ;
Tang, XL ;
Wang, J ;
Wang, W ;
Yu, J ;
Zhang, B ;
Zhang, QR ;
Zhao, HB ;
Zhao, H ;
Zhou, J ;
Gabriel, SB ;
Barry, R ;
Blumenstiel, B ;
Camargo, A ;
Defelice, M ;
Faggart, M ;
Goyette, M ;
Gupta, S ;
Moore, J ;
Nguyen, H ;
Onofrio, RC ;
Parkin, M ;
Roy, J ;
Stahl, E ;
Winchester, E ;
Ziaugra, L ;
Shen, Y .
NATURE, 2005, 437 (7063) :1299-1320
[4]
Widely distributed noncoding purifying selection in the human genome [J].
Asthana, Saurabh ;
Noble, William S. ;
Kryukov, Gregory ;
Grantt, Charles E. ;
Sunyaev, Shamil ;
Stamatoyannopoulos, John A. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (30) :12410-12415
[5]
Analysis of sequence conservation at nucleotide resolution [J].
Asthana, Saurabh ;
Roytberg, Mikhail ;
Stamatoyannopoulos, John ;
Sunyaev, Shamil .
PLOS COMPUTATIONAL BIOLOGY, 2007, 3 (12) :2559-2568
[6]
Accurate whole human genome sequencing using reversible terminator chemistry [J].
Bentley, David R. ;
Balasubramanian, Shankar ;
Swerdlow, Harold P. ;
Smith, Geoffrey P. ;
Milton, John ;
Brown, Clive G. ;
Hall, Kevin P. ;
Evers, Dirk J. ;
Barnes, Colin L. ;
Bignell, Helen R. ;
Boutell, Jonathan M. ;
Bryant, Jason ;
Carter, Richard J. ;
Cheetham, R. Keira ;
Cox, Anthony J. ;
Ellis, Darren J. ;
Flatbush, Michael R. ;
Gormley, Niall A. ;
Humphray, Sean J. ;
Irving, Leslie J. ;
Karbelashvili, Mirian S. ;
Kirk, Scott M. ;
Li, Heng ;
Liu, Xiaohai ;
Maisinger, Klaus S. ;
Murray, Lisa J. ;
Obradovic, Bojan ;
Ost, Tobias ;
Parkinson, Michael L. ;
Pratt, Mark R. ;
Rasolonjatovo, Isabelle M. J. ;
Reed, Mark T. ;
Rigatti, Roberto ;
Rodighiero, Chiara ;
Ross, Mark T. ;
Sabot, Andrea ;
Sankar, Subramanian V. ;
Scally, Aylwyn ;
Schroth, Gary P. ;
Smith, Mark E. ;
Smith, Vincent P. ;
Spiridou, Anastassia ;
Torrance, Peta E. ;
Tzonev, Svilen S. ;
Vermaas, Eric H. ;
Walter, Klaudia ;
Wu, Xiaolin ;
Zhang, Lu ;
Alam, Mohammed D. ;
Anastasi, Carole .
NATURE, 2008, 456 (7218) :53-59
[7]
Estimating coverage and power for genetic association studies using near-complete variation data [J].
Bhangale, Tushar R. ;
Rieder, Mark J. ;
Nickerson, Deborah A. .
NATURE GENETICS, 2008, 40 (07) :841-843
[8]
Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project [J].
Birney, Ewan ;
Stamatoyannopoulos, John A. ;
Dutta, Anindya ;
Guigo, Roderic ;
Gingeras, Thomas R. ;
Margulies, Elliott H. ;
Weng, Zhiping ;
Snyder, Michael ;
Dermitzakis, Emmanouil T. ;
Stamatoyannopoulos, John A. ;
Thurman, Robert E. ;
Kuehn, Michael S. ;
Taylor, Christopher M. ;
Neph, Shane ;
Koch, Christoph M. ;
Asthana, Saurabh ;
Malhotra, Ankit ;
Adzhubei, Ivan ;
Greenbaum, Jason A. ;
Andrews, Robert M. ;
Flicek, Paul ;
Boyle, Patrick J. ;
Cao, Hua ;
Carter, Nigel P. ;
Clelland, Gayle K. ;
Davis, Sean ;
Day, Nathan ;
Dhami, Pawandeep ;
Dillon, Shane C. ;
Dorschner, Michael O. ;
Fiegler, Heike ;
Giresi, Paul G. ;
Goldy, Jeff ;
Hawrylycz, Michael ;
Haydock, Andrew ;
Humbert, Richard ;
James, Keith D. ;
Johnson, Brett E. ;
Johnson, Ericka M. ;
Frum, Tristan T. ;
Rosenzweig, Elizabeth R. ;
Karnani, Neerja ;
Lee, Kirsten ;
Lefebvre, Gregory C. ;
Navas, Patrick A. ;
Neri, Fidencio ;
Parker, Stephen C. J. ;
Sabo, Peter J. ;
Sandstrom, Richard ;
Shafer, Anthony .
NATURE, 2007, 447 (7146) :799-816
[9]
Aligning multiple genomic sequences with the threaded blockset aligner [J].
Blanchette, M ;
Kent, WJ ;
Riemer, C ;
Elnitski, L ;
Smit, AFA ;
Roskin, KM ;
Baertsch, R ;
Rosenbloom, K ;
Clawson, H ;
Green, ED ;
Haussler, D ;
Miller, W .
GENOME RESEARCH, 2004, 14 (04) :708-715
[10]
Common and rare variants in multifactorial susceptibility to common diseases [J].
Bodmer, Walter ;
Bonilla, Carolina .
NATURE GENETICS, 2008, 40 (06) :695-701