A formal perturbation equation between genotype and phenotype determines the Evolutionary Action of protein-coding variations on fitness

被引:113
作者
Katsonis, Panagiotis [1 ]
Lichtarge, Olivier [1 ,2 ,3 ,4 ]
机构
[1] Baylor Coll Med, Dept Mol & Human Genet, Houston, TX 77030 USA
[2] Baylor Coll Med, Dept Biochem & Mol Biol, Houston, TX 77030 USA
[3] Baylor Coll Med, Dept Pharmacol, Houston, TX 77030 USA
[4] Baylor Coll Med, Computat & Integrat Biomed Res Ctr, Houston, TX 77030 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
SEQUENCE ALIGNMENT; COUPLED RECEPTORS; MUTATION; TRACE; RESIDUES; COMMON; IDENTIFICATION; PREDICTION; DISEASE; DNA;
D O I
10.1101/gr.176214.114
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The relationship between genotype mutations and phenotype variations determines health in the short term and evolution over the long term, and it hinges on the action of mutations on fitness. A fundamental difficulty in determining this action, however, is that it depends on the unique context of each mutation, which is complex and often cryptic. As a result, the effect of most genome variations on molecular function and overall fitness remains unknown and stands apart from population genetics theories linking fitness effect to polymorphism frequency. Here, we hypothesize that evolution is a continuous and differentiable physical process coupling genotype to phenotype. This leads to a formal equation for the action of coding mutations on fitness that can be interpreted as a product of the evolutionary importance of the mutated site with the difference in amino acid similarity. Approximations for these terms are readily computable from phylogenetic sequence analysis, and we show mutational, clinical, and population genetic evidence that this action equation predicts the effect of point mutations in vivo and in vitro in diverse proteins, correlates disease-causing gene mutations with morbidity, and determines the frequency of human coding polymorphisms, respectively. Thus, elementary calculus and phylogenetics can be integrated into a perturbation analysis of the evolutionary relationship between genotype and phenotype that quantitatively links point mutations to function and fitness and that opens a new analytic framework for equations of biology. In practice, this work explicitly bridges molecular evolution with population genetics with applications from protein redesign to the clinical assessment of human genetic variations.
引用
收藏
页码:2050 / 2058
页数:9
相关论文
共 59 条
[1]   Separation of Recombination and SOS Response in Escherichia coli RecA Suggests LexA Interaction Sites [J].
Adikesavan, Anbu K. ;
Katsonis, Panagiotis ;
Marciano, David C. ;
Lua, Rhonald ;
Herman, Christophe ;
Lichtarge, Olivier .
PLOS GENETICS, 2011, 7 (09)
[2]   A method and server for predicting damaging missense mutations [J].
Adzhubei, Ivan A. ;
Schmidt, Steffen ;
Peshkin, Leonid ;
Ramensky, Vasily E. ;
Gerasimova, Anna ;
Bork, Peer ;
Kondrashov, Alexey S. ;
Sunyaev, Shamil R. .
NATURE METHODS, 2010, 7 (04) :248-249
[3]  
[Anonymous], P NATL ACAD SCI
[4]  
[Anonymous], 2012, Nature
[5]  
[Anonymous], 1999, The genetical theory of natural selection: a complete variorum edition
[6]   Common and rare variants in multifactorial susceptibility to common diseases [J].
Bodmer, Walter ;
Bonilla, Carolina .
NATURE GENETICS, 2008, 40 (06) :695-701
[7]   Discovering genotypes underlying human phenotypes: past successes for mendelian disease, future approaches for complex disease [J].
Botstein, D ;
Risch, N .
NATURE GENETICS, 2003, 33 (Suppl 3) :228-237
[8]   Epistasis as the primary factor in molecular evolution [J].
Breen, Michael S. ;
Kemena, Carsten ;
Vlasov, Peter K. ;
Notredame, Cedric ;
Kondrashov, Fyodor A. .
NATURE, 2012, 490 (7421) :535-+
[9]   Genetic diagnosis by whole exome capture and massively parallel DNA sequencing [J].
Choi, Murim ;
Scholl, Ute I. ;
Ji, Weizhen ;
Liu, Tiewen ;
Tikhonova, Irina R. ;
Zumbo, Paul ;
Nayir, Ahmet ;
Bakkaloglu, Aysin ;
Ozen, Seza ;
Sanjad, Sami ;
Nelson-Williams, Carol ;
Farhi, Anita ;
Mane, Shrikant ;
Lifton, Richard P. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (45) :19096-19101
[10]   Basing population genetic inferences and models of molecular evolution upon desired stationary distributions of DNA or protein sequences [J].
Choi, Sang Chul ;
Redelings, Benjamin D. ;
Thorne, Jeffrey L. .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2008, 363 (1512) :3931-3939