Looking at structure, stability, and evolution of proteins through the principal eigenvector of contact matrices and hydrophobicity profiles

被引:17
作者
Bastolla, U
Porto, M
Roman, HE
Vendrusolo, M
机构
[1] CSIC, INTA, Ctr Astrobiol, E-28850 Torrejon de Ardoz, Madrid, Spain
[2] Tech Univ Darmstadt, Inst Festkorperphys, D-64289 Darmstadt, Germany
[3] Univ Milan, Ist Nazl Fis Nucl, Dipartimento Fis, I-20133 Milan, Italy
[4] Univ Cambridge, Dept Chem, Cambridge CB2 1EW, England
关键词
protein evolution; protein folding; maximum likelihood; site-specific amino acid distributions; principal eigenvector of the contact matrix; hydrophobicity vector; interactivity;
D O I
10.1016/j.gene.2004.12.015
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
We review and further develop an analytical model that describes how thermodynamic constraints on the stability of the native state influence protein evolution in a site-specific manner. To this end, we represent both protein sequences and protein structures as vectors: structures are represented by the principal eigenvector (PE) of the protein contact matrix, a quantity that resembles closely the effective connectivity of each site; sequences are represented through the "interactivity" of each amino acid type, using novel parameters that are correlated with hydropathy scales. These interactivity parameters are more strongly correlated than the other hydropathy scales that we examine with: (1) the change upon Mutations of the unfolding free energy of proteins with two-states thermodynamics; (2) genomic properties as the genome-size and the genome-wide GC content; (3) the main eigenvectors of the substitution matrices. The evolutionary average of the interactivity vector correlates very strongly with the PE of a protein structure. Using this result, we derive an analytic expression for site-specific distributions of amino acids across protein families in the form of Boltzmann distributions whose "inverse temperature" is a function of the PE component. We show that our predictions are in agreement with site-specific amino acid distributions obtained from the Protein Data Bank, and we determine the mutational model that best fits the observed site-specific amino acid distributions. Interestingly, the optimal model almost minimizes the rate at which deleterious mutations are eliminated by natural selection. (c) 2004 Elsevier B.V All rights reserved.
引用
收藏
页码:219 / 230
页数:12
相关论文
共 67 条
[1]   FREE-ENERGY LANDSCAPE FOR PROTEIN-FOLDING KINETICS - INTERMEDIATES, TRAPS, AND MULTIPLE PATHWAYS IN THEORY AND LATTICE MODEL SIMULATIONS [J].
ABKEVICH, VI ;
GUTIN, AM ;
SHAKHNOVICH, EI .
JOURNAL OF CHEMICAL PHYSICS, 1994, 101 (07) :6052-6062
[2]   Neutral networks in protein space: a computational study based on knowledge-based potentials of mean force [J].
Babajide, A ;
Hofacker, IL ;
Sippl, MJ ;
Stadler, PF .
FOLDING & DESIGN, 1997, 2 (05) :261-269
[3]   Neutral evolution of model proteins: Diffusion in sequence space and overdispersion [J].
Bastolla, U ;
Roman, HE ;
Vendruscolo, M .
JOURNAL OF THEORETICAL BIOLOGY, 1999, 200 (01) :49-64
[4]   Principal eigenvector of contact matrices and hydrophobicity profiles in proteins [J].
Bastolla, U ;
Porto, M ;
Roman, HE ;
Vendruscolo, M .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 58 (01) :22-30
[5]   Genomic determinants of protein folding thermodynamics in prokaryotic organisms [J].
Bastolla, U ;
Moya, A ;
Viguera, E ;
van Ham, RCHJ .
JOURNAL OF MOLECULAR BIOLOGY, 2004, 343 (05) :1451-1466
[6]   Statistical properties of neutral evolution [J].
Bastolla, U ;
Porto, M ;
Roman, HE ;
Vendruscolo, M .
JOURNAL OF MOLECULAR EVOLUTION, 2003, 57 (Suppl 1) :S103-S119
[7]   Lack of self-averaging in neutral evolution of proteins [J].
Bastolla, U ;
Porto, M ;
Roman, HE ;
Vendruscolo, M .
PHYSICAL REVIEW LETTERS, 2002, 89 (20) :208101-208101
[8]   Connectivity of neutral networks, overdispersion, and structural conservation in protein evolution [J].
Bastolla, U ;
Porto, M ;
Roman, HE ;
Vendruscolo, M .
JOURNAL OF MOLECULAR EVOLUTION, 2003, 56 (03) :243-254
[9]   How to guarantee optimal stability for most representative structures in the protein data bank [J].
Bastolla, U ;
Farwer, J ;
Knapp, EW ;
Vendruscolo, M .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 2001, 44 (02) :79-96
[10]  
Bastolla U, 1998, PROTEINS, V32, P52