Sequence variation in ligand binding sites in proteins

被引:66
作者
Magliery, TJ
Regan, L
机构
[1] Yale Univ, Dept Mol Biophys & Biochem, New Haven, CT 06520 USA
[2] Yale Univ, Dept Chem, New Haven, CT USA
关键词
D O I
10.1186/1471-2105-6-240
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The recent explosion in the availability of complete genome sequences has led to the cataloging of tens of thousands of new proteins and putative proteins. Many of these proteins can be structurally or functionally categorized from sequence conservation alone. In contrast, little attention has been given to the meaning of poorly-conserved sites in families of proteins, which are typically assumed to be of little structural or functional importance. Results: Recently, using statistical free energy analysis of tetratricopeptide repeat (TPR) domains, we observed that positions in contact with peptide ligands are more variable than surface positions in general. Here we show that statistical analysis of TPRs, ankyrin repeats, Cys(2)His(2) zinc fingers and PDZ domains accurately identifies specificity-determining positions by their sequence variation. Sequence variation is measured as deviation from a neutral reference state, and we present probabilistic and information theory formalisms that improve upon recently suggested methods such as statistical free energies and sequence entropies. Conclusion: Sequence variation has been used to identify functionally-important residues in four selected protein families. With TPRs and ankyrin repeats, protein families that bind highly diverse ligands, the effect is so pronounced that sequence "hypervariation" alone can be used to predict ligand binding sites.
引用
收藏
页数:11
相关论文
共 45 条
[21]   The KEGG databases at GenomeNet [J].
Kanehisa, M ;
Goto, S ;
Kawashima, S ;
Nakaya, A .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :42-46
[22]   Three-dimensional cluster analysis identifies interfaces and functional residue clusters in proteins [J].
Landgraf, R ;
Xenarios, I ;
Eisenberg, D .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 307 (05) :1487-1502
[23]   An evolutionary trace method defines binding surfaces common to protein families [J].
Lichtarge, O ;
Bourne, HR ;
Cohen, FE .
JOURNAL OF MOLECULAR BIOLOGY, 1996, 257 (02) :342-358
[24]   Evolutionarily conserved pathways of energetic connectivity in protein families [J].
Lockless, SW ;
Ranganathan, R .
SCIENCE, 1999, 286 (5438) :295-299
[25]   Protein-protein interactions: Structurally conserved residues distinguish between binding sites and exposed protein surfaces [J].
Ma, BY ;
Elkayam, T ;
Wolfson, H ;
Nussinov, R .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2003, 100 (10) :5772-5777
[26]   Beyond consensus: Statistical free energies reveal hidden interactions in the design of a TPR motif [J].
Magliery, TJ ;
Regan, L .
JOURNAL OF MOLECULAR BIOLOGY, 2004, 343 (03) :731-745
[27]   Automatic methods for predicting functionally important residues [J].
Mesa, AD ;
Pazos, F ;
Valencia, A .
JOURNAL OF MOLECULAR BIOLOGY, 2003, 326 (04) :1289-1302
[28]   A family of evolution-entropy hybrid methods for ranking protein residues by importance [J].
Mihalek, I ;
Res, I ;
Lichtarge, O .
JOURNAL OF MOLECULAR BIOLOGY, 2004, 336 (05) :1265-1282
[29]   Design and selection of novel Cys2His2 zinc finger proteins [J].
Pabo, CO ;
Peisach, E ;
Grant, RA .
ANNUAL REVIEW OF BIOCHEMISTRY, 2001, 70 :313-340
[30]   Computer-aided design of a PDZ domain to recognize new target sequences [J].
Reina, J ;
Lacroix, E ;
Hobson, SD ;
Fernandez-Ballester, G ;
Rybin, V ;
Schwab, MS ;
Serrano, L ;
Gonzalez, C .
NATURE STRUCTURAL BIOLOGY, 2002, 9 (08) :621-627