The ConSurf-HSSP database: The mapping of evolutionary conservation among homologs onto PDB structures

Cited by: 114
Authors
Glaser, F
Rosenberg, Y
Kessel, A
Pupko, T
Ben-Tal, N [1]
Affiliations
[1] Tel Aviv Univ, George S Wise Fac Life Sci, Dept Biochem, IL-69978 Tel Aviv, Israel
[2] Tel Aviv Univ, Dept Cell Res & Immunol, IL-69978 Tel Aviv, Israel
Keywords
evolutionary rate; amino acid conservation; protein evolution; ConSurf; phylogeny; Rate4Site; pyruvate kinase
DOI
10.1002/prot.20305
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Subject classification code
071010; 081704
Abstract
The HSSP (Homology-Derived Secondary Structure of Proteins) database provides multiple sequence alignments (MSAs) for proteins of known three-dimensional (3D) structure in the Protein Data Bank (PDB). The database also contains an estimate of the degree of evolutionary conservation at each amino acid position. This estimate, which is based on the relative entropy, correlates with the functional importance of the position; evolutionarily conserved positions (i.e., positions with limited variability and low entropy) are often important to maintain the 3D structure and biological function(s) of the protein. We recently developed the Rate4Site algorithm for scoring amino acid conservation based on the calculated evolutionary rate of each position. This algorithm takes into account the phylogenetic relationships between the homologs and the stochastic nature of the evolutionary process. Here we present the ConSurf-HSSP database of Rate4Site estimates of the evolutionary rates of the amino acid positions, calculated using HSSP's MSAs. The database provides precalculated evolutionary rates for nearly all of the PDB. These rates are projected, using a color code, onto the protein structure, and can be viewed online using the ConSurf server interface. To exemplify the database, we analyzed in detail the conservation pattern obtained for pyruvate kinase and compared the results with those observed using the relative entropy scores of the HSSP database. Reassuringly, the main functional region of the enzyme is detectable using both conservation scores. Interestingly, the ConSurf-HSSP calculations mapped additional functionally important regions, which are moderately conserved and were overlooked by the original HSSP estimate. The ConSurf-HSSP database is available online (http://consurf-hssp.tau.ac.il). (C) 2004 Wiley-Liss, Inc.
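As a rough illustration of the entropy-based conservation estimate mentioned in the abstract, the sketch below computes a normalized Shannon entropy for each column of a toy alignment. It is only a minimal sketch under stated assumptions: the function names, the gap handling, the normalization by log(20), and the toy sequences are all hypothetical and are not taken from HSSP or ConSurf-HSSP. It also does not implement Rate4Site, which additionally accounts for the phylogenetic relationships among the homologs.

```python
import math
from collections import Counter


def column_entropy(column, alphabet_size=20):
    """Normalized Shannon entropy of one alignment column.

    0.0 means fully conserved; values near 1.0 mean highly variable.
    Gap characters are ignored (an assumption of this sketch).
    """
    residues = [c for c in column if c not in "-."]
    if not residues:
        return float("nan")  # column contains only gaps
    total = len(residues)
    counts = Counter(residues)
    # Sum of -p * log(p) over the observed residue types
    h = sum(-(n / total) * math.log(n / total) for n in counts.values())
    return h / math.log(alphabet_size)


def msa_entropy(sequences):
    """Per-column entropy for equal-length aligned sequences."""
    return [column_entropy(col) for col in zip(*sequences)]


# Hypothetical toy alignment: four homologs, five aligned positions
toy_msa = ["MKVLA", "MKILA", "MRVLG", "MKVLS"]
scores = msa_entropy(toy_msa)
for position, score in enumerate(scores, start=1):
    print(position, round(score, 3))
```

In this toy alignment the first and fourth columns are invariant and score lowest, while the last column, with three different residue types, scores highest. A score of this kind treats all sequences as independent, which is the limitation a phylogeny-aware rate estimate is meant to address.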
Pages: 610-617
Page count: 8
Related papers
52 in total
  • [1] Automated structure-based prediction of functional sites in proteins: Applications to assessing the validity of inheriting protein function from homology in genome annotation and to protein docking
    Aloy, P
    Querol, E
    Aviles, FX
    Sternberg, MJE
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2001, 311 (02) : 395 - 408
  • [2] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [3] ConSurf: An algorithmic tool for the identification of functional regions in proteins by surface mapping of phylogenetic information
    Armon, A
    Graur, D
    Ben-Tal, N
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2001, 307 (01) : 447 - 463
  • [4] In silico identification of functional protein interfaces
    Bell, RE
    Ben-Tal, N
    [J]. COMPARATIVE AND FUNCTIONAL GENOMICS, 2003, 4 (04) : 420 - 423
  • [5] Benson DA, 2003, NUCLEIC ACIDS RES, V31, P23, DOI 10.1093/nar/gkg057
  • [6] The Protein Data Bank
    Berman, HM
    Westbrook, J
    Feng, Z
    Gilliland, G
    Bhat, TN
    Weissig, H
    Shindyalov, IN
    Bourne, PE
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 235 - 242
  • [7] Inferring functional constraints and divergence in protein families using 3D mapping of phylogenetic information
    Blouin, C
    Boucher, Y
    Roger, AJ
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (02) : 790 - 797
  • [8] A tour of structural genomics
    Brenner, SE
    [J]. NATURE REVIEWS GENETICS, 2001, 2 (10) : 801 - 809
  • [9] Dean A M, 2000, Pac Symp Biocomput, P6
  • [10] The HSSP database of protein structure sequence alignments and family profiles
    Dodge, C
    Schneider, R
    Sander, C
    [J]. NUCLEIC ACIDS RESEARCH, 1998, 26 (01) : 313 - 315