A simple topological representation of protein structure: Implications for new, fast, and robust structural classification

被引:19
作者
Bostick, DL
Shen, M
Vaisman, II [1 ]
机构
[1] George Mason Univ, Sch Computat Sci, Lab Struct Bioinformat, Manassas, VA 20110 USA
[2] Univ N Carolina, Dept Phys, Chapel Hill, NC USA
[3] Univ N Carolina, Program Mol Cell BIophys, Chapel Hill, NC USA
[4] Univ N Carolina, Sch Pharm, Lab Mol Modeling, Chapel Hill, NC USA
关键词
protein topology; hierarchical classification; protein structure comparison; computational geometry; protein evolution; Delaunay tessellation;
D O I
10.1002/prot.20146
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
A topological representation of proteins is developed that makes use of two metrics: the Euclidean metric for identifying natural nearest neighboring residues via the Delaunay tessellation in Cartesian space and the distance between residues in sequence space. Using this representation, we introduce a quantitative and computationally inexpensive method for the comparison of protein structural topology. The method ultimately results in a numerical score quantifying the distance between proteins in a heuristically defined topological space. The properties of this scoring scheme are investigated and correlated with the standard C-alpha distance root-mean-square deviation measure of protein similarity calculated by rigid body structural alignment. The topological comparison method is shown to have a characteristic dependence on protein conformational. differences and secondary structure. This distinctive behavior is also observed in the comparison of proteins within families of structural relatives. The ability of the comparison method to successfully classify proteins into classes, super-families, folds, and families that are consistent with standard classification methods, both automated and human-driven, is demonstrated. Furthermore, it is shown that the scoring method allows for a fine-grained classification on the family, protein, and species level that agrees very well with currently established phylogenetic hierarchies. This fine classification is achieved without requiring visual inspection of proteins, sequence analysis, or the use of structural superimposition methods. Implications of the method for a fast, automated, topological hierarchical classification of proteins are discussed. (C) 2004 Wiley-Liss, Inc.
引用
收藏
页码:487 / 501
页数:15
相关论文
共 51 条
[1]   Interchanges of spatially neighbouring residues in structurally conserved environments [J].
Azarya-Sprinzak, E ;
Naor, D ;
Wolfson, HJ ;
Nussinov, R .
PROTEIN ENGINEERING, 1997, 10 (10) :1109-1122
[2]   GROMACS - A MESSAGE-PASSING PARALLEL MOLECULAR-DYNAMICS IMPLEMENTATION [J].
BERENDSEN, HJC ;
VANDERSPOEL, D ;
VANDRUNEN, R .
COMPUTER PHYSICS COMMUNICATIONS, 1995, 91 (1-3) :43-56
[3]   Molecular dynamics simulations of a fluid bilayer of dipalmitoylphosphatidylcholine at full hydration, constant pressure, and constant temperature [J].
Berger, O ;
Edholm, O ;
Jahnig, F .
BIOPHYSICAL JOURNAL, 1997, 72 (05) :2002-2013
[4]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[5]   A new topological method to measure protein structure similarity [J].
Bostick, D ;
Vaisman, II .
BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2003, 304 (02) :320-325
[6]  
Brenner SE, 1996, METHOD ENZYMOL, V266, P635
[7]   How are close residues of protein structures distributed in primary sequence? [J].
Brocchieri, L ;
Karlin, S .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1995, 92 (26) :12136-12140
[8]   Protein fold similarity estimated by a probabilistic approach based on Cα-Cα distance comparison [J].
Carugo, O ;
Pongor, S .
JOURNAL OF MOLECULAR BIOLOGY, 2002, 315 (04) :887-898
[9]   A normalized root-mean-square distance for comparing protein three-dimensional structures [J].
Carugo, O ;
Pongor, S .
PROTEIN SCIENCE, 2001, 10 (07) :1470-1473
[10]   A SMOOTH PARTICLE MESH EWALD METHOD [J].
ESSMANN, U ;
PERERA, L ;
BERKOWITZ, ML ;
DARDEN, T ;
LEE, H ;
PEDERSEN, LG .
JOURNAL OF CHEMICAL PHYSICS, 1995, 103 (19) :8577-8593