Local feature frequency profile: A method to measure structural similarity in proteins

被引:63
作者
Choi, IG
Kwon, J
Kim, SH [1 ]
机构
[1] Univ Calif Berkeley, Dept Chem, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
[3] Lawrence Berkeley Natl Lab, Berkeley, CA 94720 USA
关键词
protein structural similarity; protein distance matrix; local protein structural features profile; protein fold; protein fold space;
D O I
10.1073/pnas.0308656100
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Measures of structural similarity between known protein structures provide an objective basis for classifying protein folds and for revealing a global view of the protein structure universe. Here, we describe a rapid method to measure structural similarity based on the profiles of representative local features of C. distance matrices of compared protein structures. We first extract a finite number of representative local feature (LF) patterns from the distance matrices of all protein fold families by medoid analysis. Then, each C. distance matrix of a protein structure is encoded by labeling all its submatrices by the index of the nearest representative LF patterns. Finally, the structure is represented by the frequency distribution of these indices, which we call the LF frequency (LFF) profile of the protein. The LFF profile allows one to calculate structural similarity scores among a large number of protein structures quickly, and also to construct and update the "map" of the protein structure universe easily. The LFF profile method efficiently maps complex protein structures into a common Euclidean space without prior assignment of secondary structure information or structural alignment.
引用
收藏
页码:3797 / 3802
页数:6
相关论文
共 23 条
[1]   Structural genomics - Tapping DNA for structures produces a trickle [J].
Service, RF .
SCIENCE, 2002, 298 (5595) :948-950
[2]  
Aung ZY, 2003, EIGHTH INTERNATIONAL CONFERENCE ON DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, P311
[3]  
Bartlett Gail J, 2003, Methods Biochem Anal, V44, P387
[4]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[5]   Protein fold similarity estimated by a probabilistic approach based on Cα-Cα distance comparison [J].
Carugo, O ;
Pongor, S .
JOURNAL OF MOLECULAR BIOLOGY, 2002, 315 (04) :887-898
[6]   ASTRAL compendium enhancements [J].
Chandonia, JM ;
Walker, NS ;
Conte, LL ;
Koehl, P ;
Levitt, M ;
Brenner, SE .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :260-263
[7]  
GABRIEL KR, 1971, BIOMETRIKA, V58, P453, DOI 10.2307/2334381
[8]   Surprising similarities in structure comparison [J].
Gibrat, JF ;
Madej, T ;
Bryant, SH .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1996, 6 (03) :377-385
[9]   Quantifying the similarities within fold space [J].
Harrison, A ;
Pearl, F ;
Mott, R ;
Thornton, J ;
Orengo, C .
JOURNAL OF MOLECULAR BIOLOGY, 2002, 323 (05) :909-926
[10]   THE COMBINATORIAL DISTANCE GEOMETRY METHOD FOR THE CALCULATION OF MOLECULAR-CONFORMATION .1. A NEW APPROACH TO AN OLD PROBLEM [J].
HAVEL, TF ;
KUNTZ, ID ;
CRIPPEN, GM .
JOURNAL OF THEORETICAL BIOLOGY, 1983, 104 (03) :359-381