Large-scale assessment of the utility of low-resolution protein structures for biochemical function assignment

被引:44
作者
Arakaki, AK [1 ]
Zhang, Y [1 ]
Skolnick, J [1 ]
机构
[1] SUNY Buffalo, Ctr Excellence Bioinformat, Buffalo, NY 14203 USA
基金
美国国家卫生研究院;
关键词
D O I
10.1093/bioinformatics/bth044
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Several protein function prediction methods employ structural features captured in three-dimensional (3D) descriptors of biologically relevant sites. These methods are successful when applied to high-resolution structures, but their detection ability in lower resolution predicted structures has only been tested for a few cases. Results: A method that automatically generates a library of 3D functional descriptors for the structure-based prediction of enzyme active sites (automated functional templates, 593 in total for 162 different enzymes), based on functional and structural information automatically extracted from public databases, has been developed and evaluated using decoy structures. The applicability to predicted structures was investigated by analyzing decoys of varying quality, derived from enzyme native structures. For 35% of decoy structures, our method identifies the active site in models having 3-4 Angstrom coordinate root mean square deviation from the native structure, a quality that is reachable using state of the art protein structure prediction algorithms.
引用
收藏
页码:1087 / 1096
页数:10
相关论文
共 45 条
[21]   The active site architecture of Pisum sativum β-carbonic anhydrase is a mirror image of that of α-carbonic anhydrases [J].
Kimber, MS ;
Pai, EF .
EMBO JOURNAL, 2000, 19 (07) :1407-1418
[22]   Recognition of spatial motifs in protein structures [J].
Kleywegt, GJ .
JOURNAL OF MOLECULAR BIOLOGY, 1999, 285 (04) :1887-1897
[23]  
Koonin EV, 1996, METHOD ENZYMOL, V266, P295
[24]  
Liang M P, 2003, Pac Symp Biocomput, P204
[25]   The mirrored methionine sulfoxide reductases of Neisseria gonorrhoeae pilB [J].
Lowther, WT ;
Weissbach, H ;
Etienne, F ;
Brot, N ;
Matthews, BW .
NATURE STRUCTURAL BIOLOGY, 2002, 9 (05) :348-352
[26]  
MURZIN AG, 1995, J MOL BIOL, V247, P536, DOI 10.1016/S0022-2836(05)80134-2
[27]   Data mining the protein data bank: Residue interactions [J].
Oldfield, TJ .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2002, 49 (04) :510-528
[28]   CATH - a hierarchic classification of protein domain structures [J].
Orengo, CA ;
Michie, AD ;
Jones, S ;
Jones, DT ;
Swindells, MB ;
Thornton, JM .
STRUCTURE, 1997, 5 (08) :1093-1108
[29]   A geometric algorithm to find small but highly similar 3D substructures in proteins [J].
Pennec, X ;
Ayache, N .
BIOINFORMATICS, 1998, 14 (06) :516-522
[30]   Genes and proteins of Escherichia coli K-12 (GenProtEC) [J].
Riley, M .
NUCLEIC ACIDS RESEARCH, 1998, 26 (01) :54-54