An accurate, sensitive, and scalable method to identify functional sites in protein structures

被引:151
作者
Yao, H
Kristensen, DM
Mihalek, I
Sowa, ME
Shaw, C
Kimmel, M
Kavraki, L
Lichtarge, O
机构
[1] Baylor Coll Med, Dept Mol & Human Genet, Houston, TX 77030 USA
[2] Baylor Coll Med, Program Struct & Computat Biol & Mol Biophys, Houston, TX 77030 USA
[3] WM Keck Ctr Computat Biol, Houston, TX 77030 USA
[4] Baylor Coll Med, Verna & Marrs McLean Dept Biochem & Mol Biol, Houston, TX 77030 USA
[5] Rice Univ, Dept Comp Sci, Houston, TX 77030 USA
[6] Rice Univ, Dept Bioengn, Houston, TX 77030 USA
[7] Baylor Coll Med, Program Dev Biol, Houston, TX 77030 USA
[8] Baylor Coll Med, Human Genome Sequencing Ctr, Houston, TX 77030 USA
关键词
molecular recognition; protein evolution; protein interactions; structural motif; drug target;
D O I
10.1016/S0022-2836(02)01336-0
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Functional sites determine the activity and interactions of proteins and as such constitute the targets of most drugs. However, the exponential growth of sequence and structure data far exceeds the ability of experimental techniques to identify their locations and key amino acids. To fill this gap we developed a computational Evolutionary Trace method that ranks the evolutionary importance of amino acids in protein sequences. Studies show that the best-ranked residues form fewer and larger structural clusters than expected by chance and overlap with functional sites, but until now the significance of this overlap has remained qualitative. Here, we use 86 diverse protein structures, including 20 determined by the structural genomics initiative, to show that this overlap is a recurrent and statistically significant feature. An automated ET correctly identifies seven of ten functional sites by the least favorable statistical measure, and nine of ten by the most favorable ones These results quantitatively,, demonstrate that a large fraction of functional sites in the proteome may be accurately identified from sequence and structure. This should help focus structure-function studies, rational drug design, protein engineering, and functional annotation to the relevant regions of a protein. (C) 2003 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:255 / 261
页数:7
相关论文
共 31 条