Toward a "Structural BLAST": Using structural relationships to infer function

被引:15
作者
Dey, Fabian
Zhang, Qiangfeng Cliff
Petrey, Donald
Honig, Barry [1 ,2 ]
机构
[1] Columbia Univ, Howard Hughes Med Inst, Dept Biochem & Mol Biophys, Ctr Computat Biol & Bioinformat, New York, NY 10032 USA
[2] Columbia Univ, Initiat Syst Biol, New York, NY 10032 USA
基金
瑞士国家科学基金会;
关键词
protein interaction prediction; protein interface prediction; structure; function relationship; machine learning; PROTEIN-PROTEIN INTERACTIONS; LIKELY COMPLETENESS; FOLD SPACE; PREDICTION; ALIGNMENT; DATABASE; INTERFACES; SERVER; CONSERVATION; EVOLUTIONARY;
D O I
10.1002/pro.2225
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We outline a set of strategies to infer protein function from structure. The overall approach depends on extensive use of homology modeling, the exploitation of a wide range of global and local geometric relationships between protein structures and the use of machine learning techniques. The combination of modeling with broad searches of protein structure space defines a structural BLAST approach to infer function with high genomic coverage. Applications are described to the prediction of proteinprotein and proteinligand interactions. In the context of proteinprotein interactions, our structure-based prediction algorithm, PrePPI, has comparable accuracy to high-throughput experiments. An essential feature of PrePPI involves the use of Bayesian methods to combine structure-derived information with non-structural evidence (e.g. co-expression) to assign a likelihood for each predicted interaction. This, combined with a structural BLAST approach significantly expands the range of applications of protein structure in the annotation of protein function, including systems level biological applications where it has previously played little role.
引用
收藏
页码:359 / 366
页数:8
相关论文
共 50 条
[1]   Cradle-loop barrels and the concept of metafolds in protein classification by natural descent [J].
Alva, Vikram ;
Koretke, Kristin K. ;
Coles, Murray ;
Lupas, Andrei N. .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 2008, 18 (03) :358-365
[2]   SCOP database in 2004: refinements integrate structure and sequence family data [J].
Andreeva, A ;
Howorth, D ;
Brenner, SE ;
Hubbard, TJP ;
Chothia, C ;
Murzin, AG .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D226-D229
[3]   Monophyly of class I aminoacyl tRNA synthetase, USPA, ETFP, photolyase, and PP-ATPase nucleotide-binding domains: Implications for protein evolution in the RNA world [J].
Aravind, L ;
Anantharaman, V ;
Koonin, EV .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2002, 48 (01) :1-14
[4]   A threading-based method (FINDSITE) for ligand-binding site prediction and functional annotation [J].
Brylinski, Michal ;
Skolnick, Jeffrey .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2008, 105 (01) :129-134
[5]   Protein complex compositions predicted by structural similarity [J].
Davis, Fred P. ;
Braberg, Hannes ;
Shen, Min-Yi ;
Pieper, Ursula ;
Sali, Andrei ;
Madhusudhan, M. S. .
NUCLEIC ACIDS RESEARCH, 2006, 34 (10) :2943-2952
[6]   LigASite -: a database of biologically relevant binding sites in proteins with known apo-structures [J].
Dessailly, Benoit H. ;
Lensink, Marc F. ;
Orengo, Christine A. ;
Wodak, Shoshana J. .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D667-D673
[7]  
Dey F, PREDICTION LIG UNPUB
[8]   Expanding protein universe and its origin from the biological Big Bang [J].
Dokholyan, NV ;
Shakhnovich, B ;
Shakhnovich, EI .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (22) :14132-14136
[9]   MarkUs: a server to navigate sequence-structure-function space [J].
Fischer, Markus ;
Zhang, Qiangfeng Cliff ;
Dey, Fabian ;
Chen, Brian Y. ;
Honig, Barry ;
Petrey, Donald .
NUCLEIC ACIDS RESEARCH, 2011, 39 :W357-W361
[10]   Using an alignment of fragment strings for comparing protein structures [J].
Friedberg, Iddo ;
Harder, Tim ;
Kolodny, Rachel ;
Sitbon, Einat ;
Li, Zhanwen ;
Godzik, Adam .
BIOINFORMATICS, 2007, 23 (02) :E219-E224