MULTIPROSPECTOR: An algorithm for the prediction of protein-protein interactions by multimeric threading

被引:182
作者
Lu, L
Lu, H
Skolnick, J
机构
[1] Donald Danforth Plant Sci Ctr, Lab Computat Genom, St Louis, MO 63132 USA
[2] Washington Univ, Sch Med, Dept Biochem & Mol Biophys, St Louis, MO 63132 USA
关键词
protein-protein interactions; threading; interfacial potentials; protein dimers; genomic scale predictions;
D O I
10.1002/prot.10222
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
In this postgenomic era, the ability to identify protein-protein interactions on a genomic scale is very important to assist in the assignment of physiological function. Because of the increasing number of solved structures involving protein complexes, the time is ripe to extend threading to the prediction of quaternary structure. In this spirit, a multimeric threading approach has been developed. The approach is comprised of two phases. In the first phase, traditional threading on a single chain is applied to generate a set of potential structures for the query sequences. In particular, we use our recently developed threading algorithm, PROSPECTOR. Then, for those proteins whose template structures are part of a known complex, we re-thread on both partners in the complex and now include a protein-protein interfacial energy. To perform this analysis, a database of multimeric protein structures has been constructed, the necessary interfacial pairwise potentials have been derived, and a set of empirical indicators to identify true multimers based on the threading Z-score and the magnitude of the interfacial energy have been established. The algorithm has been tested on a benchmark set comprised of 40 homodimers, 15 heterodimers, and 69 monomers that were scanned against a protein library of 2478 structures that comprise a representative set of structures in the Protein Data Bank. Of these, the method correctly recognized and assigned 36 homodimers, 15 heterodimers, and 65 monomers. This protocol was applied to identify partners and assign quaternary structures of proteins found in the yeast database of interacting proteins. Our multimeric threading algorithm correctly predicts 144 interacting proteins, compared to the 56 (26) cases assigned by PSI-BLAST using a (less) permissive E-value of 1 (0.01). Next, all possible pairs of yeast proteins have been examined. Predictions (n = 2865) of protein-protein interactions are made; 1138 of these 2865 interactions have counterparts in the Database of Interacting Proteins. In contrast, PSI-BLAST made 1781 predictions, and 1215 have counterparts in DIP. An estimation of the false-negative rate for yeast-predicted interactions has also been provided. Thus, a promising approach to help assist in the assignment of protein-protein interactions on a genomic scale has been developed. (C) 2002 Wiley-Liss, Inc.
引用
收藏
页码:350 / 364
页数:15
相关论文
共 52 条
[1]  
Alberts B., 1994, MOL BIOL CELL
[2]   Interrogating protein interaction networks through structural biology [J].
Aloy, P ;
Russell, RB .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (09) :5896-5901
[3]   Structural similarity to link sequence space: New potential superfamilies and implications for structural genomics [J].
Aloy, P ;
Oliva, B ;
Querol, E ;
Aviles, FX ;
Russell, RB .
PROTEIN SCIENCE, 2002, 11 (05) :1101-1116
[4]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[5]   Protein structure prediction and structural genomics [J].
Baker, D ;
Sali, A .
SCIENCE, 2001, 294 (5540) :93-96
[6]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[7]   The complete genome sequence of Escherichia coli K-12 [J].
Blattner, FR ;
Plunkett, G ;
Bloch, CA ;
Perna, NT ;
Burland, V ;
Riley, M ;
ColladoVides, J ;
Glasner, JD ;
Rode, CK ;
Mayhew, GF ;
Gregor, J ;
Davis, NW ;
Kirkpatrick, HA ;
Goeden, MA ;
Rose, DJ ;
Mau, B ;
Shao, Y .
SCIENCE, 1997, 277 (5331) :1453-+
[8]   Predicting protein-protein interactions from primary structure [J].
Bock, JR ;
Gough, DA .
BIOINFORMATICS, 2001, 17 (05) :455-460
[9]  
Bollag D M, 1994, Methods Mol Biol, V36, P1
[10]   3-DIMENSIONAL STRUCTURE OF THE COMPLEX BETWEEN PANCREATIC SECRETORY TRYPSIN-INHIBITOR (KAZAL TYPE) AND TRYPSINOGEN AT 1-8 A RESOLUTION - STRUCTURE SOLUTION, CRYSTALLOGRAPHIC REFINEMENT AND PRELIMINARY STRUCTURAL INTERPRETATION [J].
BOLOGNESI, M ;
GATTI, G ;
MENEGATTI, E ;
GUARNERI, M ;
MARQUART, M ;
PAPAMOKOS, E ;
HUBER, R .
JOURNAL OF MOLECULAR BIOLOGY, 1982, 162 (04) :839-868