The inference of protein-protein interactions by co-evolutionary analysis is improved by excluding the information about the phylogenetic relationships

被引:93
作者
Sato, T [1 ]
Yamanishi, Y
Kanehisa, M
Toh, H
机构
[1] Kyoto Univ, Chem Res Inst, Bioinformat Ctr, Kyoto 6110011, Japan
[2] Ecole Mines Paris, Ctr Geostat, F-77305 Fontainebleau, France
[3] Kyushu Univ, Med Inst Bioregulat, Div Bioinformat, Fukuoka 8128582, Japan
基金
日本学术振兴会;
关键词
D O I
10.1093/bioinformatics/bti564
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The prediction of protein-protein interactions is currently an important issue in bioinformatics. The mirror tree method uses evolutionary information to predict protein-protein interactions. However, it has been recognized that predictions by the mirror tree method lead to many false positives. The incentive of our study was to solve this problem by improving the method of extracting the co-evolutionary information regarding the protein pairs. Results: We developed a novel method to predict protein-protein interactions from co-evolutionary information in the framework of the mirror tree method. The originality is the use of the projection operator to exclude the information about the phylogenetic relationships among the source organisms from the distance matrix. Each distance matrix was transformed into a vector for the operation. The vector is referred to as a 'phylogenetic vector'. We have proposed three ways to extract the phylogenetic information: (1) using the 16S rRNA from the same source organisms as the proteins under consideration, (2) averaging the phylogenetic vectors and (3) analyzing the principal components of the phylogenetic vectors. We examined the performance of the proposed methods to predict interacting protein pairs from Escherichia coli, using experimentally verified data. Our method was successful, and it drastically reduced the number of false positives in the prediction.
引用
收藏
页码:3482 / 3489
页数:8
相关论文
共 23 条
[1]   Conservation of gene order: a fingerprint of proteins that physically interact [J].
Dandekar, T ;
Snel, B ;
Huynen, M ;
Bork, P .
TRENDS IN BIOCHEMICAL SCIENCES, 1998, 23 (09) :324-328
[2]   Protein interaction maps for complete genomes based on gene fusion events [J].
Enright, AJ ;
Iliopoulos, I ;
Kyrpides, NC ;
Ouzounis, CA .
NATURE, 1999, 402 (6757) :86-90
[3]  
FELSENSTEIN J, 2004, PHYLIP VERS 3 6 DIST
[4]   Functional organization of the yeast proteome by systematic analysis of protein complexes [J].
Gavin, AC ;
Bösche, M ;
Krause, R ;
Grandi, P ;
Marzioch, M ;
Bauer, A ;
Schultz, J ;
Rick, JM ;
Michon, AM ;
Cruciat, CM ;
Remor, M ;
Höfert, C ;
Schelder, M ;
Brajenovic, M ;
Ruffner, H ;
Merino, A ;
Klein, K ;
Hudak, M ;
Dickson, D ;
Rudi, T ;
Gnau, V ;
Bauch, A ;
Bastuck, S ;
Huhse, B ;
Leutwein, C ;
Heurtier, MA ;
Copley, RR ;
Edelmann, A ;
Querfurth, E ;
Rybin, V ;
Drewes, G ;
Raida, M ;
Bouwmeester, T ;
Bork, P ;
Seraphin, B ;
Kuster, B ;
Neubauer, G ;
Superti-Furga, G .
NATURE, 2002, 415 (6868) :141-147
[5]   Inferring protein interactions from phylogenetic distance matrices [J].
Gertz, J ;
Elfond, G ;
Shustrova, A ;
Weisinger, M ;
Pellegrini, M ;
Cokus, S ;
Rothschild, B .
BIOINFORMATICS, 2003, 19 (16) :2039-2045
[6]   Co-evolution of proteins with their interaction partners [J].
Goh, CS ;
Bogan, AA ;
Joachimiak, M ;
Walther, D ;
Cohen, FE .
JOURNAL OF MOLECULAR BIOLOGY, 2000, 299 (02) :283-293
[7]   ASRP:: the Arabidopsis Small RNA Project Database [J].
Gustafson, AM ;
Allen, E ;
Givan, S ;
Smith, D ;
Carrington, JC ;
Kasschau, KD .
NUCLEIC ACIDS RESEARCH, 2005, 33 :D637-D640
[8]   Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry [J].
Ho, Y ;
Gruhler, A ;
Heilbut, A ;
Bader, GD ;
Moore, L ;
Adams, SL ;
Millar, A ;
Taylor, P ;
Bennett, K ;
Boutilier, K ;
Yang, LY ;
Wolting, C ;
Donaldson, I ;
Schandorff, S ;
Shewnarane, J ;
Vo, M ;
Taggart, J ;
Goudreault, M ;
Muskat, B ;
Alfarano, C ;
Dewar, D ;
Lin, Z ;
Michalickova, K ;
Willems, AR ;
Sassi, H ;
Nielsen, PA ;
Rasmussen, KJ ;
Andersen, JR ;
Johansen, LE ;
Hansen, LH ;
Jespersen, H ;
Podtelejnikov, A ;
Nielsen, E ;
Crawford, J ;
Poulsen, V ;
Sorensen, BD ;
Matthiesen, J ;
Hendrickson, RC ;
Gleeson, F ;
Pawson, T ;
Moran, MF ;
Durocher, D ;
Mann, M ;
Hogue, CWV ;
Figeys, D ;
Tyers, M .
NATURE, 2002, 415 (6868) :180-183
[9]   A comprehensive two-hybrid analysis to explore the yeast protein interactome [J].
Ito, T ;
Chiba, T ;
Ozawa, R ;
Yoshida, M ;
Hattori, M ;
Sakaki, Y .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (08) :4569-4574
[10]   THE RAPID GENERATION OF MUTATION DATA MATRICES FROM PROTEIN SEQUENCES [J].
JONES, DT ;
TAYLOR, WR ;
THORNTON, JM .
COMPUTER APPLICATIONS IN THE BIOSCIENCES, 1992, 8 (03) :275-282