Correlated sequence-signatures as markers of protein-protein interaction

被引:281
作者
Sprinzak, E [1 ]
Margalit, H [1 ]
机构
[1] Hebrew Univ Jerusalem, Hadassah Med Sch, Dept Mol Genet & Biotechnol, IL-91120 Jerusalem, Israel
基金
以色列科学基金会;
关键词
protein-protein interaction; functional genomics; proteomics; bioinformatics; sequence-signature;
D O I
10.1006/jmbi.2001.4920
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
As protein-protein interaction is intrinsic to most cellular processes, the ability to predict which proteins in the cell interact can aid significantly in identifying the function of newly discovered proteins, and in understanding the molecular networks they participate in. Here we demonstrate that characteristic pairs of sequence-signatures can be learned from a database of experimentally determined interacting proteins, where one protein contains the one sequence-signature and its interacting partner contains the other sequence-signature. The sequence-signatures that recur in concert in various pairs of interacting proteins are termed correlated sequence-signatures, and it is proposed that they can be used for predicting putative pairs of interacting partners in the cell. We demonstrate the potential of this approach on a comprehensive database of experimentally determined pairs of interacting proteins in the yeast Saccharomyces cerevisiae. The proteins in this database have been characterized by their sequence-signatures, as defined by the InterPro classification. A statistical analysis performed on all possible combinations of sequence-signature pairs has identified those pairs that are over-represented in the database of yeast interacting proteins. It is demonstrated how the use of the correlated sequence-signatures as identifiers of interacting proteins can reduce significantly the search space, and enable directed experimental interaction screens. (C) 2001 Academic Press.
引用
收藏
页码:681 / 692
页数:12
相关论文
共 48 条
  • [11] Predicting subcellular localization of proteins based on their N-terminal amino acid sequence
    Emanuelsson, O
    Nielsen, H
    Brunak, S
    von Heijne, G
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2000, 300 (04) : 1005 - 1016
  • [12] Protein interaction maps for complete genomes based on gene fusion events
    Enright, AJ
    Iliopoulos, I
    Kyrpides, NC
    Ouzounis, CA
    [J]. NATURE, 1999, 402 (6757) : 86 - 90
  • [13] Fellenberg M, 2000, Proc Int Conf Intell Syst Mol Biol, V8, P152
  • [14] VESICLE FUSION FROM YEAST TO MAN
    FERRONOVICK, S
    JAHN, R
    [J]. NATURE, 1994, 370 (6486) : 191 - 193
  • [15] A genomic approach of the hepatitis C virus generates a protein interaction map
    Flajolet, M
    Rotondo, G
    Daviet, L
    Bergametti, F
    Inchauspé, G
    Tiollais, P
    Transy, C
    Legrain, P
    [J]. GENE, 2000, 242 (1-2) : 369 - 379
  • [16] THE BROMODOMAIN - A CONSERVED SEQUENCE FOUND IN HUMAN, DROSOPHILA AND YEAST PROTEINS
    HAYNES, SR
    DOLLARD, C
    WINSTON, F
    BECK, S
    TROWSDALE, J
    DAWID, IB
    [J]. NUCLEIC ACIDS RESEARCH, 1992, 20 (10) : 2603 - 2603
  • [17] The PROSITE database, its status in 1999
    Hofmann, K
    Bucher, P
    Falquet, L
    Bairoch, A
    [J]. NUCLEIC ACIDS RESEARCH, 1999, 27 (01) : 215 - 219
  • [18] A guided tour in protein interaction space: Coiled coils from the yeast proteome
    Hu, JC
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (24) : 12935 - 12936
  • [19] Computational identification of cis-regulatory elements associated with groups of functionally related genes in Saccharomyces cerevisiae
    Hughes, JD
    Estep, PW
    Tavazoie, S
    Church, GM
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2000, 296 (05) : 1205 - 1214
  • [20] Toward a protein-protein interaction map of the budding yeast: A comprehensive system to examine two-hybrid interactions in all possible combinations between the yeast proteins
    Ito, T
    Tashiro, K
    Muta, S
    Ozawa, R
    Chiba, T
    Nishizawa, M
    Yamamoto, K
    Kuhara, S
    Sakaki, Y
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (03) : 1143 - 1147