Correlated sequence-signatures as markers of protein-protein interaction

被引:281
作者
Sprinzak, E [1 ]
Margalit, H [1 ]
机构
[1] Hebrew Univ Jerusalem, Hadassah Med Sch, Dept Mol Genet & Biotechnol, IL-91120 Jerusalem, Israel
基金
以色列科学基金会;
关键词
protein-protein interaction; functional genomics; proteomics; bioinformatics; sequence-signature;
D O I
10.1006/jmbi.2001.4920
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
As protein-protein interaction is intrinsic to most cellular processes, the ability to predict which proteins in the cell interact can aid significantly in identifying the function of newly discovered proteins, and in understanding the molecular networks they participate in. Here we demonstrate that characteristic pairs of sequence-signatures can be learned from a database of experimentally determined interacting proteins, where one protein contains the one sequence-signature and its interacting partner contains the other sequence-signature. The sequence-signatures that recur in concert in various pairs of interacting proteins are termed correlated sequence-signatures, and it is proposed that they can be used for predicting putative pairs of interacting partners in the cell. We demonstrate the potential of this approach on a comprehensive database of experimentally determined pairs of interacting proteins in the yeast Saccharomyces cerevisiae. The proteins in this database have been characterized by their sequence-signatures, as defined by the InterPro classification. A statistical analysis performed on all possible combinations of sequence-signature pairs has identified those pairs that are over-represented in the database of yeast interacting proteins. It is demonstrated how the use of the correlated sequence-signatures as identifiers of interacting proteins can reduce significantly the search space, and enable directed experimental interaction screens. (C) 2001 Academic Press.
引用
收藏
页码:681 / 692
页数:12
相关论文
共 48 条
  • [1] The InterPro database, an integrated documentation resource for protein families, domains and functional sites
    Apweiler, R
    Attwood, TK
    Bairoch, A
    Bateman, A
    Birney, E
    Biswas, M
    Bucher, P
    Cerutti, T
    Corpet, F
    Croning, MDR
    Durbin, R
    Falquet, L
    Fleischmann, W
    Gouzy, J
    Hermjakob, H
    Hulo, N
    Jonassen, I
    Kahn, D
    Kanapin, A
    Karavidopoulou, Y
    Lopez, R
    Marx, B
    Mulder, NJ
    Oinn, TM
    Pagni, M
    Servant, F
    Sigrist, CJA
    Zdobnov, EM
    [J]. NUCLEIC ACIDS RESEARCH, 2001, 29 (01) : 37 - 40
  • [2] PRINTS-S: the database formerly known as PRINTS
    Attwood, TK
    Croning, MDR
    Flower, DR
    Lewis, AP
    Mabey, JE
    Scordis, P
    Selley, JN
    Wright, W
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 225 - 227
  • [3] Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkp985, 10.1093/nar/gkh121, 10.1093/nar/gkr1065]
  • [4] VIRAL MYB ONCOGENE ENCODES A SEQUENCE-SPECIFIC DNA-BINDING ACTIVITY
    BIEDENKAPP, H
    BORGMEYER, U
    SIPPEL, AE
    KLEMPNAUER, KH
    [J]. NATURE, 1988, 335 (6193) : 835 - 837
  • [5] High-throughput screening for protein-protein interactions using two-hybrid assay
    Cagney, G
    Uetz, P
    Fields, S
    [J]. APPLICATIONS OF CHIMERIC GENES AND HYBRID PROTEINS, PT C, 2000, 328 : 3 - 14
  • [6] CHAPMAN ER, 1994, J BIOL CHEM, V269, P27427
  • [7] CLEVELAND DW, 1985, ANNU REV BIOCHEM, V54, P331, DOI 10.1146/annurev.bi.54.070185.001555
  • [8] ProDom and ProDom-CG: tools for protein domain analysis and whole genome comparisons
    Corpet, F
    Servant, F
    Gouzy, J
    Kahn, D
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 267 - 269
  • [9] Conservation of gene order: a fingerprint of proteins that physically interact
    Dandekar, T
    Snel, B
    Huynen, M
    Bork, P
    [J]. TRENDS IN BIOCHEMICAL SCIENCES, 1998, 23 (09) : 324 - 328
  • [10] Protein function in the post-genomic era
    Eisenberg, D
    Marcotte, EM
    Xenarios, I
    Yeates, TO
    [J]. NATURE, 2000, 405 (6788) : 823 - 826