Predicting protein-protein interactions from sequences in a hybridization space

被引:143
作者
Chou, KC
Cai, YD
机构
[1] Gordon Life Sci Inst, San Diego, CA 92130 USA
[2] Shanghai Univ, Coll Sci, Dept Chem, Shanghai 200436, Peoples R China
[3] UMIST, Dept Biomol Sci, Manchester M60 1QD, Lancs, England
关键词
genomic scale; gene ontology; pseudo-amino acid composition; ISort predictor; GO-PseAA fusion classifier; network biology; yeast;
D O I
10.1021/pr050331g
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
To understand the networks in living cells, it is indispensably important to identify protein-protein interactions on a genomic scale. Unfortunately, it is both time-consuming and expensive to do so solely based on experiments due to the nature of the problem whose complexity is obviously overwhelming, just like the fact that "life is complicated". Therefore, developing computational techniques for predicting protein-protein interactions would be of significant value in this regard. By fusing the approach based on the gene ontology and the approach of pseudo-amino acid composition, a predictor called "GO-PseAA" predictor was established to deal with this problem. As a showcase, prediction was performed on 6323 protein pairs from yeast. To avoid redundancy and homology bias, none of the protein pairs investigated has >= 40% sequence identity with any other. The overall success rate obtained by jackknife cross-validation was 81.6%, indicating the GO-PseAA predictor is very promising for predicting protein-protein interactions from protein sequences, and might become a useful vehicle for studying the network biology in the postgenomic era.
引用
收藏
页码:316 / 322
页数:7
相关论文
共 32 条
[1]   The InterPro database, an integrated documentation resource for protein families, domains and functional sites [J].
Apweiler, R ;
Attwood, TK ;
Bairoch, A ;
Bateman, A ;
Birney, E ;
Biswas, M ;
Bucher, P ;
Cerutti, T ;
Corpet, F ;
Croning, MDR ;
Durbin, R ;
Falquet, L ;
Fleischmann, W ;
Gouzy, J ;
Hermjakob, H ;
Hulo, N ;
Jonassen, I ;
Kahn, D ;
Kanapin, A ;
Karavidopoulou, Y ;
Lopez, R ;
Marx, B ;
Mulder, NJ ;
Oinn, TM ;
Pagni, M ;
Servant, F ;
Sigrist, CJA ;
Zdobnov, EM .
NUCLEIC ACIDS RESEARCH, 2001, 29 (01) :37-40
[2]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[3]   Solution structure of BID, an intracellular amplifier of apoptotic signaling [J].
Chou, JJ ;
Li, HL ;
Salvesen, GS ;
Yuan, JY ;
Wagner, G .
CELL, 1999, 96 (05) :615-624
[4]   Using functional domain composition and support vector machines for prediction of protein subcellular location [J].
Chou, KC ;
Cai, YD .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2002, 277 (48) :45765-45769
[5]   Protein subcellular location prediction [J].
Chou, KC ;
Elrod, DW .
PROTEIN ENGINEERING, 1999, 12 (02) :107-118
[6]   PREDICTION OF PROTEIN STRUCTURAL CLASSES [J].
CHOU, KC ;
ZHANG, CT .
CRITICAL REVIEWS IN BIOCHEMISTRY AND MOLECULAR BIOLOGY, 1995, 30 (04) :275-349
[7]   A NOVEL-APPROACH TO PREDICTING PROTEIN STRUCTURAL CLASSES IN A (20-1)-D AMINO-ACID-COMPOSITION SPACE [J].
CHOU, KC .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1995, 21 (04) :319-344
[8]  
CHOU KC, 1994, J BIOL CHEM, V269, P22014
[9]   Prediction of protein cellular attributes using pseudo-amino acid composition [J].
Chou, KC .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 2001, 43 (03) :246-255
[10]   Using pair-coupled amino acid composition to predict protein secondary structure content [J].
Chou, KC .
JOURNAL OF PROTEIN CHEMISTRY, 1999, 18 (04) :473-480