In Silico Prediction of Human Protein Interactions Using Fuzzy-SVM Mixture Models and Its Application to Cancer Research

被引:8
作者
Chiang, Jung-Hsien [1 ,2 ]
Lee, Tsung-Lu Michael [1 ,2 ]
机构
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 701, Taiwan
[2] Inst Syst Biol, Seattle, WA 98103 USA
关键词
Fuzzy modeling; fuzzy-SVM mixture models (FSMMs); mixture models; support vector machines (SVMs);
D O I
10.1109/TFUZZ.2007.914041
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Proteomics technologies and bioinformatics tools have been widely used to analyze protein-protein interactions of complex biological systems, which are essential for understanding the mechanisms of human and cancer biology. Although many studies have tackled the problem of high-throughput protein-protein interaction identifications in Saccharomyces cerevisiae, Caenorhabditis elegans, and Drosophila melanogaster, the effort to predict human and cancer-related protein-protein interaction is still limited. Moreover, low consistency and high false positive rates are major drawbacks of these high-throughput methods. In this research, the focus is on predicting human cancer-related protein-protein interaction and reducing false positive rates with integrated classifiers. We propose a hybrid machine learning system by merging fuzzy multiset-based classifiers and support vector machines (SVMs) into fuzzy-SVM mixture models (FSMMs). Our experimental result of the FSMMs approach achieves consistent prediction accuracy on human protein-protein interactions (PPIs) with an receiver operating curve score of 0.965 that outperforms other models. Overall, prediction results on cancer-related protein pairs indicate that our proposed system is effective for identifying both known and novel PPIs to assist cancer research in discovering novel interactions.
引用
收藏
页码:1087 / 1095
页数:9
相关论文
共 53 条
[1]  
Bader GD, 2003, NUCLEIC ACIDS RES, V31, P248, DOI 10.1093/nar/gkg056
[2]   Analyzing yeast protein-protein interaction data obtained from different sources [J].
Bader, GD ;
Hogue, CWV .
NATURE BIOTECHNOLOGY, 2002, 20 (10) :991-997
[3]   The universal protein resource (UniProt) [J].
Bairoch, Amos ;
Bougueleret, Lydie ;
Altairac, Severine ;
Amendolia, Valeria ;
Auchincloss, Andrea ;
Puy, Ghislaine Argoud ;
Axelsen, Kristian ;
Baratin, Delphine ;
Blatter, Marie-Claude ;
Boeckmann, Brigitte ;
Bollondi, Laurent ;
Boutet, Emmanuel ;
Quintaje, Silvia Braconi ;
Breuza, Lionel ;
Bridge, Alan ;
deCastro, Edouard ;
Coral, Danielle ;
Coudert, Elisabeth ;
Cusin, Isabelle ;
Dobrokhotov, Pavel ;
Dornevil, Dolnide ;
Duvaud, Severine ;
Estreicher, Anne ;
Famiglietti, Livia ;
Feuermann, Marc ;
Gehant, Sebastian ;
Farriol-Mathis, Nathalie ;
Ferro, Serenella ;
Gasteiger, Elisabeth ;
Gateau, Alain ;
Gerritsen, Vivienne ;
Gos, Arnaud ;
Gruaz-Gumowski, Nadine ;
Hinz, Ursula ;
Hulo, Chantal ;
Hulo, Nicolas ;
Ioannidis, Vassilios ;
Ivanyi, Ivan ;
James, Janet ;
Jain, Eric ;
Jimenez, Silvia ;
Jungo, Florence ;
Junker, Vivien ;
Keller, Guillaume ;
Lachaize, Corinne ;
Lane-Guermonprez, Lydie ;
Langendijk-Genevaux, Petra ;
Lara, Vicente ;
Lemercier, Philippe ;
Le Saux, Virginie .
NUCLEIC ACIDS RESEARCH, 2007, 35 :D193-D197
[4]  
Bateman A, 2002, NUCLEIC ACIDS RES, V30, P276, DOI [10.1093/nar/gkr1065, 10.1093/nar/gkp985, 10.1093/nar/gkh121]
[5]   Choosing negative examples for the prediction of protein-protein interactions [J].
Ben-Hur, A ;
Noble, WS .
BMC BIOINFORMATICS, 2006, 7 (Suppl 1)
[6]   Kernel methods for predicting protein-protein interactions [J].
Ben-Hur, A ;
Noble, WS .
BIOINFORMATICS, 2005, 21 :I38-I46
[7]   Predicting protein-protein interactions from primary structure [J].
Bock, JR ;
Gough, DA .
BIOINFORMATICS, 2001, 17 (05) :455-460
[8]  
Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
[9]   Online predicted human interaction database [J].
Brown, KR ;
Jurisica, I .
BIOINFORMATICS, 2005, 21 (09) :2076-2082
[10]   A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167