Statistical prediction of protein-chemical interactions based on chemical structure and mass spectrometry data

被引:81
作者
Nagamine, Nobuyoshi [1 ]
Sakakibara, Yasuburni [1 ]
机构
[1] Keio Univ, Dept Biosci & Informat, Kohoku Ku, Yokohama, Kanagawa 2238522, Japan
基金
日本科学技术振兴机构;
关键词
D O I
10.1093/bioinformatics/btm266
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Prediction of interactions between proteins and chemical compounds is of great benefit in drug discovery processes. In this field, 3D structure-based methods such as docking analysis have been developed. However, the genomewide application of these methods is not really feasible as 3D structural information is limited in availability. Results: We describe a novel method for predicting protein-chemical interaction using SVM. We utilize very general protein data, i.e. amino acid sequences, and combine these with chemical structures and mass spectrometry (MS) data. MS data can be of great use in finding new chemical compounds in the future. We assessed the validity of our method in the dataset of the binding of existing drugs and found that more than 80% accuracy could be obtained. Furthermore, we conducted comprehensive target protein predictions for MDMA, and validated the biological significance of our method by successfully finding proteins relevant to its known functions.
引用
收藏
页码:2004 / 2012
页数:9
相关论文
共 37 条
[1]   Determination of N-glycosylation sites and site heterogeneity in glycoproteins [J].
An, HJ ;
Peavy, TR ;
Hedrick, JL ;
Lebrilla, CB .
ANALYTICAL CHEMISTRY, 2003, 75 (20) :5628-5637
[2]  
Apweiler R, 2004, NUCLEIC ACIDS RES, V32, pD115, DOI [10.1093/nar/gkh131, 10.1093/nar/gkw1099]
[3]   GPCRpred: an SVM-based method for prediction of families and subfamilies of G-protein coupled receptors [J].
Bhasin, M ;
Raghava, GPS .
NUCLEIC ACIDS RESEARCH, 2004, 32 :W383-W389
[4]   Predicting protein-protein interactions from primary structure [J].
Bock, JR ;
Gough, DA .
BIOINFORMATICS, 2001, 17 (05) :455-460
[5]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[6]   Generalized fragment-substructure based property prediction method [J].
Clark, M .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2005, 45 (01) :30-38
[7]   BIAS REDUCTION OF MAXIMUM-LIKELIHOOD-ESTIMATES [J].
FIRTH, D .
BIOMETRIKA, 1993, 80 (01) :27-38
[8]   Learning to predict protein-protein interactions from protein sequences [J].
Gomez, SM ;
Noble, WS ;
Rzhetsky, A .
BIOINFORMATICS, 2003, 19 (15) :1875-1881
[9]  
Hosmer D, 2000, Applied Logistic Regression, V2nd, DOI DOI 10.1002/0471722146.CH4
[10]   Development and validation of a genetic algorithm for flexible docking [J].
Jones, G ;
Willett, P ;
Glen, RC ;
Leach, AR ;
Taylor, R .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 267 (03) :727-748