Classifying "kinase inhibitor-likeness" by using machine-learning methods

被引:49
作者
Briem, H [1 ]
Günther, J [1 ]
机构
[1] Schering AG, Res Ctr Europe, CDCC Computat Chem, D-13342 Berlin, Germany
关键词
computer chemistry; drug design; inhibitors; kinases; machine learning;
D O I
10.1002/cbic.200400109
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
By using an in-house data set of small-molecule structures, encoded by Ghose-Crippen parameters, several machine learning techniques were applied to distinguish between kinase inhibitors and other molecules with no reported activity on any protein kinase. All four approaches pursued-support-vector machines (SVM), artificial neural networks (ANN), k nearest neighbor classification with GA-optimized feature selection (GAANN), and recursive partitioning (RP)-proved capable of providing a reasonable discrimination. Nevertheless, substantial differences in performance among the methods were observed. For all techniques tested, the use of a consensus vote of the 13 different models derived improved the quality of the predictions in terms of accuracy, precision, recall, and F1 value. Support-vector machines, followed by the GA/kNN combination, outperformed the other techniques when comparing the average of individual models. By using the respective majority votes, the prediction of neural networks yielded the highest F1 value, followed by SVMs.
引用
收藏
页码:558 / 566
页数:9
相关论文
共 40 条
[1]   Can we learn to distinguish between "drug-like" and "nondrug-like" molecules? [J].
Ajay ;
Walters, WP ;
Murcko, MA .
JOURNAL OF MEDICINAL CHEMISTRY, 1998, 41 (18) :3314-3324
[2]  
[Anonymous], 1994, SIGIR
[3]   Hit and lead generation:: Beyond high-throughput screening [J].
Bleicher, KH ;
Böhm, HJ ;
Müller, K ;
Alanine, AI .
NATURE REVIEWS DRUG DISCOVERY, 2003, 2 (05) :369-378
[4]   Drug design by machine learning: support vector machines for pharmaceutical data analysis [J].
Burbidge, R ;
Trotter, M ;
Buxton, B ;
Holden, S .
COMPUTERS & CHEMISTRY, 2001, 26 (01) :5-14
[5]   Comparison of support vector machine and artificial neural network systems for drug/nondrug classification [J].
Byvatov, E ;
Fechner, U ;
Sadowski, J ;
Schneider, G .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2003, 43 (06) :1882-1889
[6]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[7]   Sequence and structure classification of kinases [J].
Cheek, S ;
Zhang, H ;
Grishin, NV .
JOURNAL OF MOLECULAR BIOLOGY, 2002, 320 (04) :855-881
[8]   Recent kinase and kinase inhibitor X-ray structures: Mechanisms of inhibition and selectivity insights [J].
Cherry, M ;
Williams, DH .
CURRENT MEDICINAL CHEMISTRY, 2004, 11 (06) :663-673
[9]   Protein kinases - the major drug targets of the twenty-first century? [J].
Cohen, P .
NATURE REVIEWS DRUG DISCOVERY, 2002, 1 (04) :309-315
[10]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411