Drug design by machine learning: support vector machines for pharmaceutical data analysis

被引:562
作者
Burbidge, R [1 ]
Trotter, M [1 ]
Buxton, B [1 ]
Holden, S [1 ]
机构
[1] UCL, London WC1E 6BT, England
来源
COMPUTERS & CHEMISTRY | 2001年 / 26卷 / 01期
基金
英国工程与自然科学研究理事会;
关键词
machine learning; support vector machines; model selection; structure-activity relationship analysis; cheminformatics;
D O I
10.1016/S0097-8485(01)00094-8
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
We show that the support vector machine (SVM) classification algorithm, a recent development from the machine learning community, proves its potential for structure-activity relationship analysis. In a benchmark test, the SVM is compared to several machine learning techniques currently used in the field. The classification task involves predicting the inhibition of dihydrofolate reductase by pyrimidines, using data obtained from the UCI machine learning repository. Three artificial neural networks, a radial basis function network, and a C5.0 decision tree are all outperformed by the SVM. The SVM is significantly better than all of these, bar a manually capacity-controlled neural network, which takes considerably longer to train. (C) 2001 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:5 / 14
页数:10
相关论文
共 32 条
[1]  
[Anonymous], 1989, GENETIC ALGORITHM SE
[2]  
[Anonymous], 1997, 1602 AI MIT
[3]  
Bishop C. M., 1995, NEURAL NETWORKS PATT
[4]  
Blake C.L., 1998, UCI repository of machine learning databases
[5]  
Breiman L., 1984, BIOMETRICS, DOI DOI 10.2307/2530946
[6]   Knowledge-based analysis of microarray gene expression data by using support vector machines [J].
Brown, MPS ;
Grundy, WN ;
Lin, D ;
Cristianini, N ;
Sugnet, CW ;
Furey, TS ;
Ares, M ;
Haussler, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (01) :262-267
[7]   A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167
[8]  
Cristianini N., 2000, INTRO SUPPORT VECTOR, DOI [10.1017/CBO9780511801389, DOI 10.1017/CBO9780511801389]
[9]  
DEVILLERS J, 1999, GENETIC ALGORITHMS M
[10]  
DEVILLERS J, 1999, NEURAL NETWORKS DRUG