Toward an optimal procedure for variable selection and QSAR model building

被引:164
作者
Yasri, A [1 ]
Hartsough, D [1 ]
机构
[1] ArQule Inc, Computat Design Grp, Woburn, MA 01801 USA
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2001年 / 41卷 / 05期
关键词
D O I
10.1021/ci010291a
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In this work, we report the development of a novel QSAR technique combining genetic algorithms and neural networks for selecting a subset of relevant descriptors and building the optimal neural network architecture for QSAR studies. This technique uses a neural network to map the dependent property of interest with the descriptors preselected by the genetic algorithm. This technique differs from other variable selection techniques combining genetic algorithms to neural networks by two main features: (1) The variable selection search performed by the genetic algorithm is not constrained to a defined number of descriptors. (2) The optimal neural network architecture is explored in parallel with the variable selection by dynamically modifying the size of the hidden layer. By using both artificial data and real biological data, we show that this technique can be used to build both classification and regression models and outperforms simpler variable selection techniques mainly for nonlinear data sets. The results obtained on real data are compared to previous work using other modeling techniques. We also discuss some important issues in building QSAR models and good practices for QSAR studies.
引用
收藏
页码:1218 / 1227
页数:10
相关论文
共 57 条
[11]   COMPARATIVE MOLECULAR-FIELD ANALYSIS (COMFA) .1. EFFECT OF SHAPE ON BINDING OF STEROIDS TO CARRIER PROTEINS [J].
CRAMER, RD ;
PATTERSON, DE ;
BUNCE, JD .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 1988, 110 (18) :5959-5967
[12]  
ERB RJ, 1992, PHARM RES, V9, P293
[13]  
Exner O., 1972, ADV LINEAR FREE ENER, P1
[14]  
FAHLMAN SE, 1988, FASTER LEARNING VARI
[15]   Blood-brain barrier permeation: Molecular parameters governing passive diffusion [J].
Fischer, H ;
Gottschlich, R ;
Seelig, A .
JOURNAL OF MEMBRANE BIOLOGY, 1998, 165 (03) :201-211
[16]  
GHOSE AK, 1990, MOL PHARMACOL, V37, P725
[17]  
Goldberg D. E., 1989, GENETIC ALGORITHMS S
[18]   STUDY OF BENZODIAZEPINES RECEPTOR-SITES USING A COMBINED QSAR-COMFA APPROACH [J].
GRECO, G ;
NOVELLINO, E ;
SILIPO, C ;
VITTORIA, A .
QUANTITATIVE STRUCTURE-ACTIVITY RELATIONSHIPS, 1992, 11 (04) :461-477
[19]  
Haefely W., 1985, ADV DRUG RES, V14, P165
[20]  
Hall L. H., 1991, Reviews in Computational Chemistry, P367, DOI [10.1002/9780470125793.ch9, DOI 10.1002/9780470125793.CH9]