Toward an optimal procedure for variable selection and QSAR model building

Cited by: 164
Authors
Yasri, A [1]
Hartsough, D [1]
Affiliation
[1] ArQule Inc, Computat Design Grp, Woburn, MA 01801 USA
Source
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2001 / Vol. 41 / Issue 05
DOI
10.1021/ci010291a
Chinese Library Classification number
O6 [Chemistry];
Discipline classification code
0703;
Abstract
In this work, we report the development of a novel QSAR technique that combines genetic algorithms and neural networks to select a subset of relevant descriptors and to build the optimal neural network architecture for QSAR studies. The technique uses a neural network to map the descriptors preselected by the genetic algorithm to the dependent property of interest. It differs from other variable selection techniques that combine genetic algorithms with neural networks in two main features: (1) the variable selection search performed by the genetic algorithm is not constrained to a defined number of descriptors; (2) the optimal neural network architecture is explored in parallel with the variable selection by dynamically modifying the size of the hidden layer. Using both artificial data and real biological data, we show that this technique can be used to build both classification and regression models and that it outperforms simpler variable selection techniques, mainly for nonlinear data sets. The results obtained on real data are compared to previous work using other modeling techniques. We also discuss some important issues in building QSAR models and good practices for QSAR studies.
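The two distinguishing features named in the abstract (a descriptor subset of unconstrained size, and a hidden-layer size explored alongside it) can be illustrated with a small genetic search. The following is only a minimal sketch under stated assumptions, not the authors' implementation: the `Candidate` encoding, the operators, all parameter values, and `toy_fitness` are hypothetical, and `toy_fitness` is a stand-in that does not train a neural network.

```python
import random
from dataclasses import dataclass
from typing import Callable, Tuple


@dataclass(frozen=True)
class Candidate:
    """Hypothetical encoding: one bit per descriptor plus a hidden-layer size."""
    mask: Tuple[bool, ...]  # any number of descriptors may be switched on
    hidden: int             # number of hidden units, evolved with the mask


def random_candidate(n_desc: int, max_hidden: int, rng: random.Random) -> Candidate:
    mask = tuple(rng.random() < 0.5 for _ in range(n_desc))
    return Candidate(mask, rng.randint(1, max_hidden))


def mutate(c: Candidate, max_hidden: int, rng: random.Random,
           p_flip: float = 0.1) -> Candidate:
    # Flip descriptor bits independently, so the subset size is free to change.
    mask = tuple((not b) if rng.random() < p_flip else b for b in c.mask)
    hidden = c.hidden
    if rng.random() < 0.3:
        # Occasionally grow or shrink the hidden layer by one unit.
        hidden = min(max_hidden, max(1, hidden + rng.choice((-1, 1))))
    return Candidate(mask, hidden)


def crossover(a: Candidate, b: Candidate, rng: random.Random) -> Candidate:
    # Uniform crossover on the mask; hidden size inherited from either parent.
    mask = tuple(x if rng.random() < 0.5 else y for x, y in zip(a.mask, b.mask))
    return Candidate(mask, rng.choice((a.hidden, b.hidden)))


def evolve(fitness: Callable[[Candidate], float], n_desc: int,
           max_hidden: int = 8, pop_size: int = 30,
           generations: int = 40, seed: int = 0) -> Candidate:
    rng = random.Random(seed)
    pop = [random_candidate(n_desc, max_hidden, rng) for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        elite = pop[: pop_size // 2]  # the better half survives unchanged
        children = [
            mutate(crossover(rng.choice(elite), rng.choice(elite), rng),
                   max_hidden, rng)
            for _ in range(pop_size - len(elite))
        ]
        pop = elite + children
    return max(pop, key=fitness)


def toy_fitness(c: Candidate) -> float:
    # ASSUMPTION: stand-in score only. It pretends descriptors 0, 3 and 5 are
    # the relevant ones and that 3 hidden units is ideal.
    relevant = {0, 3, 5}
    chosen = {i for i, on in enumerate(c.mask) if on}
    return (len(chosen & relevant)
            - 0.5 * len(chosen - relevant)
            - 0.1 * abs(c.hidden - 3))


if __name__ == "__main__":
    best = evolve(toy_fitness, n_desc=10)
    print(best, toy_fitness(best))
```

In an actual QSAR setting, the fitness function would presumably train a network with `c.hidden` hidden units on the selected descriptors and return a validation score; that substitution is an assumption here and is not specified in the abstract.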
Pages: 1218-1227
Page count: 10