Application of genetic algorithm-kernel partial least square as a novel nonlinear feature selection method: Activity of carbonic anhydrase II inhibitors

被引:55
作者
Jalah-Heravi, Mehdi [1 ]
Kyani, Anahita [1 ]
机构
[1] Sharif Univ Technol, Dept Chem, Tehran, Iran
关键词
carbonic anhydrase II inhibitors; genetic algorithm-kernel partial least square; QSAR; artificial neural networks;
D O I
10.1016/j.ejmech.2006.12.020
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
This paper introduces the genetic algorithm-kernel partial least square (GA-KPLS), as a novel nonlinear feature selection method. This technique combines genetic algorithms (GAs) as powerful optimization methods with KPLS as a robust nonlinear statistical method for variable selection. This feature selection method is combined with artificial neural network to develop a nonlinear QSAR model for predicting activities of a series of substituted aromatic sulfonamides as carbonic anhydrase II (CA II) inhibitors. Eight simple one- and two-dimensional descriptors were selected by GA-KPLS and considered as inputs for developing artificial neural networks (ANNs). These parameters represent the role of acceptor-donor pair, hydrogen bonding, hydrosolubility and lipophilicity of the active sites and also the size of the inhibitors on inhibitor-isozyme interaction. The accuracy of 8-4-1 networks was illustrated by validation techniques of leave-one-out (LOO) and leave-multiple-out (LMO) cross-validations and Y-randomization. Superiority of this method (GA-KPLS-ANN) over the linear one (MLR) in a previous work and also the GA-PLS-ANN in which a linear feature selection method has been used indicates that the GA-KPLS approach is a powerful method for the variable selection in nonlinear systems. (c) 2007 Elsevier Masson SAS. All rights reserved.
引用
收藏
页码:649 / 659
页数:11
相关论文
共 42 条