Evolutionary Computation and QSAR Research

被引:23
作者
Aguiar-Pulido, Vanessa [1 ]
Gestal, Marcos [1 ]
Cruz-Monteagudo, Maykel [2 ,3 ,4 ]
Rabunal, Juan R. [1 ]
Dorado, Julian [1 ]
Munteanu, Cristian R. [1 ]
机构
[1] Univ A Coruna, Dept Informat & Commun Technol, Fac Comp Sci, La Coruna 15071, Spain
[2] Univ Porto, Fac Sci, Dept Chem & Biochem, CIQ, P-4169007 Oporto, Portugal
[3] Univ Porto, Dept Chem & Biochem, REQUIMTE, Fac Sci, P-4169007 Oporto, Portugal
[4] Univ Cent Marta Abreu Las Villas, Fac Quim & Farm, CEQA, Santa Clara 54830, Cuba
关键词
Evolutionary computation; feature extraction; genetic algorithms; genetic programming; molecular descriptors; quantitative structure-activity relationships; QSAR; variable selection; PARTICLE SWARM OPTIMIZATION; JOINT FEATURE-SELECTION; 2-D GRAPHICAL REPRESENTATION; MULTIPLE LINEAR-REGRESSION; ARTIFICIAL NEURAL-NETWORKS; SUPPORT VECTOR MACHINE; VARIABLE SELECTION; GENETIC ALGORITHM; MULTIOBJECTIVE OPTIMIZATION; TOPOLOGICAL INDEXES;
D O I
10.2174/1573409911309020006
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
The successful high throughput screening of molecule libraries for a specific biological property is one of the main improvements in drug discovery. The virtual molecular filtering and screening relies greatly on quantitative structure-activity relationship (QSAR) analysis, a mathematical model that correlates the activity of a molecule with molecular descriptors. QSAR models have the potential to reduce the costly failure of drug candidates in advanced (clinical) stages by filtering combinatorial libraries, eliminating candidates with a predicted toxic effect and poor pharmacokinetic profiles, and reducing the number of experiments. To obtain a predictive and reliable QSAR model, scientists use methods from various fields such as molecular modeling, pattern recognition, machine learning or artificial intelligence. QSAR modeling relies on three main steps: molecular structure codification into molecular descriptors, selection of relevant variables in the context of the analyzed activity, and search of the optimal mathematical model that correlates the molecular descriptors with a specific activity. Since a variety of techniques from statistics and artificial intelligence can aid variable selection and model building steps, this review focuses on the evolutionary computation methods supporting these tasks. Thus, this review explains the basic of the genetic algorithms and genetic programming as evolutionary computation approaches, the selection methods for high-dimensional data in QSAR, the methods to build QSAR models, the current evolutionary feature selection methods and applications in QSAR and the future trend on the joint or multi-task feature selection methods.
引用
收藏
页码:206 / 225
页数:20
相关论文
共 204 条