Evolutionary Computation and QSAR Research

被引:23
作者
Aguiar-Pulido, Vanessa [1 ]
Gestal, Marcos [1 ]
Cruz-Monteagudo, Maykel [2 ,3 ,4 ]
Rabunal, Juan R. [1 ]
Dorado, Julian [1 ]
Munteanu, Cristian R. [1 ]
机构
[1] Univ A Coruna, Dept Informat & Commun Technol, Fac Comp Sci, La Coruna 15071, Spain
[2] Univ Porto, Fac Sci, Dept Chem & Biochem, CIQ, P-4169007 Oporto, Portugal
[3] Univ Porto, Dept Chem & Biochem, REQUIMTE, Fac Sci, P-4169007 Oporto, Portugal
[4] Univ Cent Marta Abreu Las Villas, Fac Quim & Farm, CEQA, Santa Clara 54830, Cuba
关键词
Evolutionary computation; feature extraction; genetic algorithms; genetic programming; molecular descriptors; quantitative structure-activity relationships; QSAR; variable selection; PARTICLE SWARM OPTIMIZATION; JOINT FEATURE-SELECTION; 2-D GRAPHICAL REPRESENTATION; MULTIPLE LINEAR-REGRESSION; ARTIFICIAL NEURAL-NETWORKS; SUPPORT VECTOR MACHINE; VARIABLE SELECTION; GENETIC ALGORITHM; MULTIOBJECTIVE OPTIMIZATION; TOPOLOGICAL INDEXES;
D O I
10.2174/1573409911309020006
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
The successful high throughput screening of molecule libraries for a specific biological property is one of the main improvements in drug discovery. The virtual molecular filtering and screening relies greatly on quantitative structure-activity relationship (QSAR) analysis, a mathematical model that correlates the activity of a molecule with molecular descriptors. QSAR models have the potential to reduce the costly failure of drug candidates in advanced (clinical) stages by filtering combinatorial libraries, eliminating candidates with a predicted toxic effect and poor pharmacokinetic profiles, and reducing the number of experiments. To obtain a predictive and reliable QSAR model, scientists use methods from various fields such as molecular modeling, pattern recognition, machine learning or artificial intelligence. QSAR modeling relies on three main steps: molecular structure codification into molecular descriptors, selection of relevant variables in the context of the analyzed activity, and search of the optimal mathematical model that correlates the molecular descriptors with a specific activity. Since a variety of techniques from statistics and artificial intelligence can aid variable selection and model building steps, this review focuses on the evolutionary computation methods supporting these tasks. Thus, this review explains the basic of the genetic algorithms and genetic programming as evolutionary computation approaches, the selection methods for high-dimensional data in QSAR, the methods to build QSAR models, the current evolutionary feature selection methods and applications in QSAR and the future trend on the joint or multi-task feature selection methods.
引用
收藏
页码:206 / 225
页数:20
相关论文
共 204 条
  • [1] Abraham A., 2005, Handb. Meas. Syst. Des, P272, DOI [DOI 10.1002/0471497398.MM423, 10.1002/0471497398.mm423]
  • [2] A novel simple QSAR model for the prediction of anti-HIV activity using multiple linear regression analysis
    Afantitis, Antreas
    Melagraki, Georgia
    Sarimveis, Haralambos
    Koutentis, Panayiotis A.
    Markopoulos, John
    Igglessi-Markopoulou, Olga
    [J]. MOLECULAR DIVERSITY, 2006, 10 (03) : 405 - 414
  • [3] Feature selection for structure-activity correlation using binary particle swarms
    Agrafiotis, DK
    Cedeño, W
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 2002, 45 (05) : 1098 - 1107
  • [4] [Anonymous], INTERNET ELECT J MOL
  • [5] [Anonymous], 2003, Internet Electron. J. Mol. Des
  • [6] Genetic programming for QSAR investigation of docking energy
    Archetti, Francesco
    Giordani, Ilaria
    Vanneschi, Leonardo
    [J]. APPLIED SOFT COMPUTING, 2010, 10 (01) : 170 - 182
  • [7] QSAR Models for CXCR2 Receptor Antagonists Based on the Genetic Algorithm for Data Preprocessing Prior to Application of the PLS Linear Regression Method and Design of the New Compounds Using In Silico Virtual Screening
    Asadollahi, Tahereh
    Dadfarnia, Shayessteh
    Shabani, Ali Mohammad Haji
    Ghasemi, Jahan B.
    Sarkhosh, Maryam
    [J]. MOLECULES, 2011, 16 (03): : 1928 - 1955
  • [8] Four new topological indices based on the molecular path code
    Balaban, Alexandru T.
    Beteringhe, Adrian
    Constantinescu, Titus
    Filip, Petru A.
    Ivanciuc, Ovidiu
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2007, 47 (03) : 716 - 731
  • [9] HIGHLY DISCRIMINATING DISTANCE-BASED TOPOLOGICAL INDEX
    BALABAN, AT
    [J]. CHEMICAL PHYSICS LETTERS, 1982, 89 (05) : 399 - 404
  • [10] Chemical fragment generation and clustering software
    Barnard, JM
    Downs, GM
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1997, 37 (01): : 141 - 142