On the interpretation and interpretability of quantitative structure-activity relationship models

被引:71
作者
Guha, Rajarshi [1 ]
机构
[1] Indiana Univ, Sch Informat, Bloomington, IN 47408 USA
关键词
Quantitative structure-activity relationship (QSAR); Interpretation; Linear regression; Partial least squares (PLS); Neural network;
D O I
10.1007/s10822-008-9240-5
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The goal of a quantitative structure-activity relationship (QSAR) model is to encode the relationship between molecular structure and biological activity or physical property. Based on this encoding, such models can be used for predictive purposes. Assuming the use of relevant and meaningful descriptors, and a statistically significant model, extraction of the encoded structure-activity relationships (SARs) can provide insight into what makes a molecule active or inactive. Such analyses by QSAR models are useful in a number of scenarios, such as suggesting structural modifications to enhance activity, explanation of outliers and exploratory analysis of novel SARs. In this paper we discuss the need for interpretation and an overview of the factors that affect interpretability of QSAR models. We then describe interpretation protocols for different types of models, highlighting the different types of interpretations, ranging from very broad, global, trends to very specific, case-by-case, descriptions of the SAR, using examples from the training set. Finally, we discuss a number of case studies where workers have provide some form of interpretation of a QSAR model.
引用
收藏
页码:857 / 871
页数:15
相关论文
共 117 条
[11]   A virtual screening filter for identification of cytochrome P4502C9 (CYP2C9) inhibitors [J].
Byvatov, Evgeny ;
Baringhaus, Karl-Heinz ;
Schneider, Gisbert ;
Matter, Hans .
QSAR & COMBINATORIAL SCIENCE, 2007, 26 (05) :618-628
[12]   HOW SIMILAR IS A MOLECULE TO ANOTHER - AN ELECTRON-DENSITY MEASURE OF SIMILARITY BETWEEN 2 MOLECULAR-STRUCTURES [J].
CARBO, R ;
LEYDA, L ;
ARNAU, M .
INTERNATIONAL JOURNAL OF QUANTUM CHEMISTRY, 1980, 17 (06) :1185-1189
[13]   STRUCTURE MUSK ODOR RELATIONSHIPS FOR TETRALINS AND INDANS USING NEURAL NETWORKS (ON THE CONTRIBUTION OF DESCRIPTORS TO THE CLASSIFICATION) [J].
CHASTRETTE, M ;
ZAKARYA, D ;
PEYRAUD, JF .
EUROPEAN JOURNAL OF MEDICINAL CHEMISTRY, 1994, 29 (05) :343-348
[14]  
Chatterjee S., 1986, STAT SCI, V1, P379, DOI DOI 10.1214/SS/1177013622
[15]   Development of neural network QSPR models for Hansch substituent constants. 2. Applications in QSAR studies of HIV-1 reverse transcriptase and dihydrofolate reductase inhibitors [J].
Chin, TL ;
So, SS .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (01) :154-160
[16]   Nonlinear support vector machine visualization for risk factor analysis using nomograms and localized radial basis function kernels [J].
Cho, Baek Hwan ;
Yu, Hwanjo ;
Lee, Jongshill ;
Chee, Young Joon ;
Kim, In Young ;
Kim, Sun I. .
IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE, 2008, 12 (02) :247-256
[17]   A rapid computational filter for cytochrome P450 1A2 inhibition potential of compound libraries [J].
Chohan, KK ;
Paine, SW ;
Mistry, J ;
Barton, P ;
Davis, AM .
JOURNAL OF MEDICINAL CHEMISTRY, 2005, 48 (16) :5154-5161
[18]   Cheminformatic models to predict binding affinities to human serum albumin [J].
Colmenarejo, G ;
Alvarez-Pedraglio, A ;
Lavandera, JL .
JOURNAL OF MEDICINAL CHEMISTRY, 2001, 44 (25) :4370-4378
[19]   Structure/response correlations and similarity/diversity analysis by GETAWAY descriptors. 2. Application of the novel 3D molecular descriptors to QSAR/QSPR studies [J].
Consonni, V ;
Todeschini, R ;
Pavan, M ;
Gramatica, P .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (03) :693-705
[20]   Computational modeling tools for the design of potent antimalarial bisbenzamidines:: Overcoming the antimalarial potential of pentamidine [J].
Cruz-Monteagudo, Maykel ;
Borges, Fernanda ;
Perez Gonzalez, Maykel ;
Dias Soeiro Cordeiro, M. Natalia .
BIOORGANIC & MEDICINAL CHEMISTRY, 2007, 15 (15) :5322-5339