On the interpretation and interpretability of quantitative structure-activity relationship models

被引:71
作者
Guha, Rajarshi [1 ]
机构
[1] Indiana Univ, Sch Informat, Bloomington, IN 47408 USA
关键词
Quantitative structure-activity relationship (QSAR); Interpretation; Linear regression; Partial least squares (PLS); Neural network;
D O I
10.1007/s10822-008-9240-5
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The goal of a quantitative structure-activity relationship (QSAR) model is to encode the relationship between molecular structure and biological activity or physical property. Based on this encoding, such models can be used for predictive purposes. Assuming the use of relevant and meaningful descriptors, and a statistically significant model, extraction of the encoded structure-activity relationships (SARs) can provide insight into what makes a molecule active or inactive. Such analyses by QSAR models are useful in a number of scenarios, such as suggesting structural modifications to enhance activity, explanation of outliers and exploratory analysis of novel SARs. In this paper we discuss the need for interpretation and an overview of the factors that affect interpretability of QSAR models. We then describe interpretation protocols for different types of models, highlighting the different types of interpretations, ranging from very broad, global, trends to very specific, case-by-case, descriptions of the SAR, using examples from the training set. Finally, we discuss a number of case studies where workers have provide some form of interpretation of a QSAR model.
引用
收藏
页码:857 / 871
页数:15
相关论文
共 117 条
[91]  
SELASSIE CD, 1991, J MED CHEM, V34, P46
[92]   Empirical regioselectivity models for human cytochromes p450 3A4, 2D6, and 2C9 [J].
Sheridan, Robert P. ;
Korzekwa, Kenneth R. ;
Torres, Rhonda A. ;
Walker, Matthew J. .
JOURNAL OF MEDICINAL CHEMISTRY, 2007, 50 (14) :3173-3184
[93]   DEVELOPMENT AND USE OF CHARGED PARTIAL SURFACE-AREA STRUCTURAL DESCRIPTORS IN COMPUTER-ASSISTED QUANTITATIVE STRUCTURE PROPERTY RELATIONSHIP STUDIES [J].
STANTON, DT ;
JURS, PC .
ANALYTICAL CHEMISTRY, 1990, 62 (21) :2323-2329
[94]   Development and use of hydrophobic surface area (HSA) descriptors for computer-assisted quantitative structure-activity and structure-property relationship studies [J].
Stanton, DT ;
Mattioni, BE ;
Knittel, JJ ;
Jurs, PC .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (03) :1010-1023
[95]   On the physical interpretation of QSAR models [J].
Stanton, DT .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2003, 43 (05) :1423-1433
[96]   A 2.13 Å structure of E-coli dihydrofolate reductase bound to a novel competitive inhibitor reveals a new binding surface involving the M20 loop region [J].
Summerfield, Rachael L. ;
Daigle, Denis M. ;
Mayer, Stanislas ;
Mallik, Debasis ;
Hughes, Donald W. ;
Jackson, Sean G. ;
Sulek, Margaret ;
Organ, Michael G. ;
Brown, Eric D. ;
Junop, Murray S. .
JOURNAL OF MEDICINAL CHEMISTRY, 2006, 49 (24) :6977-6986
[97]   AUTOMATED DESCRIPTOR SELECTION FOR QUANTITATIVE STRUCTURE-ACTIVITY-RELATIONSHIPS USING GENERALIZED SIMULATED ANNEALING [J].
SUTTER, JM ;
DIXON, SL ;
JURS, PC .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1995, 35 (01) :77-84
[98]   Symbolic interpretation of artificial neural networks [J].
Taha, IA ;
Ghosh, J .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1999, 11 (03) :448-463
[99]  
TAKAHASHI T, 1991, NEURAL NETWORKS, V2, P645
[100]   Radial basis function network-based transform for a nonlinear support vector machine as optimized by a particle swarm optimization algorithm with application to QSAR studies [J].
Tang, Li-Juan ;
Zhou, Yan-Ping ;
Jiang, Jian-Hui ;
Zou, Hong-Yan ;
Wu, Hai-Long ;
Shen, Guo-Li ;
Yu, Ru-Qin .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2007, 47 (04) :1438-1445