On the interpretation and interpretability of quantitative structure-activity relationship models

Cited by: 71
Author
Guha, Rajarshi [1]
Affiliation
[1] Indiana Univ, Sch Informat, Bloomington, IN 47408 USA
Keywords
Quantitative structure-activity relationship (QSAR); Interpretation; Linear regression; Partial least squares (PLS); Neural network
DOI
10.1007/s10822-008-9240-5
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
The goal of a quantitative structure-activity relationship (QSAR) model is to encode the relationship between molecular structure and biological activity or physical property. Based on this encoding, such models can be used for predictive purposes. Assuming the use of relevant and meaningful descriptors, and a statistically significant model, extraction of the encoded structure-activity relationships (SARs) can provide insight into what makes a molecule active or inactive. Such analyses by QSAR models are useful in a number of scenarios, such as suggesting structural modifications to enhance activity, explanation of outliers and exploratory analysis of novel SARs. In this paper we discuss the need for interpretation and give an overview of the factors that affect interpretability of QSAR models. We then describe interpretation protocols for different types of models, highlighting the different types of interpretations, ranging from very broad, global trends to very specific, case-by-case descriptions of the SAR, using examples from the training set. Finally, we discuss a number of case studies where workers have provided some form of interpretation of a QSAR model.
Pages: 857-871
Page count: 15