Nonparametric regression applied to quantitative structure-activity relationships

Cited by: 22
Authors
Constans, P [1]
Hirst, JD [1]
Affiliations
[1] Scripps Res Inst, Dept Mol Biol, La Jolla, CA 92037 USA
Source
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2000 / Vol. 40 / Issue 02
DOI
10.1021/ci990082e
Chinese Library Classification number
O6 [Chemistry];
Discipline classification code
0703;
Abstract
Several nonparametric regressors have been applied to modeling quantitative structure-activity relationship (QSAR) data. The simplest regressor, the Nadaraya-Watson, was assessed in a genuine multivariate setting. Other regressors, the local linear and the shifted Nadaraya-Watson, were implemented within additive models, a computationally more expedient approach that is better suited to low-density designs. Performances were benchmarked against the nonlinear method of smoothing splines. A linear reference point was provided by multilinear regression (MLR). Variable selection was explored using systematic combinations of different variables and combinations of principal components. For the data set examined, 47 inhibitors of dopamine beta-hydroxylase, the additive nonparametric regressors have greater predictive accuracy (as measured by the mean absolute error of the predictions or the Pearson correlation in cross-validation trials) than MLR. The use of principal components did not improve the performance of the nonparametric regressors over use of the original descriptors, since the original descriptors are not strongly correlated. It remains to be seen whether the nonparametric regressors can be successfully coupled with better variable selection and dimensionality reduction in the context of high-dimensional QSARs.
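As an illustration of the simplest regressor named in the abstract, the following is a minimal sketch of a multivariate Nadaraya-Watson estimator scored by leave-one-out mean absolute error. It is not the authors' implementation: the Gaussian kernel, the single shared bandwidth, the function names `nadaraya_watson` and `loo_mean_absolute_error`, and the synthetic two-descriptor data (47 points, chosen only to match the size of the data set mentioned) are all assumptions made for this example.

```python
import numpy as np


def nadaraya_watson(X_train, y_train, X_query, bandwidth):
    """Kernel-weighted average of y_train at each row of X_query.

    Assumes a Gaussian kernel with one bandwidth shared by all descriptors.
    X_train and X_query are 2-D arrays of shape (n_points, n_descriptors).
    """
    X_train = np.asarray(X_train, dtype=float)
    X_query = np.asarray(X_query, dtype=float)
    y_train = np.asarray(y_train, dtype=float)
    # Squared Euclidean distance between every query and training point.
    diff = X_query[:, None, :] - X_train[None, :, :]
    sq_dist = (diff ** 2).sum(axis=2)
    weights = np.exp(-0.5 * sq_dist / bandwidth ** 2)
    return (weights @ y_train) / weights.sum(axis=1)


def loo_mean_absolute_error(X, y, bandwidth):
    """Leave-one-out cross-validated mean absolute error."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    n = len(y)
    errors = np.empty(n)
    for i in range(n):
        keep = np.arange(n) != i  # drop the i-th compound from training
        pred = nadaraya_watson(X[keep], y[keep], X[i:i + 1], bandwidth)
        errors[i] = abs(pred[0] - y[i])
    return errors.mean()


# Synthetic stand-in data (hypothetical): 47 points, 2 descriptors.
rng = np.random.default_rng(0)
X_demo = rng.uniform(-2.0, 2.0, size=(47, 2))
y_demo = np.sin(X_demo[:, 0]) + 0.5 * X_demo[:, 1] ** 2
y_demo = y_demo + rng.normal(scale=0.1, size=47)

# Scan a few bandwidths and report the cross-validated error for each.
for h in (0.2, 0.5, 1.0):
    print(f"bandwidth={h}: LOO MAE={loo_mean_absolute_error(X_demo, y_demo, h):.3f}")
```

The bandwidth controls the trade-off between smoothness and fidelity to local data; scanning it against the cross-validated error, as in the loop above, is one common way to choose it.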
Pages: 452-459
Number of pages: 8