Nonparametric Regression Applied to Quantitative Structure-Activity Relationships

Cited by: 22
Authors
Constans, P [1]
Hirst, JD [1]
Affiliations
[1] Scripps Res Inst, Dept Mol Biol, La Jolla, CA 92037 USA
Source
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2000 / Vol. 40 / Issue 02
DOI
10.1021/ci990082e
Chinese Library Classification (CLC) number
O6 [Chemistry];
Discipline classification code
0703;
Abstract
Several nonparametric regressors have been applied to modeling quantitative structure-activity relationship (QSAR) data. The simplest regressor, the Nadaraya-Watson, was assessed in a genuine multivariate setting. Other regressors, the local linear and the shifted Nadaraya-Watson, were implemented within additive models, a computationally more expedient approach that is better suited to low-density designs. Performances were benchmarked against the nonlinear method of smoothing splines. A linear reference point was provided by multilinear regression (MLR). Variable selection was explored using systematic combinations of different variables and combinations of principal components. For the data set examined, 47 inhibitors of dopamine beta-hydroxylase, the additive nonparametric regressors have greater predictive accuracy (as measured by the mean absolute error of the predictions or the Pearson correlation in cross-validation trials) than MLR. The use of principal components did not improve the performance of the nonparametric regressors over use of the original descriptors, since the original descriptors are not strongly correlated. It remains to be seen if the nonparametric regressors can be successfully coupled with better variable selection and dimensionality reduction in the context of high-dimensional QSARs.
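As an illustration of the simplest regressor named in the abstract, the following is a minimal sketch of a multivariate Nadaraya-Watson estimator scored by the mean absolute error of its cross-validated predictions. It is not the authors' implementation: the Gaussian product kernel, the single shared bandwidth, the leave-one-out scheme, and the function names are all assumptions made here for illustration, and the data in the usage lines are synthetic, not the dopamine beta-hydroxylase set.

```python
import math


def gaussian_kernel(u):
    """Unnormalized Gaussian kernel (the normalizing constant cancels in the ratio)."""
    return math.exp(-0.5 * u * u)


def nadaraya_watson(x_train, y_train, x_query, bandwidth):
    """Kernel-weighted average of y_train evaluated at x_query.

    Assumption: a product Gaussian kernel with one bandwidth shared
    by every descriptor.
    """
    weights = []
    for xi in x_train:
        w = 1.0
        for a, b in zip(xi, x_query):
            w *= gaussian_kernel((a - b) / bandwidth)
        weights.append(w)
    total = sum(weights)
    if total == 0.0:
        # All weights underflowed: fall back to the plain mean.
        return sum(y_train) / len(y_train)
    return sum(w * y for w, y in zip(weights, y_train)) / total


def loo_mean_absolute_error(x, y, bandwidth):
    """Leave-one-out cross-validation: predict each point from all the others."""
    errors = []
    for i in range(len(x)):
        x_rest = x[:i] + x[i + 1:]
        y_rest = y[:i] + y[i + 1:]
        prediction = nadaraya_watson(x_rest, y_rest, x[i], bandwidth)
        errors.append(abs(prediction - y[i]))
    return sum(errors) / len(errors)


# Synthetic two-descriptor example (hypothetical values).
x = [[0.0, 0.1], [0.5, 0.4], [1.0, 0.9], [1.5, 1.6], [2.0, 2.1]]
y = [0.2, 0.7, 1.1, 1.8, 2.2]
for h in (0.25, 0.5, 1.0):
    print(h, loo_mean_absolute_error(x, y, h))
```

The loop over `h` shows how a bandwidth might be chosen by comparing cross-validated errors; an additive model would instead fit one univariate smoother per descriptor, which this sketch does not attempt.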
Pages: 452-459
Page count: 8