Novel variable selection quantitative structure-property relationship approach based on the k-nearest-neighbor principle

被引:400
作者
Zheng, WF [1 ]
Tropsha, A [1 ]
机构
[1] Univ N Carolina, Sch Pharm, Div Med Chem & Nat Prod, Lab Mol Modeling, Chapel Hill, NC 27599 USA
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2000年 / 40卷 / 01期
关键词
D O I
10.1021/ci980033m
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
A novel automated variable selection quantitative structure-activity relationship (QSAR) method, based on the K-nearest neighbor principle (kNN-QSAR) has been developed. The kNN-QSAR method explores formally the active analogue approach, which implies that similar compounds display similar profiles of pharmacological activities. The activity of each compound is predicted as the average activity of K most chemically similar compounds from the data set. The robustness of a QSAR model is characterized by the value of cross-validated R-2 (q(2)) using the leave-one-out cross-validation method. The chemical structures are characterized by multiple topological descriptors such as molecular connectivity indices or atom pairs. The chemical similarity is evaluated by Euclidean distances between compounds in multidimensional descriptor space, and the optimal subset of descriptors is selected using simulated annealing as a stochastic optimization algorithm. The application of the kNN-QSAR method to 58 estrogen receptor ligands as well as to several other groups of pharmacologically active compounds yielded QSAR models with q(2) values of 0.6 or higher. Due to its relative simplicity, high degree of automation, nonlinear nature, and computational efficiency, this method could be applied routinely to a large variety of experimental data sets.
引用
收藏
页码:185 / 194
页数:10
相关论文
共 87 条
[1]   3-D QSAR FOR INTRINSIC ACTIVITY OF 5-HT(1A) RECEPTOR LIGANDS BY THE METHOD OF COMPARATIVE MOLECULAR-FIELD ANALYSIS [J].
AGARWAL, A ;
TAYLOR, EW .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 1993, 14 (02) :237-245
[2]  
AJAY A, 1993, J MED CHEM, V36, P3565
[3]   APPLICATIONS OF NEURAL NETWORKS IN QUANTITATIVE STRUCTURE-ACTIVITY-RELATIONSHIPS OF DIHYDROFOLATE-REDUCTASE INHIBITORS [J].
ANDREA, TA ;
KALAYEH, H .
JOURNAL OF MEDICINAL CHEMISTRY, 1991, 34 (09) :2824-2836
[4]   QUANTITATIVE STRUCTURE RETENTION RELATIONSHIP STUDIES OF ODOR-ACTIVE ALIPHATIC-COMPOUNDS WITH OXYGEN-CONTAINING FUNCTIONAL-GROUPS [J].
ANKER, LS ;
JURS, PC ;
EDWARDS, PA .
ANALYTICAL CHEMISTRY, 1990, 62 (24) :2676-2684
[5]   GENERATING OPTIMAL LINEAR PLS ESTIMATIONS (GOLPE) - AN ADVANCED CHEMOMETRIC TOOL FOR HANDLING 3D-QSAR PROBLEMS [J].
BARONI, M ;
COSTANTINO, G ;
CRUCIANI, G ;
RIGANELLI, D ;
VALIGI, R ;
CLEMENTI, S .
QUANTITATIVE STRUCTURE-ACTIVITY RELATIONSHIPS, 1993, 12 (01) :9-20
[6]  
Basak S C, 1995, SAR QSAR Environ Res, V3, P265, DOI 10.1080/10629369508050153
[7]  
BASAK SC, 1995, NEW J CHEM, V19, P231
[8]   PREDICTING MUTAGENICITY OF CHEMICALS USING TOPOLOGICAL AND QUANTUM-CHEMICAL PARAMETERS - A SIMILARITY BASED STUDY [J].
BASAK, SC ;
GRUNWALD, GD .
CHEMOSPHERE, 1995, 31 (01) :2529-2546
[9]   USE OF GRAPH-THEORETIC PARAMETERS IN RISK ASSESSMENT OF CHEMICALS [J].
BASAK, SC ;
BERTELSEN, S ;
GRUNWALD, GD .
TOXICOLOGY LETTERS, 1995, 79 (1-3) :239-250
[10]   A MACHINE LEARNING APPROACH TO COMPUTER-AIDED MOLECULAR DESIGN [J].
BOLIS, G ;
DIPACE, L ;
FABROCINI, F .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 1991, 5 (06) :617-628