Validated QSAR prediction of OH tropospheric degradation of VOCs: Splitting into training-test sets and consensus modeling

被引:202
作者
Gramatica, P
Pilutti, P
Papa, E
机构
[1] Univ Insubria, QSAR, Dept Struct & Funct Biol, I-21100 Varese, Italy
[2] Univ Insubria, Environm Chem Res Unit, I-21100 Varese, Italy
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2004年 / 44卷 / 05期
关键词
D O I
10.1021/ci049923u
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The rate constant for hydroxyl radical tropospheric degradation of 460 heterogeneous organic compounds is predicted by QSAR modeling. The applied Multiple Linear Regression is based on a variety of theoretical molecular descriptors, selected by the Genetic Algorithms-Variable Subset Selection (GA-VSS) procedure. The models were validated for predictivity by both internal and external validation. For the external validation two splitting approaches, D-optimal Experimental Design and Kohonen Artificial Neural Networks (K-ANN), were applied to the original data set to compare the two methodologies. We emphasize that external validation is the only way to establish a reliable QSAR model for predictive purposes. Predicted data by consensus modeling from different models are also proposed.
引用
收藏
页码:1794 / 1802
页数:9
相关论文
共 56 条
[1]  
[Anonymous], 2004, DRAGON SOFTWARE CALC
[2]  
Atkinson AC., 1985, Plots, transformations and regression
[3]  
an introduction to graphical methods of diagnostic regression analysis
[5]   Atmospheric chemistry of VOCs and NOx [J].
Atkinson, R .
ATMOSPHERIC ENVIRONMENT, 2000, 34 (12-14) :2063-2101
[6]  
ATKINSON R, 1989, J PHYS REF DATA
[7]   Prediction of hydroxyl radical rate constants from molecular structure [J].
Bakken, GA ;
Jurs, PC .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1999, 39 (06) :1064-1075
[8]  
BONCHEV D, 1983, INFORMATION THEORETI
[9]  
Broto P., 1990, PRACTICAL APPL QUANT
[10]   MOLECULAR-IDENTIFICATION NUMBER FOR SUBSTRUCTURE SEARCHES [J].
BURDEN, FR .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1989, 29 (03) :225-227