Predictions of chromatographic retention indices of alkylphenols with support vector machines and multiple linear regression

Cited by: 15
Authors
Fatemi, Mohammed Hossein [1]
Baher, Elham [1]
Ghorbanzade'h, Mehdi [1]
Affiliations
[1] Univ Mazandaran, Dept Chem, Fac Basic Sci, Babol Sar, Iran
Keywords
Alkylphenols; Kovats retention indices; Multiple linear regression; Quantitative structure-retention relationship; Support vector machine; QUANTITATIVE STRUCTURE; GAS-LIQUID; WATER; QSPR; NONYLPHENOL; TOXICITY; MODEL
DOI
10.1002/jssc.200900373
Chinese Library Classification number
O65 [Analytical Chemistry]
Discipline classification code
070302 [Analytical Chemistry]
Abstract
In this study, a quantitative structure-retention relationship (QSRR) approach was used to predict the Kovats retention indices of 180 alkylphenols and their derivatives using multiple linear regression (MLR) and support vector machine (SVM) methods. After molecular descriptors were calculated for all molecules, the data set was randomly divided into training and test sets. The diversity of the training and test sets was examined with a molecular diversity validation test. Stepwise MLR was then used to select the most important descriptors and to develop the MLR models. The descriptors that appear in these QSRR models are the number of H atoms, the relative number of O atoms, the Balaban index, the yz-shadow/yz-rectangle ratio, and the HDCA(2) index based on the partial charges of hydrogen bond donor atoms. These descriptors were used as inputs for developing the SVM model. After its parameters were optimized, the SVM model was used to calculate the chromatographic retention of the molecules of interest. The values of SE in the calculation of Kovats retention indices for the training and test sets are 0.34 and 0.63, respectively, for the MLR model and 0.35 and 0.63, respectively, for the SVM model. The overall values of average absolute relative error were 13.24 and 13.83 for the MLR and SVM models, respectively. In addition, cross-validation tests were performed to further examine the obtained models. The calculated values of the cross-validation correlation coefficient (Q²) and the standard deviation based on the predicted residual sum of squares are 0.896 and 0.680 for the MLR model and 0.893 and 0.67 for the SVM model. These values and the other statistical parameters obtained for these models show that QSRR with MLR and SVM techniques is suitable for predicting the Kovats retention indices of alkylphenols.
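The cross-validation statistics named in the abstract can be illustrated with a minimal sketch. It is not the authors' code. It assumes leave-one-out cross-validation (the abstract does not state the scheme) and the commonly used definitions Q² = 1 - PRESS/SS_tot and S_PRESS = sqrt(PRESS/(n - k - 1)), where PRESS is the predicted residual sum of squares, n the number of compounds, and k the number of descriptors. The function names and the toy data are hypothetical.

```python
import math


def solve_linear_system(a, b):
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):  # back substitution
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def fit_mlr(X, y):
    """Ordinary least squares with an intercept, via the normal equations."""
    rows = [[1.0] + list(r) for r in X]
    p = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(p)] for i in range(p)]
    xty = [sum(r[i] * t for r, t in zip(rows, y)) for i in range(p)]
    return solve_linear_system(xtx, xty)


def predict(coef, x):
    """Predicted response for one descriptor vector x."""
    return coef[0] + sum(c * v for c, v in zip(coef[1:], x))


def loo_statistics(X, y):
    """Return (Q2, S_PRESS) from leave-one-out cross-validation."""
    n, k = len(y), len(X[0])
    press = 0.0
    for i in range(n):
        # Refit without compound i, then predict the held-out compound.
        coef = fit_mlr(X[:i] + X[i + 1:], y[:i] + y[i + 1:])
        press += (y[i] - predict(coef, X[i])) ** 2
    mean_y = sum(y) / n
    ss_tot = sum((t - mean_y) ** 2 for t in y)
    return 1.0 - press / ss_tot, math.sqrt(press / (n - k - 1))


# Toy data, invented for illustration only (two descriptors, eight compounds).
X_toy = [[0, 0], [1, 0], [0, 1], [1, 1], [2, 1], [1, 2], [2, 2], [3, 1]]
y_toy = [2.1, 4.9, 1.2, 3.8, 7.1, 2.9, 6.2, 9.8]
q2_toy, spress_toy = loo_statistics(X_toy, y_toy)
```

The normal equations are solved directly here only to keep the sketch dependency-free; for descriptor sets that are nearly collinear, a QR- or SVD-based least-squares routine would be the safer choice.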
Pages: 4133-4142
Number of pages: 10