THE STRUCTURE-PROPERTY MODELS CAN BE IMPROVED USING THE ORTHOGONALIZED DESCRIPTORS

Cited by: 81
Authors
LUCIC, B
NIKOLIC, S
TRINAJSTIC, N
JURETIC, D
Affiliations
[1] RUDJER BOSKOVIC INST,ZAGREB 41001,CROATIA
[2] UNIV SPLIT,DEPT NAT SCI & ARTS,SPLIT 58000,CROATIA
Source
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 1995 / Vol. 35 / Issue 03
DOI
10.1021/ci00025a022
Chinese Library Classification number
O6 [Chemistry];
Discipline classification code
0703 ;
Abstract
In this report we describe how orthogonalized descriptors can be used to obtain a better structure-property-activity model. The approach is illustrated using the truncated connectivity basis ^lχ (l = 0, 1, ..., 6). The molecular property used to test the approach was the boiling points of octanes. We first developed an algorithm that produces the best possible models with I descriptors (I = 1-7) in the nonorthogonalized basis. These models were always better than the models that most authors obtain with the stepwise inclusion-exclusion procedure. The next step was the development of a computer program that generates all possible orthogonalization orderings of a given set of I descriptors. In doing so we discovered that certain orderings of the orthogonalized descriptors lead to models with higher values of the correlation coefficient (R) than the corresponding models with nonorthogonalized descriptors. We therefore selected, among all possible orthogonalization orderings (there are I! orderings for I descriptors), the ordering that yields the descriptor with the highest value of R. We call this descriptor the dominant descriptor. After locating the first dominant descriptor, we chose the second dominant descriptor among the remaining (I - 1) descriptors by the same procedure. The third, fourth, and subsequent dominant descriptors are obtained in the same way. Selecting the dominant descriptors in this manner necessarily minimizes the contributions of those descriptors that contribute only small amounts to the total correlation coefficient, because the total R for any fixed set of I descriptors is constant and independent of the orthogonalization order. These descriptors appear to be insignificant and are removed from consideration. Removing them diminishes the total R only negligibly, but the values of S and of the F-test improve significantly, since the resulting model has fewer descriptors.
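The ordering argument in the abstract can be illustrated with a small sketch. It is not the authors' program: it assumes sequential (Gram-Schmidt) orthogonalization of mean-centered descriptors, and the data and the names `orthogonalize`, `r2_contributions`, and `dominant_descriptor` are invented here for illustration. For orthogonalized descriptors the squared correlation coefficients add up to the total R^2, so the total stays the same for every ordering while the individual contributions shift.

```python
from itertools import permutations


def center(v):
    """Subtract the mean so that dot products behave like covariances."""
    m = sum(v) / len(v)
    return [x - m for x in v]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def orthogonalize(columns):
    """Gram-Schmidt in the given order: each centered column is replaced by
    its residual after projecting out all previously processed columns."""
    basis = []
    for col in columns:
        r = center(col)
        for b in basis:
            coef = dot(r, b) / dot(b, b)
            r = [ri - coef * bi for ri, bi in zip(r, b)]
        basis.append(r)
    return basis


def r2_contributions(columns, y):
    """Squared correlation of y with each orthogonalized descriptor.
    Because the descriptors are mutually orthogonal, these sum to R^2."""
    yc = center(y)
    syy = dot(yc, yc)
    return [dot(b, yc) ** 2 / (dot(b, b) * syy) for b in orthogonalize(columns)]


def dominant_descriptor(descriptors, y):
    """Scan all I! orderings and return (name, ordering, r2) for the single
    orthogonalized descriptor with the largest contribution."""
    best = None
    for order in permutations(descriptors):
        contribs = r2_contributions([descriptors[k] for k in order], y)
        for name, c in zip(order, contribs):
            if best is None or c > best[2]:
                best = (name, order, c)
    return best


# Toy data, invented for illustration only (not taken from the paper).
X = {
    "d1": [1, 2, 3, 4, 5, 6],
    "d2": [1, 4, 9, 16, 25, 36],
    "d3": [2, 1, 4, 3, 6, 5],
}
y = [1.0, 2.1, 2.9, 4.2, 4.8, 6.1]

for order in permutations(X):
    contribs = r2_contributions([X[k] for k in order], y)
    print(order, [round(c, 4) for c in contribs], "total R^2 =", round(sum(contribs), 6))

print("dominant:", dominant_descriptor(X, y))
```

Repeating `dominant_descriptor` on the remaining descriptors would mimic the stepwise selection of the second, third, and later dominant descriptors. The exhaustive scan grows as I!, which is manageable for the seven descriptors mentioned in the abstract but not for much larger sets.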
Pages: 532-538
Number of pages: 7