A new 3D molecular structure representation using quantum topology with application to structure-property relationships

被引:46
作者
Alsberg, BK [1 ]
Marchand-Geneste, N [1 ]
King, RD [1 ]
机构
[1] Univ Coll Wales, Dept Comp Sci, Computat Biol Grp, Aberystwyth SY23 3DB, Ceredigion, Wales
基金
英国生物技术与生命科学研究理事会;
关键词
structure representation using quantum topology; StruQT; quantitative structure-activity relationships QSAR; quantitative structure-property relationships; QSPR; atoms in molecules AIM; quantum chemistry; bader theory; multivariate analysis; partial least squares regression; 3D structure representation; genetic programming; genetic algorithms; variable selection;
D O I
10.1016/S0169-7439(00)00101-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present a new 3D molecular structure representation based on Richard F.W. Bader's quantum topological atoms in molecules (AIM) theory for use in quantitative structure-property/activity relationship (QSPR/QSAR) modeling. Central to this structure representation using quantum topology (StruQT) are critical points located on the electron density distribution of the molecules. Other gradient fields such as the Laplacian of the electron density distribution can also be used. The type of critical point of particular interest is the bond critical point (BCP) which is here characterized by using the following three parameters: electron density rho, the Laplacian del rho and the ellipticity epsilon. This representation has the advantage that there is no need to probe a large number of lattice points in 3D space to capture the important parts of the 3D electronic structure as is necessary in, e.g. comparative field analysis (CoMFA). We tested the new structure representation by predicting the wavelength of the lowest UV transition for a system of 18 anthocyanidins. Different quantitative structure-property relationship (QSPR) models are constructed using several chemometric/machine learning methods such as standard partial least squares regression (PLS), truncated PLS variable selection, genetic algorithm-based variable selection and genetic programming (GP). These models identified bonds that either take part in decreasing or increasing the dominant excitation wavelength. The models also correctly emphasized on the involvement of the conjugated pi system for predicting the wavelength through flagging the BCP ellipticity parameters as important for this particular data set. (C) 2000 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:75 / 91
页数:17
相关论文
共 64 条
[1]   MOLECULAR REFERENCE (MOLREF) - A NEW TOOL IN QUANTITATIVE STRUCTURE-ACTIVITY-RELATIONSHIPS (QSAR) [J].
ALSBERG, B .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1990, 8 (02) :173-181
[2]  
Alsberg BK, 2000, DATA HANDL SCI TECHN, V22, P351, DOI 10.1016/S0922-3487(00)80041-6
[3]   Improving the interpretation of multivariate and rule induction models by using a peak parameter representation [J].
Alsberg, BK ;
Winson, MK ;
Kell, DB .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1997, 36 (02) :95-109
[4]   PLS regression methods [J].
Höskuldsson, Agnar .
Journal of Chemometrics, 1988, 2 (03) :211-228
[5]  
[Anonymous], 1994, INT SERIES MONOGRAPH
[6]  
[Anonymous], 1989, MULTIVARIATE CALIBRA
[7]  
[Anonymous], 1991, Handbook of genetic algorithms
[8]   The electron pair [J].
Bader, RFW ;
Johnson, S ;
Tang, TH ;
Popelier, PLA .
JOURNAL OF PHYSICAL CHEMISTRY, 1996, 100 (38) :15398-15415
[9]   DESCRIPTION OF CONJUGATION AND HYPERCONJUGATION IN TERMS OF ELECTRON DISTRIBUTIONS [J].
BADER, RFW ;
SLEE, TS ;
CREMER, D ;
KRAKA, E .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 1983, 105 (15) :5061-5068
[10]   A TOPOLOGICAL THEORY OF MOLECULAR-STRUCTURE [J].
BADER, RFW ;
NGUYENDANG, TT ;
TAL, Y .
REPORTS ON PROGRESS IN PHYSICS, 1981, 44 (08) :893-948