Database mining using soft computing techniques. An integrated neural network-fuzzy logic-genetic algorithm approach

被引:26
作者
Cundari, TR [1 ]
Russo, M
机构
[1] Univ Memphis, Dept Chem, Computat Res Mat Inst CROMIUM, Memphis, TN 38152 USA
[2] Univ Messina, Dept Phys, I-98166 Messina, Italy
[3] Natl Inst Nucl Phys, Ist Nazl Fis Nucl, Sect Catania, I-95127 Catania, Italy
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2001年 / 41卷 / 02期
关键词
D O I
10.1021/ci0000068
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Two different soft computing (SC) techniques (a competitive learning neural network and an integrated neural network-fuzzy logic-genetic algorithm approach) are employed in the analysis of a database subset obtained from the Cambridge Structural Database. The chemical problem chosen for study is relevant to the relationship between various metric parameters in transition metal imido (L(n)MdNZ, Z = carbon-based substituent) complexes and the chemical consequences of such relationships. The SC techniques confirmed and quantified the suspected relationship between the metal-nitrogen bond length and the metal-nitrogen-substituent bond angle for transition metal imidos: increased metal-nitrogen-carbon angles correlate with shortened metal-nitrogen distances. The mining effort also yielded an unexpected correlation between the NC distance and the MNC angle-shorter NC correlate with larger MNC. A fuzzy inference system is used to construct an MNred-NC-MNC hypersurface. This hypersurface suggests a complicated interdependence among NC, MNred, and the angle subtended by these two bonds. Also, major portions of the hypersurface are very flat, in regions where MNC is approaching linearity. The relationships are also seen to be influenced by whether the imido substituent is an alkyl or aryl group. Computationally, the present results are of particular interest in two respects. First, SC classification was able to isolate an "outlier" cluster. Identification of outliers is important as they may correspond to unreported experimental errors in the database or novel chemical entities, both of which warrant further investigation. Second, the SC database mining not only confirmed and quantified a suspected relationship (MNred versus MNC) within the data but also yielded a trend that was not suspected (NC versus MNC).
引用
收藏
页码:281 / 287
页数:7
相关论文
共 38 条
[1]  
Allen F.H., 1993, CHEM AUTOMAT NEWS, V8, P31
[2]   TABLES OF BOND LENGTHS DETERMINED BY X-RAY AND NEUTRON-DIFFRACTION .1. BOND LENGTHS IN ORGANIC-COMPOUNDS [J].
ALLEN, FH ;
KENNARD, O ;
WATSON, DG ;
BRAMMER, L ;
ORPEN, AG ;
TAYLOR, R .
JOURNAL OF THE CHEMICAL SOCIETY-PERKIN TRANSACTIONS 2, 1987, (12) :S1-S19
[3]  
BENSON MT, 1996, REV COMPUTATIONAL CH, V8, P145
[4]   Conformation of tripod metal templates in CH3C(CH(2)PPh(2))(3)ML(n) (n=2,3): Neural networks in conformational analysis [J].
Beyreuther, S ;
Hunger, J ;
Huttner, G ;
Mann, S ;
Zsolnai, L .
CHEMISCHE BERICHTE, 1996, 129 (07) :745-757
[5]   NITROGEN NUCLEAR-MAGNETIC-RESONANCE SPECTROSCOPY AS A PROBE OF BONDING, BENDING AND FLUXIONALITY OF THE IMIDO LIGAND [J].
BRADLEY, DC ;
HODGE, SR ;
RUNNACLES, JD ;
HUGHES, M ;
MASON, J ;
RICHARDS, RL .
JOURNAL OF THE CHEMICAL SOCIETY-DALTON TRANSACTIONS, 1992, (10) :1663-1668
[6]  
BRAGA, 1996, J CHEM SOC DA, P3925
[7]   FROM CRYSTAL STATICS TO CHEMICAL-DYNAMICS [J].
BURGI, HB ;
DUNITZ, JD .
ACCOUNTS OF CHEMICAL RESEARCH, 1983, 16 (05) :153-161
[8]   A STERIC PREFERENCE FOR LINEAR VERSUS BENT IMIDO LIGATION - SYNTHESIS AND X-RAY CRYSTAL-STRUCTURE OF [MO(NAR)2(EDTC)2] (AR = 2,6-IPR2C6H3 EDTC = S2CNET2) CONTAINING 2 LINEAR IMIDO MOIETIES [J].
COFFEY, TA ;
FORSTER, GD ;
HOGARTH, G ;
SELLA, A .
POLYHEDRON, 1993, 12 (22) :2741-2743
[9]   TRANSITION-METAL IMIDO COMPLEXES [J].
CUNDARI, TR .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 1992, 114 (20) :7879-7888
[10]   Structural analysis of transition metal β-X substituent interactions.: Toward the use of soft computing methods for catalyst modeling [J].
Cundari, TR ;
Deng, J ;
Pop, HF ;
Sârbu, C .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2000, 40 (04) :1052-1061