The maximum common substructure as a molecular depiction in a supervised classification context:: Experiments in quantitative structure/biodegradability relationships

被引:27
作者
Cuissart, B
Touffet, F
Crémilleux, B
Bureau, R
Rault, S
机构
[1] Univ Caen, UPRESEA 2126, Ctr Etud & Rech Medicament Normandie, F-14032 Caen, France
[2] Univ Caen, CNRS, UMR6072, Grp Rech Elect Informat & Imagerie Caen, F-14032 Caen, France
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2002年 / 42卷 / 05期
关键词
D O I
10.1021/ci020017w
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The maximum common structure between two molecules (MCS) induces a similarity that enables one to group compounds sharing the same pattern. This text relates a study based on such a structural depiction in a context of quantitative structure/biodegradability relationships (QSBR). The similarity indices are based exclusively on the MCS. First, the results of statistical tests prove that these indices significantly group compounds of similar activity together. These first conclusions enable the elaboration of classification models using those structural similarities. In a second part, a population of classifiers relying on the maximum common structure and the k-nearest-neighbor algorithm is explored. Finally, a thorough examination of the best models is conducted.
引用
收藏
页码:1043 / 1052
页数:10
相关论文
共 31 条
[1]  
[Anonymous], 1997, EVALUATION METHODS M
[2]  
[Anonymous], SAR QSAR ENV RES
[3]  
[Anonymous], ELEMENTS STAT LEARNI
[4]   AN ALGORITHM FOR THE MULTIPLE COMMON SUBGRAPH PROBLEM [J].
BAYADA, DM ;
SIMPSON, RW ;
JOHNSON, AP ;
LAURENCO, C .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1992, 32 (06) :680-685
[5]   GROUP-CONTRIBUTION METHOD FOR PREDICTING PROBABILITY AND RATE OF AEROBIC BIODEGRADATION [J].
BOETHLING, RS ;
HOWARD, PH ;
MEYLAN, W ;
STITELER, W ;
BEAUMAN, J ;
TIRADO, N .
ENVIRONMENTAL SCIENCE & TECHNOLOGY, 1994, 28 (03) :459-465
[6]  
DACUNHACASTELLE D, 1994, PROBABILITES STAT, P134
[7]  
Dasarathy B.V., 1991, IEEE COMPUTER SOC TU
[8]  
EGAN JP, 1975, SIGNAL DETECTION THE
[9]  
Fawcett T, 1997, P 3 INT C KNOWL DISC
[10]   MOLECULAR SUBSTRUCTURE SIMILARITY SEARCHING - EFFICIENT RETRIEVAL IN 2-DIMENSIONAL STRUCTURE DATABASES [J].
HAGADONE, TR .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1992, 32 (05) :515-521