A distance measure between models: a tool for similarity/diversity analysis of model populations

被引:28
作者
Todeschini, R [1 ]
Consonni, V [1 ]
Pavan, M [1 ]
机构
[1] Univ Milano Bicocca, Dept Environm Sci, Milano Chemometr & QSAR Res Grp, I-20126 Milan, Italy
关键词
similarity/diversity; Hamming distance; Model distance; Selwood data set;
D O I
10.1016/j.chemolab.2003.10.003
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In many research fields, there is, nowadays, a lot of readily available information, however, it needs processing. This is the case of the field of Quantitative Structure-Activity Relationships (QSAR), which exploits several thousand molecular descriptors, and quality control and multivariate calibration where hundreds of spectroscopic signals are easily obtained from spectroscopic methods. Genetic Algorithms, Simulated Annealing, and Tabu Search are some of the methods that are widely used to process available information to find sets of optimal models. In this case, the problem that arises is how to compare the selected models. This work proposes a new measure of the distance between two models, and we will demonstrate that this model distance allows clusters of similar models to be found and the most diverse models to be caught in such a way as to preserve maximum information and diversity. (C) 2003 Elsevier B.V. All rights reserved.
引用
收藏
页码:55 / 61
页数:7
相关论文
共 10 条
[2]   Tabu search model selection in multiple regression analysis [J].
Drezner, Z ;
Marcoulides, GA ;
Salhi, S .
COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 1999, 28 (02) :349-367
[3]  
Kalivas J. H., 1995, ADAPTION SIMULATED A
[4]  
Leardi R, 2001, J CHEMOMETR, V15, P559, DOI 10.1002/cem.651
[5]   STRUCTURE-ACTIVITY-RELATIONSHIPS OF ANTIFILARIAL ANTIMYCIN ANALOGS - A MULTIVARIATE PATTERN-RECOGNITION STUDY [J].
SELWOOD, DL ;
LIVINGSTONE, DJ ;
COMLEY, JCW ;
ODOWD, AB ;
HUDSON, AT ;
JACKSON, P ;
JANDU, KS ;
ROSE, VS ;
STABLES, JN .
JOURNAL OF MEDICINAL CHEMISTRY, 1990, 33 (01) :136-142
[6]  
Todeschini R., 2008, HDB MOL DESCRIPTORS
[7]  
TODESCHINI R, 2002, MOBY DIGS EVOLUTION
[8]  
TODESCHINI R, 2002, DRAGON REL 2 1 WINDO
[9]   Evolutionary optimisation: a tutorial [J].
Wehrens, R ;
Buydens, LMC .
TRAC-TRENDS IN ANALYTICAL CHEMISTRY, 1998, 17 (04) :193-203
[10]   Rational combinatorial library design. 3. Simulated annealing guided evaluation (SAGE) of molecular diversity: A novel computational tool for universal library design and database mining [J].
Zheng, WF ;
Cho, SJ ;
Waller, CL ;
Tropsha, A .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1999, 39 (04) :738-746