Integrating chemists preferences for shape-similarity clustering of series

被引:5
作者
Baumes, Laurent A. [1 ]
Gaudin, Remi [2 ]
Serna, Pedro [1 ]
Nicoloyannis, Nicolas [2 ]
Corma, Avelino [1 ]
机构
[1] Univ Politecn Valencia, CSIC, Inst Tecnol Quim, E-46022 Valencia, Spain
[2] Univ Lyon 2, Lab ERIC, F-69676 Bron, France
关键词
combinatorial; high throughput; heterogeneous catalysis; heck; data mining; time series; clustering; hierarchical; k-means;
D O I
10.2174/138620708784246068
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
This study shows how chemistry knowledge and reasoning are taken into account for building a new methodology that aims at automatically grouping data having a chronological structure. We consider combinatorial catalytic experiments where the evolution of a reaction ( e. g., conversion) over time is expected to be analyzed. The mathematical tool has been developed to compare and group curves taking into account their shape. The strategy, which consists on combining a hierarchical clustering with the k-means algorithm, is described and compared with both algorithms used separately. The hybridization is shown to be of great interest. Then, a second application mode of the proposed methodology is presented. Once meaningful clusters according to chemist's preferences and goals are successfully achieved, the induced model may be used in order to automatically classify new experimental results. The grouping of the new catalysts tested for the Heck coupling reaction between styrene and iodobenzene verified the set of criteria "defined" during the initial clustering step, and facilitated a quick identification of the catalytic behaviors following user's preferences.
引用
收藏
页码:266 / 282
页数:17
相关论文
共 69 条
[1]   From science to innovation and from data to knowledge: eScience in the Dutch Polymer Institute's high-throughput experimentation cluster [J].
Adams, N ;
Schubert, US .
QSAR & COMBINATORIAL SCIENCE, 2005, 24 (01) :58-65
[2]   Software solutions for combinatorial and high-throughput materials and polymer research [J].
Adams, N ;
Schubert, US .
MACROMOLECULAR RAPID COMMUNICATIONS, 2004, 25 (01) :48-58
[3]  
[Anonymous], 2003, Experimental Design for Combinatorial and High Throughput Materials Development
[4]  
[Anonymous], AUSTR DAT MIN WORKSH
[5]  
ANTUNES CM, 2001, P WORKSH TEMP DAT MI
[6]  
ARELLANO G, 2006, J CATAL, V238, P497
[7]   Using Artificial Neural Networks to boost high-throughput discovery in heterogeneous catalysis [J].
Baumes, L ;
Farrusseng, D ;
Lengliz, M ;
Mirodatos, C .
QSAR & COMBINATORIAL SCIENCE, 2004, 23 (09) :767-778
[8]   Support vector machines for predictive modeling in heterogeneous catalysis: A comprehensive introduction and overfitting investigation based on two real applications [J].
Baumes, L. A. ;
Serra, J. M. ;
Serna, P. ;
Corma, A. .
JOURNAL OF COMBINATORIAL CHEMISTRY, 2006, 8 (04) :583-596
[9]   MAP: An iterative experimental design methodology for the optimization of catalytic search space structure modeling [J].
Baumes, LA .
JOURNAL OF COMBINATORIAL CHEMISTRY, 2006, 8 (03) :304-314
[10]  
BAUMES LA, 2003, LECT NOTES AI LNCS