Evolutionary-algorithm-based strategy for computer-assisted structure elucidation

被引:17
作者
Han, YQ [1 ]
Steinbeck, C [1 ]
机构
[1] Max Planck Inst Chem Okol, D-07745 Jena, Germany
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2004年 / 44卷 / 02期
关键词
D O I
10.1021/ci034132y
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
An evolutionary algorithm (EA) using a graph-based data structure to explore the molecular constitution space is presented. The EA implementation proves to be a promising alternative to deterministic approaches to the problem of computer-assisted structure elucidation (CASE). While not relying on any external database, the EA-guided CASE program SENECA is able to find correct solutions within calculation times comparable to that of other CASE expert systems. The implementation presented here significantly expands the size limit of constitutional optimization problems treatable with evolutionary algorithms by introducing novel efficient graph-based genetic operators. The new EA-based search strategy is discussed including the underlying data structures, component design, parameter optimization, and evolution process control. Typical structure elucidation examples are given to demonstrate the algorithm's performance.
引用
收藏
页码:489 / 498
页数:10
相关论文
共 28 条
[21]   NMRShiftDB - Constructing a free chemical information system with open-source components [J].
Steinbeck, C ;
Krause, S ;
Kuhn, S .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2003, 43 (06) :1733-1739
[22]   The Chemistry Development Kit (CDK): An open-source Java']Java library for chemo- and bioinformatics [J].
Steinbeck, C ;
Han, YQ ;
Kuhn, S ;
Horlacher, O ;
Luttmann, E ;
Willighagen, E .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2003, 43 (02) :493-500
[23]   LUCY - A program for structure elucidation from NMR correlation experiments [J].
Steinbeck, C .
ANGEWANDTE CHEMIE-INTERNATIONAL EDITION IN ENGLISH, 1996, 35 (17) :1984-1986
[24]   SENECA: A platform-independent, distributed, and parallel system for computer-assisted structure elucidation in organic chemistry [J].
Steinbeck, C .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2001, 41 (06) :1500-1507
[25]   Parametric sensitivity and search-space characterization studies of genetic algorithms for computer-aided polymer design [J].
Sundaram, A ;
Venkatasubramanian, V .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1998, 38 (06) :1177-1191
[26]   SMILES .2. ALGORITHM FOR GENERATION OF UNIQUE SMILES NOTATION [J].
WEININGER, D ;
WEININGER, A ;
WEININGER, JL .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1989, 29 (02) :97-101
[27]   SMILES, A CHEMICAL LANGUAGE AND INFORMATION-SYSTEM .1. INTRODUCTION TO METHODOLOGY AND ENCODING RULES [J].
WEININGER, D .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1988, 28 (01) :31-36
[28]  
Williams A, 2000, Curr Opin Drug Discov Devel, V3, P298