CHEMICAL LITERATURE DATA EXTRACTION - THE CLIDE PROJECT

被引:50
作者
IBISON, P [1 ]
JACQUOT, M [1 ]
KAM, F [1 ]
NEVILLE, AG [1 ]
SIMPSON, RW [1 ]
TONNELIER, C [1 ]
VENCZEL, T [1 ]
JOHNSON, AP [1 ]
机构
[1] UNIV LEEDS,SCH CHEM,LEEDS LS2 9JT,W YORKSHIRE,ENGLAND
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 1993年 / 33卷 / 03期
关键词
D O I
10.1021/ci00013a010
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Chemical information, especially that concerning chemical reactions, is becoming increasingly available in a variety of computer-readable databases. However, the creation of these databases is a time-consuming and expensive process. CLiDE (Chemical Literature Data Extraction) is a new software project to help solve the problem of building substance and reaction databases. CLiDE uses a combination of imaging and artificial intelligence techniques to recognize a range of chemical diagrams and extract the information they contain. The steps necessary to transform a chemical structure drawing into a computer-readable output are detailed. Several examples are given to illustrate the scope of the current work.
引用
收藏
页码:338 / 344
页数:7
相关论文
共 16 条
[1]  
AHRONOVITZ E, 1986, IAPR86IEEE742, V2, P1033
[2]  
ASH JE, 1975, CHEM INFORMATION SYS, P157
[3]  
BORKENT JH, 1988, J CHEM INF COMP SCI, V28, P145
[4]   COMPUTATIONAL PERCEPTION AND RECOGNITION OF DIGITIZED MOLECULAR-STRUCTURES [J].
CONTRERAS, ML ;
ALLENDES, C ;
ALVAREZ, LT ;
ROZAS, R .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1990, 30 (03) :302-307
[5]   DESCRIPTION OF SEVERAL CHEMICAL-STRUCTURE FILE FORMATS USED BY COMPUTER-PROGRAMS DEVELOPED AT MOLECULAR DESIGN LIMITED [J].
DALBY, A ;
NOURSE, JG ;
HOUNSHELL, WD ;
GUSHURST, AKI ;
GRIER, DL ;
LELAND, BA ;
LAUFER, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1992, 32 (03) :244-255
[6]  
DUDA RO, 1972, GRAPHICS IMAGE PROCE, P1
[7]   A TOPOLOGY-BASED COMPONENT EXTRACTOR FOR UNDERSTANDING ELECTRONIC-CIRCUIT DIAGRAMS [J].
FAHN, CS ;
WANG, JF ;
LEE, JY .
COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1988, 44 (02) :119-138
[8]  
FLETCHER LA, 1988, IEEE T PATTERN ANAL, V10, P6
[9]   A PRIMARY ALGORITHM FOR THE UNDERSTANDING OF LOGIC-CIRCUIT DIAGRAMS [J].
FUKADA, Y .
PATTERN RECOGNITION, 1984, 17 (01) :125-134
[10]   CHARACTER-RECOGNITION - A REVIEW [J].
GOVINDAN, VK ;
SHIVAPRASAD, AP .
PATTERN RECOGNITION, 1990, 23 (07) :671-683