AUTOMATIC INTERPRETATION OF THE TEXTS OF CHEMICAL PATENT ABSTRACTS .2. PROCESSING AND RESULTS

Cited by: 11
Authors
CHOWDHURY, GG [1]
LYNCH, MF [1]
Affiliations
[1] UNIV SHEFFIELD, DEPT INFORMAT STUDIES, SHEFFIELD S10 2TN, S YORKSHIRE, ENGLAND
Source
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 1992 / Vol. 32 / No. 5
DOI
10.1021/ci00009a012
Chinese Library Classification (CLC) number
O6 [Chemistry];
Discipline classification code
0703;
Abstract
Part 1 of this series described the lexical isolation and categorization of the text tokens of statements describing generic structures in the texts of Documentation Abstracts from Derwent Publications Ltd.;¹ this paper describes the syntactic and semantic processing of the tokens with a view to producing the corresponding GENSAL expressions. The syntactic analysis proceeds as an expectation-driven process; the result of the analysis is then validated by semantic information associated with each token. The prototype system can satisfactorily process 86% of the 545 descriptions studied. Routines for processing variable expressions, multiplier expressions, nested parameter expressions, nested substitutions, compound token declarations, and conditional expressions are described. Messages calling for manual intervention are produced automatically for the 14% of statements that are beyond the scope of the prototype system. The prototype could either be implemented for retrospective conversion of databases of generic chemical structures from printed sources or be adapted to serve as an intelligent editor during preparation of patent abstracts.
Pages: 468-473
Page count: 6