SHORTENING THE OED - EXPERIENCE WITH A GRAMMAR-DEFINED DATABASE

被引:6
作者
BLAKE, GE
BRAY, T
TOMPA, FWM
机构
[1] Centre for the New Oxford English Dictionary and Text Research, University of Waterloo, Waterloo, Ontario
[2] Open Text Corp., Waterloo Town Square, Waterloo, Ontario
关键词
DESIGN; HUMAN FACTORS; LANGUAGES; GRAMMAR DEFINED MODEL; PARSED STRING; TEXT DATABASE;
D O I
10.1145/146760.146764
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Textual databases with highly variable structure can be usefully described by a grammar-defined model. One example of such a text is the Oxford English Dictionary. This paper describes a first attempt to apply technology based on this model to a real problem. A language called GOEDEL, which is a partial implementation of a set of grammar-defined database operators, was used to extract and alter a subset of the OED in order to assist the editors in their production of The Shorter Oxford English Dictionary. The implementation of the pstring data structure to describe a piece of text and the functions that operate on this pstring are illustrated with some detailed examples. The project was judged a success and the resulting program used in production by the Oxford University Press
引用
收藏
页码:213 / 232
页数:20
相关论文
共 22 条
[1]  
[Anonymous], 1989, OXFORD ENGLISH DICT
[2]  
BENBOW T, 1991, COMMUNICATION JUL
[3]  
BERG DL, 1991, COMPUTATIONAL ISSUES
[4]  
BERG DL, 1991, USERS GUIDE OED
[5]  
BERG DL, 1989, OED8902 UW CTR TECH
[6]  
BRUGGEMANNKLEIN A, 1988, 2 U FREIB I INF REP
[7]  
BURNETT LS, 1986, ZURILEX 86 P, P229
[8]  
CHAR B, 1988, MAPLE REFERENCE MANU
[9]  
FAWCETT HJ, 1988, USERS GUIDE
[10]  
Goldfarb C. F., 1990, SGML HDB