SHORTENING THE OED - EXPERIENCE WITH A GRAMMAR-DEFINED DATABASE

被引:6
作者
BLAKE, GE
BRAY, T
TOMPA, FWM
机构
[1] Centre for the New Oxford English Dictionary and Text Research, University of Waterloo, Waterloo, Ontario
[2] Open Text Corp., Waterloo Town Square, Waterloo, Ontario
关键词
DESIGN; HUMAN FACTORS; LANGUAGES; GRAMMAR DEFINED MODEL; PARSED STRING; TEXT DATABASE;
D O I
10.1145/146760.146764
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Textual databases with highly variable structure can be usefully described by a grammar-defined model. One example of such a text is the Oxford English Dictionary. This paper describes a first attempt to apply technology based on this model to a real problem. A language called GOEDEL, which is a partial implementation of a set of grammar-defined database operators, was used to extract and alter a subset of the OED in order to assist the editors in their production of The Shorter Oxford English Dictionary. The implementation of the pstring data structure to describe a piece of text and the functions that operate on this pstring are illustrated with some detailed examples. The project was judged a success and the resulting program used in production by the Oxford University Press
引用
收藏
页码:213 / 232
页数:20
相关论文
共 22 条
[11]  
GONNET GH, 1987, 13TH P INT C VER LAR, P339
[12]  
HORAK W, 1985, IEEE COMPUT, V18, P50
[13]  
KAZMAN R, 1986, CS8620 U WAT CS TECH
[14]  
OSSANNA JF, 1976, NROFF TROFF USERS MA
[15]  
PRATT TW, 1984, PROGRAMMING LANGUAGE
[16]  
SWANNEL J, 1987, UNPUB USERS GUIDE OE
[17]  
THORNTON F, 1988, AUTOMATIC CHANGES NE
[18]  
TOMPA FW, 1989, 5TH P ANN C UW CTR N, P81
[19]  
[No title captured], DOI DOI 10.1109/MC.1987.1663532
[20]  
1989, DOCUMENT STYLE SEMAN