EVALUATING COMPUTER-GENERATED DOMAIN-ORIENTED VOCABULARIES

Cited by: 17
Author
DAMERAU, FJ
Affiliation
[1] IBM, Thomas J. Watson Research Center, Yorktown Heights, NY 10598
DOI
10.1016/0306-4573(90)90052-4
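Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 [Computer science and technology];
Abstract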
It is generally accepted that natural language understanding systems are not now able to deal successfully with unrestricted text, except in very superficial ways. Certainly no current NL system exhibits any significant degree of understanding over arbitrary subject matter. Moreover, there is no convincing reason to believe this situation will change in the near future. Successful systems, therefore, have been restricted to specific applications in particular discourse domains. In those situations where users are expected to provide the domain vocabulary (e.g., TEAM, TQA, etc.) it would be very desirable to provide at least suggestions as to what this vocabulary might be, because a good part of the difficulty in customizing a general system consists of supplying the domain vocabulary and specifying its grammatical properties. This paper discusses some methods for identifying domain vocabulary, as well as techniques for evaluating the quality of the resulting word list. © 1990.
Pages: 791-801
Number of pages: 11