INNOVATIONS IN TEXT INTERPRETATION

被引:19
作者
JACOBS, PS
RAU, LF
机构
[1] Artificial Intelligence Laboratory, GE Research, Development Center, Schenectady
关键词
D O I
10.1016/0004-3702(93)90016-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The field of natural language processing is developing a new concentration on interpreting extended texts, with applications in information retrieval, text categorization, and data extraction. The research that addresses these problems represents the first real task-driven focus since machine translation research in the 1960s. Text interpretation applications have already produced good results in accuracy and throughput. This new focus on task-driven text interpretation has been the driving force for a number of advances in the field, because earlier systems fell so far short of the coverage required to interpret bodies of text. The innovations behind this scale-up include work in lexicon development and representation, weak methods of corpus analysis and text pre-processing, and flexible control architectures for parsing. Together, these methods provide coverage and accuracy in interpretation by extending the knowledge that a system can use and controlling how this knowledge is applied. This paper explains the context in which this research is conducted, along with the general progress of the field and some of the details of how our own system realizes these advances.
引用
收藏
页码:143 / 191
页数:49
相关论文
共 70 条
[1]  
BECKER JD, 1975, THEORETICAL ISSUES N
[2]  
BESEMER D, 1987, 25TH P M ASS COMP LI
[3]  
BOBROW R, 1980, P AAAI 80 STANFORD
[4]  
BOGURAEV B, 1988, J COMPUT LINGUISTICS, V13
[5]   AN OVERVIEW OF THE KL-ONE KNOWLEDGE REPRESENTATION SYSTEM [J].
BRACHMAN, RJ ;
SCHMOLZE, JG .
COGNITIVE SCIENCE, 1985, 9 (02) :171-216
[6]  
CARDIE C, 1991, PROCEEDINGS : NINTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 AND 2, P117
[7]  
CHARNIAK E, 1982, STRATEGIES NATURAL L
[8]  
CHURCH K, 1989, P INT WORKSHOP PARSI
[9]  
CHURCH K, 1991, BUILDING LEXICON ON
[10]  
CROFT WB, 1987, 10TH P ANN INT ACM S, P26