Information extraction: Beyond document retrieval

被引:58
作者
Gaizauskas, R [1 ]
Wilks, Y [1 ]
机构
[1] Univ Sheffield, Dept Comp Sci, Sheffield S10 2TN, S Yorkshire, England
关键词
D O I
10.1108/EUM0000000007162
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we give a synoptic view of the growth of the text processing technology of information extraction (Ie) whose function is to extract information about a pre-specified set of entities, relations or events from natural language texts and to record this information in structured representations called templates. Here we describe the nature of the re task, review the history of the area from its origins in AI work in the 1960s and 70s till the present, discuss the techniques being used to carry out the task, describe application areas where IE systems are or are about to be at work, and conclude with a discussion of the challenges facing the area. What emerges is a picture of an exciting new text processing technology with a host of new applications, both on its own and in conjunction with other technologies, such as information retrieval, machine translation and data mining.
引用
收藏
页码:70 / 105
页数:36
相关论文
共 71 条
  • [1] Aberdeen J., 1995, P 6 MESS UND C MUC 6, P141
  • [2] ANDERSEN PM, 1992, P 3 C APPL NAT LANG, P170
  • [3] [Anonymous], P 3 C APPL NAT LANG
  • [4] [Anonymous], P 16 C COMP LING
  • [5] Appelt D.E., 1995, MUC 6, P237, DOI DOI 10.3115/1072399.1072420
  • [6] Appelt D. E., 1993, P 5 MESS UND C MUC 5, P221
  • [7] *AVENTINUS, ADV INF SYST MULT DR
  • [8] AZZAM S, 1997, IN PRESS P IJCAI 97
  • [9] BLACK WJ, 1997, NATURAL LANGUAGE PRO, P119
  • [10] CHINCHOR N, 1993, P 5 MESS UND C MUC 5, P79