A new method for selecting English field association terms of compound words and its knowledge representation

被引:31
作者
Atlam, E [1 ]
Morita, K [1 ]
Fuketa, M [1 ]
Aoe, J [1 ]
机构
[1] Univ Tokushima, Dept Informat Sci & Intelligent Syst, Tokushima 7708506, Japan
关键词
compound terms; field association term; semantic classification; redundant candidates;
D O I
10.1016/S0306-4573(01)00062-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a strategy for building a morphological machine dictionary of English that infers meaning of derivations by considering morphological affixes and their semantic classification. Derivations are grouped into a frame that is accessible to semantic stem and knowledge base. This paper also proposes an efficient method for selecting compound Field Association (FA) terms from a large pool of single FA terms for some specialized fields. For single FA terms, five levels of association are defined and two ranks are defined, based on stability and inheritance. About 85% of redundant compound FA terms can be removed effectively by using levels and ranks proposed in this paper. Recall averages of 60-80% are achieved, depending on the type of text. The proposed methods are applied to 22,000 relationships between verbs and nouns extracted from the large tagged corpus. (C) 2002 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:807 / 821
页数:15
相关论文
共 27 条
[11]   MODELS FOR RETRIEVAL WITH PROBABILISTIC INDEXING [J].
FUHR, N .
INFORMATION PROCESSING & MANAGEMENT, 1989, 25 (01) :55-72
[12]   A fast method of determining weighted compound keywords from text databases [J].
Fuketa, M ;
Mizofuchi, S ;
Hayashi, Y ;
Aoe, JI .
INFORMATION PROCESSING & MANAGEMENT, 1998, 34 (04) :431-442
[13]  
Fukumoto F., 1996, P 16 INT C COMP LING, P406
[14]  
KAWABE K, 1998, INFORMATION PROCESSI, P87
[15]  
Kimoto H., 1991, Transactions of the Institute of Electronics, Information and Communication Engineers D-I, VJ74D-I, P556
[16]  
Kupiec J., 1995, SIGIR FOR ACM SPEC I, P68, DOI DOI 10.1145/215206.215333
[17]  
MAMIKI T, 1985, NEW ENGLISH GRAMMAR, V2
[18]  
Miyazaki M., 1984, Transactions of the Information Processing Society of Japan, V25, P970
[19]  
Miyazaki M., 1993, Transactions of the Information Processing Society of Japan, V34, P743
[20]  
NORVIG P, 1987, P 6 C ART INT SEATTL, P561