Learning Semantic Lexicons Using a Bootstrapping Algorithm Based on Graph Mutual Reinforcement (in English)

Cited by: 3
Authors
张奇
邱锡鹏
黄萱菁
吴立德
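Affiliation
[1] Department of Computer Science and Technology, Fudan University, Shanghai, PR China
Keywords
Semantic lexicon; bootstrapping; graph mutual reinforcement (GMR)
DOI
Not available
Chinese Library Classification number
TP301.6 [Algorithm theory]
Discipline classification code
081202
Abstract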
This paper presents a method for learning semantic lexicons using a new bootstrapping method based on graph mutual reinforcement (GMR). The approach uses only unlabeled data and a few seed words to learn new words for each semantic category. Unlike other bootstrapping methods, we use GMR-based bootstrapping to rank the candidate words and patterns. Experimental results show that the GMR-based bootstrapping approach outperforms existing algorithms on both in-domain and out-of-domain data. Furthermore, the results show that performance depends not only on the size of the corpus but also on its quality.
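The abstract does not give the scoring formulas, so the following is only a minimal sketch of how candidate words and patterns could be sorted by mutual reinforcement on a bipartite word-pattern graph, in the spirit of HITS. The function names `gmr_scores` and `rank_candidates`, the max-normalization, the fixed iteration count, and the choice to pin seed words at 1.0 are all assumptions made for illustration, not the authors' method.

```python
from collections import defaultdict


def gmr_scores(pairs, seeds, iterations=10):
    """Score words and patterns by mutual reinforcement (illustrative only).

    pairs: iterable of (pattern, word) co-occurrences from unlabeled text.
    seeds: seed words for one semantic category.
    """
    edges = set(pairs)
    word_score = {w: (1.0 if w in seeds else 0.0) for _, w in edges}
    pattern_score = {p: 0.0 for p, _ in edges}

    for _ in range(iterations):
        # A pattern scores highly if it extracts high-scoring words.
        totals = defaultdict(float)
        for p, w in edges:
            totals[p] += word_score[w]
        top = max(totals.values(), default=0.0) or 1.0
        pattern_score = {p: totals[p] / top for p in pattern_score}

        # A word scores highly if high-scoring patterns extract it.
        totals = defaultdict(float)
        for p, w in edges:
            totals[w] += pattern_score[p]
        top = max(totals.values(), default=0.0) or 1.0
        word_score = {w: totals[w] / top for w in word_score}

        # Assumption: pin seeds at 1.0 so the category stays anchored.
        for w in seeds:
            if w in word_score:
                word_score[w] = 1.0

    return word_score, pattern_score


def rank_candidates(pairs, seeds, iterations=10):
    """Return non-seed words sorted by descending score, ties by name."""
    word_score, _ = gmr_scores(pairs, seeds, iterations)
    candidates = [(w, s) for w, s in word_score.items() if w not in seeds]
    return sorted(candidates, key=lambda item: (-item[1], item[0]))
```

In a bootstrapping loop, the top-ranked candidates would typically be added to the lexicon and the scoring repeated; how many words to accept per round is a further design choice that the abstract does not specify.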
Pages: 1257-1261
Number of pages: 5
Related papers
2 items in total
[1] Oren Etzioni, Michael Cafarella, Doug Downey, Ana-Maria Popescu, Tal Shaked, Stephen Soderland, Daniel S. Weld, Alexander Yates. Unsupervised named-entity extraction from the Web: An experimental study [J]. Artificial Intelligence, 2005 (1)
[2] Kleinberg, J. M. Authoritative sources in a hyperlinked environment [J]. Journal of the ACM, 1999, 46 (5): 604-632