Deterministic Coreference Resolution Based on Entity-Centric, Precision-Ranked Rules

被引:188
作者
Lee, Heeyoung [1 ]
Chang, Angel [1 ]
Peirsman, Yves [2 ]
Chambers, Nathanael [3 ]
Surdeanu, Mihai [4 ]
Jurafsky, Dan [1 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
[2] Univ Leuven, B-3000 Louvain, Belgium
[3] US Naval Acad, Annapolis, MD 21402 USA
[4] Univ Arizona, Tucson, AZ 85721 USA
关键词
D O I
10.1162/COLI_a_00152
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
We propose a new deterministic approach to coreference resolution that combines the global information and precise features of modern machine-learning models with the transparency and modularity of deterministic, rule-based systems. Our sieve architecture applies a battery of deterministic coreference models one at a time from highest to lowest precision, where each model builds on the previous model's cluster output. The two stages of our sieve-based architecture, a mention detection stage that heavily favors recall, followed by coreference sieves that are precision-oriented, offer a powerful way to achieve both high precision and high recall. Further, our approach makes use of global information through an entity-centric model that encourages the sharing of features across all mentions that point to the same real-world entity. Despite its simplicity, our approach gives state-of-the-art performance on several corpora and genres, and has also been incorporated into hybrid state-of-the-art coreference systems for Chinese and Arabic. Our system thus offers a new paradigm for combining knowledge in rule-based systems that has implications throughout computational linguistics.
引用
收藏
页码:885 / 916
页数:32
相关论文
共 84 条
[1]
[Anonymous], 2008, Proceedings of the International Conference on Empirical Methods Conference in Natural Language Processing
[2]
[Anonymous], 1998, PROC 1 LANGUAGE RESO
[3]
[Anonymous], 2009, P 2009 C EMP METH NA
[4]
[Anonymous], 1990, The behavior of organisms: An experimental analysis
[5]
[Anonymous], 1995, Proceedings of the 6th conference on Message understanding-MUC6'95, DOI DOI 10.3115/1072399.1072405
[6]
[Anonymous], 2012, JOINT C EMNLP CONLL
[7]
[Anonymous], 2009, Proceedings of NAACL-HLT 2009
[8]
[Anonymous], 2011, Proceedings of the Fifteenth Conference on Computational Natural Language Learning: Shared Task
[9]
[Anonymous], 2007, ANN C N AM CHAPT ASS
[10]
[Anonymous], 2012, P ACL 2012