Centroid-based summarization of multiple documents

被引:520
作者
Radev, DR [1 ]
Jing, HY
Stys, M
Tam, D
机构
[1] Univ Michigan, Ann Arbor, MI 48109 USA
[2] IBM Corp, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA
关键词
multi-document summarization; centroid-based summarization; cluster-based relative utility; cross-sentence informational subsumption;
D O I
10.1016/j.ipm.2003.10.006
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present a multi-document summarizer, MEAD, which generates summaries using cluster centroids produced by a topic detection and tracking system. We describe two new techniques, a centroid-based summarizer, and an evaluation scheme based on sentence utility and subsumption. We have applied this evaluation to both single and multiple document summaries. Finally, we describe two user studies that test our models of multi-document summarization. (C) 2003 Elsevier Ltd. All rights reserved.
引用
收藏
页码:919 / 938
页数:20
相关论文
共 13 条
[1]  
Allan J., 1998, Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P37, DOI 10.1145/290941.290954
[2]  
ALLAN J, 1998, P BROADC NEWS UND TR
[3]  
AONE C, 1997, P ACL WORKSH INT SCA, P66
[4]  
Carbonell J., 1998, Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P335, DOI 10.1145/290941.291025
[5]   Summarizing text documents: Sentence selection and evaluation metrics [J].
Goldstein, J ;
Kantrowitz, M ;
Mittal, V ;
Carbonell, J .
SIGIR'99: PROCEEDINGS OF 22ND INTERNATIONAL CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 1999, :121-128
[6]  
MANI I, 2000, INFORMATION RETRIEVA, V1
[7]  
Mani I., 1999, ADV AUTOMATIC TEXT S
[8]  
McKeown KR, 1999, SIXTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-99)/ELEVENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE (IAAI-99), P453
[9]  
Radev D.R., 1999, DARPA BROADC NEWS WO
[10]  
Radev DR, 1998, COMPUT LINGUIST, V24, P469