Summarizing text documents: Sentence selection and evaluation metrics

被引:157
作者
Goldstein, J [1 ]
Kantrowitz, M [1 ]
Mittal, V [1 ]
Carbonell, J [1 ]
机构
[1] Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
来源
SIGIR'99: PROCEEDINGS OF 22ND INTERNATIONAL CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL | 1999年
关键词
D O I
10.1145/312624.312665
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Human-quality text summarization systems are difficult to design, and even more difficult to evaluate, in part because documents can differ along several dimensions, such as length, writing style and lexical usage. Nevertheless, certain cues can often help suggest the selection of sentences for inclusion in a summary. This paper presents our analysis of news-article summaries generated by sentence selection. Sentences are ranked for potential inclusion in the summary using a weighted combination of statistical and linguistic features. The statistical features were adapted from standard IR methods. The potential linguistic ones were derived from an analysis of news-wire summaries. To evaluate these features we use a normalized version of precision-recall curves, with a baseline of random sentence selection, as well as analyze the properties of such a baseline. We illustrate our discussions with empirical results showing the importance of corpus-dependent baseline summarization standards, compression ratios and carefully crafted long queries.
引用
收藏
页码:121 / 128
页数:8
相关论文
共 22 条
[1]  
[Anonymous], 1996, P 19 ANN INT ACM SIG, DOI DOI 10.1145/243199.243202
[2]  
BALDWIN B, 1998, P 3 C EMP METH NAT L
[3]  
BANKO M, 1999, IN PRESS P PACLING 9
[4]  
BUCKLEY C, 1985, 85686 TR CORNL U
[5]  
CARBONELL JG, 1998, P SIGIR 98 MELB AUST
[6]  
JING H, 1998, AAAI INT TEXT SUMM W, P60
[7]  
Jones K.S., 1996, Evaluating Natural Language Processing Systems: An Analysis and Review
[8]  
KLAVANS JL, 1995, P 1 ANN WORKSH IFIP
[9]  
LUHN PH, 1958, IBM J, P159
[10]  
MCKEOWN K, 1995, INFO P MANAGEMENT, V31, P5