Passage retrieval: A probabilistic technique

Cited by: 21
Authors
Melucci, M [1]
Affiliations
[1] Univ Padua, Dipartimento Elettron & Informat, I-35131 Padua, Italy
DOI
10.1016/S0306-4573(97)00047-2
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper presents a probabilistic technique to retrieve passages from texts having a large size or heterogeneous semantic content. The proposed technique is independent of any supporting auxiliary data, such as text structure, topic organization, or pre-defined text segments. A Bayesian framework implements the probabilistic technique. We carried out experiments to compare the probabilistic technique to one based on a text segmentation algorithm. In particular, the probabilistic technique is more effective than, or as effective as, the one based on text segmentation for retrieving small passages. Results show that passage size affects passage retrieval performance. Results also suggest that text organization and query generality may have an impact on the difference in effectiveness between the two techniques. © 1998 Elsevier Science Ltd. All rights reserved.
Pages: 43-68
Page count: 26