USING PROBABILISTIC MODELS OF DOCUMENT-RETRIEVAL WITHOUT RELEVANCE INFORMATION

被引：221

作者：

CROFT, WB ^{[1
]}

HARPER, DJ ^{[1
]}

机构：

[1] UNIV CAMBRIDGE,COMP LAB,CAMBRIDGE,ENGLAND

来源：

JOURNAL OF DOCUMENTATION | 1979年 / 35卷 / 04期

关键词：

D O I：

10.1108/eb026683

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Most probabilistic retrieval models incorporate information about the occurrence of index terms in relevant and nonrelevant documents. In this paper we consider the situation where no relevance information is available, that is, at the start of the search. Based on a probabilistic model, strategies are proposed for the initial search and an intermediate search. Retrieval experiments with the Cranfield collection of 1,400 documents show that this initial search strategy is better than conventional search strategies both in terms of retrieval effectiveness and in terms of the number of queries that retrieve relevant documents. The intermediate search is shown to be a useful substitute for a relevance feedback search. Experiments with queries that do not retrieve relevant documents at high rank positions indicate that a cluster search would be an effective alternative strategy. © 1979, MCB UP Limited

引用

页码：285 / 295

页数：11

共 10 条

[1] CROFT WB, 1979, THESIS U CAMBRIDGE
[2] EVALUATION OF FEEDBACK IN DOCUMENT-RETRIEVAL USING CO-OCCURRENCE DATA
HARPER, DJ
VANRIJSBERGEN, CJ
[J]. JOURNAL OF DOCUMENTATION, 1978, 34 (03) : 189 - 216
[3] IDE E, 1969, THESIS CORNELL U
[4] RELEVANCE WEIGHTING OF SEARCH TERMS
ROBERTSON, SE
SPARCK-JONES, K
[J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1976, 27 (03): : 129 - 146
[5] SALTON G, 1968, AUTOMATIC INFORMATIO
[6] SPARCKJONES K, 1979, J DOC, V35, P30
[7] SPARCKJONES K, 1977, RES AUTOMATIC INDEXI
[8] VANRIJSBERGEN C, 1979, INFORMATION RETRIEVA
[9] THEORETICAL BASIS FOR USE OF CO-OCCURRENCE DATA IN INFORMATION-RETRIEVAL
VANRIJSBERGEN, CJ
[J]. JOURNAL OF DOCUMENTATION, 1977, 33 (02) : 106 - 119
[10] PRECISION WEIGHTING - EFFECTIVE AUTOMATIC INDEXING METHOD
YU, CT
SALTON, G
[J]. JOURNAL OF THE ACM, 1976, 23 (01) : 76 - 88

← 1 →