SOME INCONSISTENCIES AND MISIDENTIFIED MODELING ASSUMPTIONS IN PROBABILISTIC INFORMATION-RETRIEVAL

被引:29
作者
COOPER, WS
机构
[1] University of California, Berkeley
关键词
ASSUMPTIONS; BIBLIOGRAPHIC SEARCHING; CONSISTENCY; DOCUMENT RETRIEVAL; INDEPENDENCE; LOGIC; MODELING;
D O I
10.1145/195705.195735
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Research in the probabilistic theory of information retrieval involves the construction of mathematical models based on statistical assumptions. One of the hazards inherent in this kind of theory construction is that the assumptions laid down may be inconsistent in unanticipated ways with the data to which they are applied. Another hazard is that the stated assumptions may not be those on which the derived modeling equations or resulting experiments are actually based. Both kinds of mistakes have been made in past research on probabilistic information retrieval. One consequence of these errors is that the statistical character of certain probabilistic IR models, including the so-called Binary Independence model, has been seriously misapprehended.
引用
收藏
页码:100 / 111
页数:12
相关论文
共 19 条
[1]   EXPLOITING THE MAXIMUM-ENTROPY PRINCIPLE TO INCREASE RETRIEVAL EFFECTIVENESS [J].
COOPER, WS .
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1983, 34 (01) :31-39
[2]  
COOPER WS, 1982, INFORMATION TECHNOLO, V1, P99
[3]  
Eells E, 1982, RATIONAL DECISION CA
[4]   A PROBABILISTIC LEARNING APPROACH FOR DOCUMENT INDEXING [J].
FUHR, N ;
BUCKLEY, C .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 1991, 9 (03) :223-248
[5]  
FUHR N, 1991, 13TH P INT C RES DEV, P345
[6]   EVALUATION OF FEEDBACK IN DOCUMENT-RETRIEVAL USING CO-OCCURRENCE DATA [J].
HARPER, DJ ;
VANRIJSBERGEN, CJ .
JOURNAL OF DOCUMENTATION, 1978, 34 (03) :189-216
[7]  
KANTOR PB, 1984, INFORM TECHNOL R & D, V3, P88
[8]  
LEE JJ, 1991, J AM SOC INFORM SCI, V42, P166, DOI 10.1002/(SICI)1097-4571(199104)42:3<166::AID-ASI2>3.0.CO
[9]  
2-A
[10]   ON RELEVANCE, PROBABILISTIC INDEXING AND INFORMATION RETRIEVAL [J].
MARON, ME ;
KUHNS, JL .
JOURNAL OF THE ACM, 1960, 7 (03) :216-244