ON MODELING OF INFORMATION-RETRIEVAL CONCEPTS IN VECTOR-SPACES

被引:76
作者
WONG, SKM
ZIARKO, W
RAGHAVAN, VV
WONG, PCN
机构
[1] Univ of Regina, Regina, Sask, Can, Univ of Regina, Regina, Sask, Can
来源
ACM TRANSACTIONS ON DATABASE SYSTEMS | 1987年 / 12卷 / 02期
关键词
INFORMATION RETRIEVAL SYSTEMS - Mathematical Models;
D O I
10.1145/22952.22957
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Vector Space Model (VSM) has been adopted in information retrieval as a means of coping with inexact representation of documents and queries, and the resulting difficulties in determining the relevance of a document relative to a given query. A generalization of the VSM, called the GVSM, is advanced. The developments provide a solution not only for the computation of a measure of similarity (correlation) between terms, but also for the incorporation of these similarities into the retrieval process. The major strength of the GVSM derives from the fact that it is theoretically sound and elegant. Furthermore, experimental evaluation of the model on several test collections indicates that the performance is better than that of the VSM.
引用
收藏
页码:299 / 321
页数:23
相关论文
共 17 条
[1]  
GORDON MD, 1985, 8TH P ANN ACM SIGIR, P179
[2]   EVALUATION OF FEEDBACK IN DOCUMENT-RETRIEVAL USING CO-OCCURRENCE DATA [J].
HARPER, DJ ;
VANRIJSBERGEN, CJ .
JOURNAL OF DOCUMENTATION, 1978, 34 (03) :189-216
[3]  
MINKER J, 1972, INFORM STORAGE RET, V8, P329, DOI 10.1016/0020-0271(72)90021-6
[4]  
Raghavan V. V., 1979, ACM Transactions on Database Systems, V4, P240, DOI 10.1145/320071.320081
[5]  
RAGHAVAN VV, 1986, J AM SOC INFORM SCI, V37, P279, DOI 10.1002/(SICI)1097-4571(198609)37:5<279::AID-ASI1>3.0.CO
[6]  
2-Q
[8]   COMPUTER EVALUATION OF INDEXING AND TEXT PROCESSING [J].
SALTON, G ;
LESK, ME .
JOURNAL OF THE ACM, 1968, 15 (01) :8-&
[9]  
SALTON G, 1972, INFORMATION PROCESSI, P115
[10]  
SALTON G, 1983, INTRO MODERN INFORMA