IMPROVING THE RETRIEVAL OF INFORMATION FROM EXTERNAL SOURCES

Cited by: 285
Author
DUMAIS, ST
Affiliation
[1] Bellcore, 445 South St., Morristown, NJ 07962-1910
Source
BEHAVIOR RESEARCH METHODS INSTRUMENTS & COMPUTERS | 1991 / Vol. 23 / No. 02
DOI
10.3758/BF03203370
Chinese Library Classification (CLC) number
B841 [Research methods in psychology]
Discipline classification code
040201
Abstract
A major barrier to successful retrieval from external sources (e.g., electronic databases) is the tremendous variability in the words that people use to describe objects of interest. The fact that different authors use different words to describe essentially the same idea means that relevant objects will be missed; conversely, the fact that the same word can be used to refer to many different things means that irrelevant objects will be retrieved. We describe a statistical method called latent semantic indexing, which models the implicit higher order structure in the association of words and objects and improves retrieval performance by up to 30%. Additional large performance improvements of 40% and 67% can be achieved through the use of differential term weighting and iterative retrieval methods. © 1991 Psychonomic Society, Inc.
Pages: 229-236
Page count: 8