Automatic keyword identification by artificial neural networks compared to manual identification by users of filtering systems

被引:16
作者
Boger, Z
Kuflik, T
Shoval, P [1 ]
Shapira, B
机构
[1] Optimal Ind Neural Syst Ltd, Nucl Res Ctr, Beer Sheva, Israel
[2] Ben Gurion Univ Negev, Dept Ind Engn & Management, Informat Syst Program, IL-84105 Beer Sheva, Israel
[3] Rutgers State Univ, Sch Business, Dept MSIS, Piscataway, NJ USA
关键词
D O I
10.1016/S0306-4573(00)00030-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Information filtering (IF) systems usually filter data items by correlating a vector of terms that represent the user profile with similar vectors of terms that represent data items. Terms that represent data items can be determined by experts or automatic indexing methods. In this study we employ an artificial neural network (ANN) as an alternative method for both IF and term selection and compare its effectiveness to that of "traditional" methods. In an earlier study we developed and examined the performance of an IF system that employed content-based and stereotypic rule-based filtering methods in the domain of e-mail messages. In this study, we train a large-scale ANN-based filter, which uses meaningful terms in the same database as input, and use it to predict the relevance of those messages. Our results reveal that the ANN relevance prediction out-performs the prediction of the IF system. Moreover, we found very low correlation between the terms in the user profile (explicitly selected by the users) and the positive causal-index (CI) terms of the ANN, which indicate the relative importance of terms in messages. This implies that the users underestimate the importance of some terms, failing to include them in their profiles. This may explain the rather low prediction accuracy of the IF system. (C) 2001 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:187 / 198
页数:12
相关论文
共 33 条
[1]  
AAS K, 1997, 922 NORW COMP CTR
[2]  
BABA K, 1990, P INT JOINT C NEUR N, V3, P155
[3]   Fab: Content-based, collaborative recommendation [J].
Balabanovic, M ;
Shoham, Y .
COMMUNICATIONS OF THE ACM, 1997, 40 (03) :66-72
[4]  
BALABANOVIC M, 1995, SPRING S INF GATH HE
[5]   INFORMATION FILTERING AND INFORMATION-RETRIEVAL - 2 SIDES OF THE SAME COIN [J].
BELKIN, NJ ;
CROFT, WB .
COMMUNICATIONS OF THE ACM, 1992, 35 (12) :29-38
[6]  
Billsus D., 1998, P INT C MACH LEARN
[7]  
BOGER DL, 1992, ADV HETEROCYCLIC NAT, V2, P1
[8]   APPLICATION OF NEURAL NETWORKS FOR INTERPRETATION OF ION MOBILITY AND X-RAY-FLUORESCENCE SPECTRA [J].
BOGER, Z ;
KARPAS, Z .
ANALYTICA CHIMICA ACTA, 1994, 292 (03) :243-251
[9]   USE OF NEURAL NETWORKS FOR QUANTITATIVE MEASUREMENTS IN ION MOBILITY SPECTROMETRY (IMS) [J].
BOGER, Z ;
KARPAS, Z .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1994, 34 (03) :576-580
[10]  
Boger Z, 1997, IEEE SYS MAN CYBERN, P3030