Information filtering via hill climbing, WordNet, and index patterns

被引:21
作者
Mock, KJ
Vemuri, VR
机构
[1] UNIV CALIF DAVIS,DEPT COMP SCI,DAVIS,CA 95616
[2] UNIV CALIF DAVIS,DEPT APPL SCI,LIVERMORE,CA 94550
关键词
D O I
10.1016/S0306-4573(97)00022-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The recent growth of the Internet has left many users awash in a sea of information. This development has spawned the need for intelligent filtering systems. This paper describes work implemented in the INFOS (Intelligent News Filtering Organizational System) project that is designed to reduce the user's search burden by automatically categorizing data as relevant or irrelevant based upon user interests. These predictions are learned automatically based upon features taken from input articles and collaborative features derived from other users. The filtering is performed by a hybrid technique that combines elements of a keyword-based hill climbing method, knowledge-based conceptual representation via WordNet, and partial parsing via index patterns. The hybrid system integrating all these approaches combines the benefits of each while maintaining robustness and scalability. (C) 1997 Elsevier Science Ltd.
引用
收藏
页码:633 / 644
页数:12
相关论文
共 20 条
[1]  
BREWER RS, 1994, COLLABORATIVE KNOWLE
[2]  
COLLIS KF, 1980, COGNITION DEV INSTRU, P65
[3]  
DEJONG G, 1982, STRATEGIES NATURAL L, P147
[4]  
EBERTS R, 1991, P 1991 IEEE INT C SY, P1331
[5]  
EVANS DA, 1991, P RIAO 91 BARCELONA, P624
[6]  
JENNINGS A, 1992, IEICE T INF SYST, VE75D, P198
[7]  
Lang K, 1995, P 12 INT MACH LEARN
[8]  
LASHKARI Y, 1994, P 12 NAT C ART INT S, P449
[9]  
MAULDIN ML, 1991, CONCEPTUAL INFORMATI
[10]   WORDNET - A LEXICAL DATABASE FOR ENGLISH [J].
MILLER, GA .
COMMUNICATIONS OF THE ACM, 1995, 38 (11) :39-41