Neural network applications for automatic new topic identification

被引:22
作者
Özmutlu, S [1 ]
Çavdur, F [1 ]
机构
[1] Uludag Univ, Dept Ind Engn, Bursa, Turkey
关键词
search engine; neural nets; information retrieval;
D O I
10.1108/14684520510583936
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Purpose - This study aims to propose an artificial neural network to identify automatically topic changes in a user session by using the statistical characteristics of queries, such as time intervals and query reformulation patterns. Design/methodology/approach - A sample data log from the Norwegian search engine FAST (currently owned by Overture) is selected to train the neural network and then the neural network is used to identify topic changes in the data log. Findings - A total of 98.4 percent of topic shifts and 86.6 percent of topic continuations were estimated correctly. Originality/value - Content analysis of search engine user queries is an important task, since successful exploitation of the content of queries can result in the design of efficient information retrieval algorithms for search engines, which can offer custom-tailored services to the web user. Identification of topic changes within a user search session is a key issue in the content analysis of search engine user queries.
引用
收藏
页码:34 / 53
页数:20
相关论文
共 37 条
[1]  
[Anonymous], 1994, NEURAL NETWORKS
[2]  
Beeferman D., 2000, Proceedings. KDD-2000. Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, P407, DOI 10.1145/347090.347176
[3]  
Beitzel S. M., 2004, Proceedings of Sheffield SIGIR 2004. The Twenty-Seventh Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P321, DOI 10.1145/1008992.1009048
[4]  
Cooley R., 1999, Knowledge and Information Systems, V1, P5
[5]   DEFINITION OF RELEVANCE FOR INFORMATION RETRIEVAL [J].
COOPER, WS .
INFORMATION STORAGE AND RETRIEVAL, 1971, 7 (01) :19-&
[6]   Context learning in Okapi [J].
Goker, A .
JOURNAL OF DOCUMENTATION, 1997, 53 (01) :80-83
[7]  
GOKER A, 2000, P AH2000 INT C AD HY, P319
[8]  
GREISDORF H, 1993, J AM SOC INFORM SCI, V54, P1296
[9]  
He D., 2000, P BCS IRSG 22 ANN C, P57
[10]   Combining evidence for automatic Web session identification [J].
He, DQ ;
Göker, A ;
Harper, DJ .
INFORMATION PROCESSING & MANAGEMENT, 2002, 38 (05) :727-742