Building association-rule based sequential classifiers for web-document prediction

被引:45
作者
Yang, Q [1 ]
Li, TY [1 ]
Wang, K [1 ]
机构
[1] Simon Fraser Univ, Sch Comp Sci, Burnaby, BC V5A 1S6, Canada
关键词
web log mining; sequential classifiers; presending web documents;
D O I
10.1023/B:DAMI.0000023675.04946.f1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Web servers keep track of web users' browsing behavior in web logs. From these logs, one can build statistical models that predict the users' next requests based on their current behavior. These data are complex due to their large size and sequential nature. In the past, researchers have proposed different methods for building association-rule based prediction models using the web logs, but there has been no systematic study on the relative merits of these methods. In this paper, we provide a comparative study on different kinds of sequential association rules for web document prediction. We show that the existing approaches can be cast under two important dimensions, namely the type of antecedents of rules and the criterion for selecting prediction rules. From this comparison we propose a best overall method and empirically test the proposed model on real web logs.
引用
收藏
页码:253 / 273
页数:21
相关论文
共 14 条
[1]  
AGRAWAL R, 1995, PROC INT CONF DATA, P3, DOI 10.1109/ICDE.1995.380415
[2]  
Agrawal R, 1994, P 20 INT C VER LARG, V1215, P487
[3]  
[Anonymous], 1996, Advances in Knowledge Discovery and Data Mining, DOI DOI 10.1007/978-3-319-31750-2.
[4]  
Breiman L., 1998, CLASSIFICATION REGRE
[5]  
Liu H, 1998, ELEC SOC S, V98, P86
[6]  
MANNILA H, 1998, P 1 INT C KNOWL DISC, P210
[7]  
Pei J, 2001, PROC INT CONF DATA, P215
[8]  
PITKOW J, 1999, 2 USENIX S INT TECHN
[9]  
Quinlan J. R., 2014, C4 5 PROGRAMS MACHIN
[10]  
SCHECHTER S, 1998, P 7 INT WORLD WID WE, P457