SpeedTracer: A Web usage mining and analysis tool

被引:32
作者
Wu, KL [1 ]
Yu, PS [1 ]
Ballman, A [1 ]
机构
[1] IBM Corp, Div Res, Thomas J Watson Res Ctr, Internet Technol Dept,Software Tools & Tech Grp, Yorktown Hts, NY 10598 USA
关键词
D O I
10.1147/sj.371.0089
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
SpeedTracer, a World Wide Web usage mining and analysis tool, was developed to understand user surfing behavior by exploring the Web server log files with data mining techniques. As the popularity of the Web has exploded, there is a strong desire to understand user surfing behavior. However, it is difficult to perform user-oriented data mining and analysis directly on the server log files because they tend to be ambiguous and incomplete. With innovative algorithms, SpeedTracer first identifies user sessions by reconstructing user traversal paths. It does not require "cookies" or user registration for session identification. User privacy is protected. Once user sessions are identified, data mining algorithms are then applied to discover the most common traversal paths and groups of pages frequently visited together. Important user browsing patterns are manifested through the frequent traversal paths and page groups, helping the understanding of user surfing behavior. Three types of reports are prepared: user-based reports, path-based reports and group-based reports. In this paper, we describe the design of SpeedTracer and demonstrate some of its features with a few sample reports.
引用
收藏
页码:89 / 105
页数:17
相关论文
共 9 条
[1]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[2]  
AGRAWAL R, 1995, PROC INT CONF DATA, P3, DOI 10.1109/ICDE.1995.380415
[3]  
Agrawal R., 1994, P 20 INT C VER LARG, P478
[4]  
[Anonymous], P PYOC ACM SIGMOD IN
[5]  
[Anonymous], P HUM FACT COMP SYST
[6]   Data mining for path traversal patterns in a web environment [J].
Chen, MS ;
Park, JS ;
Yu, PS .
PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, 1996, :385-392
[7]  
Han Y., 2021, P420
[8]  
MOBASHER B, 1996, 96050 U MINN DEP COM
[9]  
PITKOW J, 1997, P 6 INT WORLD WID WE