Efficient mining of frequent episodes from complex sequences

Cited by: 65
Authors
Huang, Kuo-Yu [1]
Chang, Chia-Hui [1]
Affiliation
[1] Natl Cent Univ, Dept Comp Sci & Informat Engn, Chungli 320, Taiwan
Keywords
data mining; frequent episodes; temporal association
DOI
10.1016/j.is.2007.07.003
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Discovering patterns of great significance is an important problem in data mining. An episode is defined to be a partially ordered set of events for consecutive and fixed-time intervals in a sequence. Most previous studies on episodes consider only frequent episodes in a sequence of events (called a simple sequence). In the real world, we may find a set of events at each time slot in terms of various intervals (hours, days, weeks, etc.). We refer to such sequences as complex sequences. Mining frequent episodes in complex sequences has more extensive applications than mining them in simple sequences. In this paper, we discuss the problem of mining frequent episodes in a complex sequence. We extend the previous algorithm MINEPI to MINEPI+ for episode mining from complex sequences. Furthermore, a memory-anchored algorithm called EMMA is introduced for the mining task. Experimental evaluation on both real-world and synthetic data sets shows that EMMA is more efficient than MINEPI+. (C) 2007 Elsevier B.V. All rights reserved.
Pages: 96-114
Number of pages: 19