Looking into the seeds of time: Discovering temporal patterns in large transaction sets

被引:16
作者
Li, YJ [1 ]
Zhu, SC
Wang, XS
Jajodia, S
机构
[1] Singapore Management Univ, Sch Informat Syst, Singapore 259756, Singapore
[2] Penn State Univ, Dept Comp Sci & Engn, University Pk, PA 16802 USA
[3] Univ Vermont, Dept Comp Sci, Burlington, VT 05405 USA
[4] George Mason Univ, Ctr Secure Informat Syst, Fairfax, VA 22030 USA
关键词
knowledge discovery; temporal data mining; temporal pattern;
D O I
10.1016/j.ins.2005.01.019
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper studies the problem of mining frequent itemsets along with their temporal patterns from large transaction sets. A model is proposed in which users define a large set of temporal patterns that are interesting or meaningful to them. A temporal pattern defines the set of time points where the user expects a discovered itemset to be frequent. The model is general in that (i) no constraints are placed on the interesting patterns given by the users, and (ii) two measures-inclusiveness and exchisiveness-are used to capture how well the temporal patterns match the time points given by the discovered itemsets. Intuitively, these measures indicate to what extent a discovered itemset is frequent at time points included in a temporal pattern p, but not at time points not in p. Using these two measures, one is able to model many temporal data mining problems appeared in the literature, as well as those that have not been studied. By exploiting the relationship within and between itemset space and pattern space simultaneously, a series of pruning techniques are developed to speed up the mining process. Experiments show that these pruning techniques allow one to obtain performance benefits up to 100 times over a direct extension of non-temporal data mining algorithms. (C) 2005 Elsevier Inc. All rights reserved.
引用
收藏
页码:1003 / 1031
页数:29
相关论文
共 20 条
  • [1] AGRAWAL R, 1995, PROC INT CONF DATA, P3, DOI 10.1109/ICDE.1995.380415
  • [2] Agrawal R, 1994, P 20 INT C VER LARG, V1215, P487
  • [3] [Anonymous], P 1998 ACM SIGMOD IN
  • [4] [Anonymous], 1999, Proceedings of the fifth ACM SIGKDD international conference on knowledge discovery and data mining p, DOI [10.1145/312129., DOI 10.1145/312129, 10.1145/312129, 10.1145/312129.312191]
  • [5] [Anonymous], 1995, P 1 SIGKDD INT C KNO
  • [6] Discovering frequent event patterns with multiple granularities in time sequences
    Bettini, C
    Wang, XS
    Jajodia, S
    Lin, JL
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1998, 10 (02) : 222 - 237
  • [7] Brin S., 1997, SIGMOD Record, V26, P255, DOI [10.1145/253262.253327, 10.1145/253262.253325]
  • [8] Data organization and access for efficient data mining
    Dunkel, B
    Soparkar, N
    [J]. 15TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 1999, : 522 - 529
  • [9] HAN J, 2000, P 2000 ACM SIGMOD IN, P1, DOI DOI 10.1145/342009.335372
  • [10] Efficient mining of partial periodic patterns in time series database
    Han, JW
    Dong, GZ
    Yin, YW
    [J]. 15TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 1999, : 106 - 115