Mining association rules with multiple minimum supports using maximum constraints

被引:55
作者
Lee, YC
Hong, TP [1 ]
Lin, WY
机构
[1] Kaohsiung Natl Univ, Dept Elect Engn, Kaohsiung 811, Taiwan
[2] I Shou Univ, Inst Informat Engn, Kaohsiung 840, Taiwan
[3] Kaohsiung Natl Univ, Dept Comp Sci & Informat Engn, Kaohsiung 811, Taiwan
关键词
data mining; multiple minimum supports; association rule; maximum constraint;
D O I
10.1016/j.ijar.2004.11.006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data mining is the process of extracting desirable knowledge or interesting patterns from existing databases for specific purposes. Most of the previous approaches set a single minimum support threshold for all the items or itemsets. But in real applications, different items may have different criteria to judge its importance. The support requirements should then vary with different items. In this paper, we provide another point of view about defining the minimum supports of itemsets when items have different minimum supports. The maximum constraint is used, which is well explained and may be suitable to some mining domains. We then propose a simple algorithm based on the Apriori approach to find the large-itemsets and association rules under this constraint. The proposed algorithm is easy and efficient when compared to Wang et al.'s under the maximum constraint. The numbers of association rules and large itemsets obtained by the proposed mining algorithm using the maximum constraint are also less than those using the minimum constraint. Whether to adopt the proposed approach thus depends on the requirements of mining problems. Besides, the granular computing technique of bit strings is used to speed up the proposed data mining algorithm. (c) 2005 Elsevier Inc. All rights reserved.
引用
收藏
页码:44 / 54
页数:11
相关论文
共 20 条
  • [1] Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
  • [2] DATABASE MINING - A PERFORMANCE PERSPECTIVE
    AGRAWAL, R
    IMIELINSKI, T
    SWAMI, A
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1993, 5 (06) : 914 - 925
  • [3] AGRAWAL R, 1995, PROC INT CONF DATA, P3, DOI 10.1109/ICDE.1995.380415
  • [4] AGRAWAL R, 1997, 3 INT C KNOWL DISC D, P67
  • [5] Agrawal R, 1994, P 20 INT C VER LARG, V1215, P487
  • [6] [Anonymous], 21 INT C VLDB
  • [7] [Anonymous], P 2000 INT C VER LAR
  • [8] FRAWLEY WJ, 1991, AAAI WORKSH KNOWL DI, P1
  • [9] Fukuda T., 1996, Proceedings of the Fifteenth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems. PODS 1996, P182, DOI 10.1145/237661.237708
  • [10] Hong T. P., 2001, Intell. Data Anal., P111, DOI DOI 10.3233/IDA-2001-5203