Data mining with optimized two-dimensional association rules

被引:69
作者
Fukuda, T
Morimoto, Y
Morishita, S
Tokuyama, T
机构
[1] IBM Corp, Tokyo Res Lab, Kanagawa 242, Japan
[2] Univ Tokyo, Tokyo, Japan
来源
ACM TRANSACTIONS ON DATABASE SYSTEMS | 2001年 / 26卷 / 02期
关键词
algorithms; management; association rules; convex hull searching; data mining; image segmentation; matrix searching;
D O I
10.1145/383891.383893
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We discuss data mining based on association rules for two numeric attributes and one Boolean attribute. For example, in a database of bank customers, "Age" and "Balance" are two numeric attributes, and "CardLoan" is a Boolean attribute. Taking the pair (Age, Balance) as a point in two-dimensional space, we consider an association rule of the form ((Age, Balance) is an element of P) double right arrow (CardLoan = Yes), which implies that bank customers whose ages and balances fall within a planar region P tend to take out credit card loans with a high probability. We consider two classes of regions, rectangles and admissible (i.e., connected and x-monotone) regions. For each class, we propose efficient algorithms for computing the regions that give optimal association rules for gain, support, and confidence, respectively. We have implemented the algorithms for admissible regions as well as several advanced functions based on them in our data mining system named SONAR (System for Optimized Numeric Association Rules), where the rules are visualized by using a graphic user interface to make it easy for users to gain an intuitive understanding of rules.
引用
收藏
页码:179 / 213
页数:35
相关论文
共 35 条
[1]   GEOMETRIC APPLICATIONS OF A MATRIX-SEARCHING ALGORITHM [J].
AGGARWAL, A ;
KLAWE, MM ;
MORAN, S ;
SHOR, P ;
WILBER, R .
ALGORITHMICA, 1987, 2 (02) :195-208
[2]  
AGRAWAL R, 1992, PROC INT CONF VERY L, P560
[3]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[4]   DATABASE MINING - A PERFORMANCE PERSPECTIVE [J].
AGRAWAL, R ;
IMIELINSKI, T ;
SWAMI, A .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1993, 5 (06) :914-925
[5]  
Agrawal R., 1994, P 20 INT C VER LARG, V1215, P487
[6]  
[Anonymous], 1989, OPTIMIZATION HDB OPE
[7]  
Asano T, 1996, PROCEEDINGS OF THE SEVENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P104
[8]  
Bentley J., 1984, Communications of the ACM, V27, P865, DOI 10.1145/358234.381162
[9]   SEGMENTATION THROUGH VARIABLE-ORDER SURFACE FITTING [J].
BESL, PJ ;
JAIN, RC .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1988, 10 (02) :167-192
[10]  
Breiman L., 1984, BIOMETRICS, DOI DOI 10.2307/2530946