A new approach to classification based on association rule mining

被引:108
作者
Chen, Guoqing [1 ]
Liu, Hongyan [1 ]
Yu, Lan [1 ]
Wei, Qiang [1 ]
Zhang, Xing [1 ]
机构
[1] Tsinghua Univ, Dept Management Sci & Engn, Sch Econ & Management, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
data mining; association rule; classification; information gain;
D O I
10.1016/j.dss.2005.03.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Classification is one of the key issues in the fields of decision sciences and knowledge discovery. This paper presents a new approach for constructing a classifier, based on an extended association rule mining technique in the context of classification. The characteristic of this approach is threefold: first, applying the information gain measure to the generation of candidate itemsets; second, integrating the process of frequent itemsets generation with the process of rule generation; third, incorporating strategies for avoiding rule redundancy and conflicts into the mining process. The corresponding mining algorithm proposed, namely GARC (Gain based Association Rule Classification), produces a classifier with satisfactory classification accuracy, compared with other classifiers (e.g., C4.5, CBA, SVM, NN). Moreover, in terms of association rule based classification, GARC could filter out many candidate itemsets in the generation process, resulting in a much smaller set of rules than that of CBA. (c) 2005 Elsevier B.V.. All rights reserved.
引用
收藏
页码:674 / 689
页数:16
相关论文
共 36 条
[1]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[2]   DATABASE MINING - A PERFORMANCE PERSPECTIVE [J].
AGRAWAL, R ;
IMIELINSKI, T ;
SWAMI, A .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1993, 5 (06) :914-925
[3]  
Agrawal R, 1994, P 20 INT C VER LARG, V1215, P487
[4]  
Ali K., 1997, Proceedings of the Third International Conference on Knowledge Discovery and Data Mining, P115
[5]  
ALSABTI K, P 4 INT C KNOWL DISC, P2
[6]  
[Anonymous], 1993, P 13 INT JOINT C ART
[7]  
BERTSIMAS D, 2000, DATA MODEL DECISIONS
[8]   SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669
[9]  
Brin S., 1997, SIGMOD Record, V26, P255, DOI [10.1145/253262.253327, 10.1145/253262.253325]
[10]  
CATLETT J, 1991, THESIS U SYDNEY