Classification by clustering decision tree-like classifier based on adjusted clusters

被引:28
作者
Aviad, Barak [1 ]
Roy, Gelbard [1 ]
机构
[1] Bar Ilan Univ, Informat Syst Program, Grad Sch Business Adm, IL-52900 Ramat Gan, Israel
关键词
Classification; Classifier; Cluster analysis; Decision trees decision rule;
D O I
10.1016/j.eswa.2011.01.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Currently cluster analysis techniques are used mainly to aggregate objects into groups according to similarity measures. Whether the number of groups is pre-defined (supervised clustering) or not (unsupervised clustering), clustering techniques do not provide decision rules or a decision tree for the associations that are implemented. The current study proposes and evaluates a new technique to define decision tree based on cluster analysis. The proposed model was applied and tested on two large datasets of real life HR classification problems. The results of the model were compared to results obtained by conventional decision trees. It was found that the decision rules obtained by the model are at least as good as those obtained by conventional decision trees. In some cases the model yields better results than decision trees. In addition, a new measure is developed to help fine-tune the clustering model to achieve better and more accurate results. (C) 2011 Elsevier Ltd. All rights reserved.
引用
收藏
页码:8220 / 8228
页数:9
相关论文
共 22 条
[11]   Fast and robust general purpose clustering algorithms [J].
Estivill-Castro, V ;
Yang, J .
DATA MINING AND KNOWLEDGE DISCOVERY, 2004, 8 (02) :127-150
[12]   Clustering by passing messages between data points [J].
Frey, Brendan J. ;
Dueck, Delbert .
SCIENCE, 2007, 315 (5814) :972-976
[13]   Investigating diversity of clustering methods: An empirical comparison [J].
Gelbard, Roy ;
Goldman, Orit ;
Spiegler, Israel .
DATA & KNOWLEDGE ENGINEERING, 2007, 63 (01) :155-166
[14]  
Hand D., 2001, ADAP COMP MACH LEARN
[15]   Data clustering: A review [J].
Jain, AK ;
Murty, MN ;
Flynn, PJ .
ACM COMPUTING SURVEYS, 1999, 31 (03) :264-323
[16]   Mining supervised classification performance studies: A meta-analytic investigation [J].
Jamain, Adrien ;
Hand, David J. .
JOURNAL OF CLASSIFICATION, 2008, 25 (01) :87-112
[17]  
MacQueen J., 1967, P 5 BERK S MATH STAT, V1, P281, DOI DOI 10.1007/S11665-016-2173-6
[18]   Application of data mining techniques in customer relationship management: A literature review and classification [J].
Ngai, E. W. T. ;
Xiu, Li ;
Chau, D. C. K. .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) :2592-2602
[19]  
Pregibon D., 1996, DVANCES KNOWLEDGE DI, P83
[20]   A database clustering methodology and tool [J].
Ryu, TW ;
Eick, CF .
INFORMATION SCIENCES, 2005, 171 (1-3) :29-59