On determining efficient finite mixture models with compact and essential components for clustering data

被引:2
作者
Abas, Ahmed R. [1 ,2 ]
机构
[1] Umm Al Qura Univ, Coll Comp Leith, Dept Comp Sci, Makka Al Mukarrama, Saudi Arabia
[2] Zagazig Univ, Fac Comp & Informat, Dept Comp Sci, Zagazig, Egypt
关键词
Finite mixture models; Clustering; Model selection; Mutual information; Compact components;
D O I
10.1016/j.eij.2013.02.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, an algorithm is proposed to learn and evaluate different finite mixture models (FMMs) for data clustering using a new proposed criterion. The FMM corresponds to the minimum value of the proposed criterion is considered the most efficient FMM with compact and essential components for clustering an input data. The proposed algorithm is referred to as the EMCE algorithm in this paper. The selected FMM by the EMCE algorithm is efficient, in terms of its complexity and composed of compact and essential components. Essential components have minimum mutual information, that is, redundancy, among them, and therefore, they have minimum overlapping among them. The performance of the EMCE algorithm is compared with the performances of other algorithms in the literature. Results show the superiority of the proposed algorithm to other algorithms compared, especially with small data sets that are sparsely distributed or generated from overlapping clusters. (C) 2013 Faculty of Computers and Information, Cairo University. Production and hosting by Elsevier B.V. All rights reserved.
引用
收藏
页码:79 / 88
页数:10
相关论文
共 42 条
[1]   An algorithm for unsupervised learning and optimization of finite mixture models [J].
Abas, Ahmed R. .
EGYPTIAN INFORMATICS JOURNAL, 2011, 12 (01) :19-27
[2]  
[Anonymous], 2003, STAT PATTERN RECOGNI
[3]  
Banfield J, 1993, J BIOMET, V49, P803
[4]  
Bezdek J. C., 1981, PATTERN RECOGNITION
[5]  
Biernacki C, 1997, COMPUTING SCI STAT, V29, P451
[6]  
Bozdogan H., 1988, Classification and Related Methods of Data Analysis. Proceedings of the First Conference of the International Federation of Classification Societies (IFCS), P599
[7]  
Bozdogan H., 1990, J COMMUN STAT THEORY, V19, P221
[8]   An entropy criterion for assessing the number of clusters in a mixture model [J].
Celeux, G ;
Soromenho, G .
JOURNAL OF CLASSIFICATION, 1996, 13 (02) :195-212
[9]   A component-wise EM algorithm for mixtures [J].
Celeux, G ;
Chrétien, S ;
Forbes, F ;
Mkhadri, A .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2001, 10 (04) :697-712
[10]   Maximum weighted likelihood via rival penalized EM for density mixture clustering with automatic model selection [J].
Cheung, YM .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (06) :750-761