Genetic-based EM algorithm for learning Gaussian mixture models

被引:187
作者
Pernkopf, F
Bouchaffra, D
机构
[1] Univ Washington, Dept Elect Engn, Seattle, WA 98195 USA
[2] Graz Univ Technol, Lab Signal Proc & Speech Commun, A-8010 Graz, Austria
[3] Oakland Univ, Dept Comp Sci & Engn, Rochester, MI 48309 USA
关键词
unsupervised learning; clustering; Gaussian mixture models; EM algorithm; genetic algorithm; minimum description length;
D O I
10.1109/TPAMI.2005.162
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a genetic-based expectation-maximization (GA-EM) algorithm for learning Gaussian mixture models from multivariate data. This algorithm is capable of selecting the number of components of the model using the minimum description length (MDL) criterion. Our approach benefits from the properties of Genetic algorithms (GA) and the EM algorithm by combination of both into a single procedure. The population-based stochastic search of the GA explores the search space more thoroughly than the EM method. Therefore, our algorithm enables escaping from local optimal solutions since the algorithm becomes less sensitive to its initialization. The GA-EM algorithm is elitist which maintains the monotonic convergence property of the EM algorithm. The experiments on simulated and real data show that the GA-EM outperforms the EM method since: 1) We have obtained a better MDL score while using exactly the same termination condition for both algorithms. 2) Our approach identifies the number of components which were used to generate the underlying data more often than the EM algorithm.
引用
收藏
页码:1344 / 1348
页数:5
相关论文
共 19 条
  • [1] [Anonymous], HOW SOLVE IT MODERN
  • [2] Evolutionary computation: An overview
    Back, T
    Schwefel, HP
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON EVOLUTIONARY COMPUTATION (ICEC '96), PROCEEDINGS OF, 1996, : 20 - 29
  • [3] Back T., 1996, EVOLUTIONARY ALGORIT
  • [4] Celeux G., 1999, 3746 INRIA
  • [5] Dasgupta S., 1999, P 40 ANN S FDN COMP, P634, DOI DOI 10.1109/SFFCS.1999.814639
  • [6] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM
    DEMPSTER, AP
    LAIRD, NM
    RUBIN, DB
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01): : 1 - 38
  • [7] Duda R. O., 2000, PATTERN CLASSIFICATI
  • [8] Unsupervised learning of finite mixture models
    Figueiredo, MAT
    Jain, AK
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (03) : 381 - 396
  • [9] Clustering in image space for place recognition and visual annotations for human-robot interaction
    Martínez, AM
    Vitrià, J
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2001, 31 (05): : 669 - 682
  • [10] Learning mixture models using a genetic version of the EM algorithm
    Martínez, AM
    Vitrià, J
    [J]. PATTERN RECOGNITION LETTERS, 2000, 21 (08) : 759 - 769