Robust mixture modelling using the t distribution

Cited by: 705
Authors
Peel, D [1]
McLachlan, GJ [1]
Affiliations
[1] Univ Queensland, Dept Math, St Lucia, Qld 4072, Australia
Keywords
finite mixture models; normal components; multivariate t components; maximum likelihood; EM algorithm; cluster analysis;
DOI
10.1023/A:1008981510081
Chinese Library Classification (CLC) number
TP301 [Theory, methods];
Discipline classification code
081202;
Abstract
Normal mixture models are being increasingly used to model the distributions of a wide variety of random phenomena and to cluster sets of continuous multivariate data. However, for a set of data containing a group or groups of observations with longer than normal tails or atypical observations, the use of normal components may unduly affect the fit of the mixture model. In this paper, we consider a more robust approach by modelling the data by a mixture of t distributions. The use of the ECM algorithm to fit this t mixture model is described and examples of its use are given in the context of clustering multivariate data in the presence of atypical observations in the form of background noise.
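To make the fitting idea in the abstract concrete, here is a minimal Python sketch of an EM-type iteration for a mixture of multivariate t components. It is not the authors' implementation: it assumes the degrees of freedom `nu` are held fixed and shared by all components, and the names `t_logpdf`, `fit_t_mixture`, and `mu_init` are hypothetical. The point it illustrates is the weight `u`, which shrinks the influence of observations far from a component centre, so atypical points pull the means and covariances less than they would under normal components.

```python
import numpy as np
from scipy.special import gammaln


def t_logpdf(X, mu, Sigma, nu):
    """Return (log density, squared Mahalanobis distance) for each row of X
    under a p-variate t distribution with location mu and scale matrix Sigma."""
    p = X.shape[1]
    L = np.linalg.cholesky(Sigma)
    z = np.linalg.solve(L, (X - mu).T)            # shape (p, n)
    maha = np.sum(z ** 2, axis=0)
    logdet = 2.0 * np.sum(np.log(np.diag(L)))
    log_norm = (gammaln((nu + p) / 2.0) - gammaln(nu / 2.0)
                - 0.5 * p * np.log(nu * np.pi) - 0.5 * logdet)
    return log_norm - 0.5 * (nu + p) * np.log1p(maha / nu), maha


def fit_t_mixture(X, g, nu=4.0, n_iter=100, mu_init=None, seed=0):
    """Fit a g-component t mixture with fixed nu (an assumption of this sketch)."""
    rng = np.random.default_rng(seed)
    n, p = X.shape
    pi = np.full(g, 1.0 / g)
    if mu_init is None:
        # Start the locations at randomly chosen observations.
        mu = X[rng.choice(n, size=g, replace=False)].astype(float)
    else:
        mu = np.array(mu_init, dtype=float)
    # Start every scale matrix at the overall sample covariance.
    S0 = np.atleast_2d(np.cov(X, rowvar=False)) + 1e-6 * np.eye(p)
    Sigma = np.array([S0.copy() for _ in range(g)])

    for _ in range(n_iter):
        # E-step: posterior membership probabilities tau and robustness weights u.
        log_w = np.empty((n, g))
        maha = np.empty((n, g))
        for i in range(g):
            lp, maha[:, i] = t_logpdf(X, mu[i], Sigma[i], nu)
            log_w[:, i] = np.log(pi[i]) + lp
        log_w -= log_w.max(axis=1, keepdims=True)  # guard against underflow
        tau = np.exp(log_w)
        tau /= tau.sum(axis=1, keepdims=True)
        u = (nu + p) / (nu + maha)                 # small for outlying points

        # M-step: weighted updates of proportions, locations, scale matrices.
        pi = tau.mean(axis=0)
        for i in range(g):
            w = tau[:, i] * u[:, i]
            mu[i] = w @ X / w.sum()
            d = X - mu[i]
            Sigma[i] = (w[:, None] * d).T @ d / tau[:, i].sum()
            Sigma[i] += 1e-8 * np.eye(p)           # small ridge for stability
    return pi, mu, Sigma, tau
```

Calling `fit_t_mixture(X, g=2)` on an `(n, p)` array returns the mixing proportions, component locations, scale matrices, and the posterior membership probabilities, which can be used to assign each observation to a cluster. Estimating `nu` from the data would need an additional update step that this sketch omits.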
Pages: 339-348
Number of pages: 10
Related papers
39 in total
[1]  
AITCHISON J, 1975, STAT PREDICTION ANA
[2]  
[Anonymous], SANKHYA A
[3]  
[Anonymous], ENCY STAT SCI
[4]  
BOHNING D, 1999, COMPUTER ASSISTED AN
[5]   MULTIVARIATE STUDY OF VARIATION IN 2 SPECIES OF ROCK CRAB OF GENUS LEPTOGRAPSUS [J].
CAMPBELL, NA ;
MAHON, RJ .
AUSTRALIAN JOURNAL OF ZOOLOGY, 1974, 22 (03) :417-425
[6]   MIXTURE-MODELS AND ATYPICAL VALUES [J].
CAMPBELL, NA .
JOURNAL OF THE INTERNATIONAL ASSOCIATION FOR MATHEMATICAL GEOLOGY, 1984, 16 (05) :465-477
[7]   Robust clustering methods: A unified view [J].
Dave, RN ;
Krishnapuram, R .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 1997, 5 (02) :270-293
[8]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[9]   ROBUST ESTIMATION OF A NORMAL MIXTURE [J].
DEVEAUX, RD ;
KRIEGER, AM .
STATISTICS & PROBABILITY LETTERS, 1990, 10 (01) :1-7
[10]  
Everitt B. S., 1981, FINITE MIXTURE DISTR