Robust fitting of mixture regression models

被引:78
作者
Bai, Xiuqin [1 ]
Yao, Weixin [1 ]
Boyer, John E. [1 ]
机构
[1] Kansas State Univ, Dept Stat, Manhattan, KS 66506 USA
关键词
EM algorithm; Mixture regression models; Outliers; Robust regression; MAXIMUM-LIKELIHOOD; LINEAR-REGRESSION; EM-ALGORITHM; POINT; CLUSTERS;
D O I
10.1016/j.csda.2012.01.016
中图分类号
TP39 [计算机的应用];
学科分类号
080201 [机械制造及其自动化];
摘要
The existing methods for fitting mixture regression models assume a normal distribution for error and then estimate the regression parameters by the maximum likelihood estimate (MLE). In this article, we demonstrate that the MLE, like the least squares estimate, is sensitive to outliers and heavy-tailed error distributions. We propose a robust estimation procedure and an EM-type algorithm to estimate the mixture regression models. Using a Monte Carlo simulation study, we demonstrate that the proposed new estimation method is robust and works much better than the MLE when there are outliers or the error distribution has heavy tails. In addition, the proposed robust method works comparably to the MLE when there are no outliers and the error is normal. A real data application is used to illustrate the success of the proposed robust estimation procedure. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:2347 / 2359
页数:13
相关论文
共 41 条
[1]
ROBUST METHOD FOR MULTIPLE LINEAR-REGRESSION [J].
ANDREWS, DF .
TECHNOMETRICS, 1974, 16 (04) :523-531
[2]
[Anonymous], J CLASSIFICATION
[3]
FITTING OF POWER-SERIES, MEANING POLYNOMIALS, ILLUSTRATED ON BAND-SPECTROSCOPIC DATA [J].
BEATON, AE ;
TUKEY, JW .
TECHNOMETRICS, 1974, 16 (02) :147-185
[4]
Computational and inferential difficulties with mixture posterior distributions. [J].
Celeux, G ;
Hurn, M ;
Robert, CP .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2000, 95 (451) :957-970
[5]
Chen JH, 2008, STAT SINICA, V18, P443
[6]
COHEN EA, 1984, MUSIC PERCEPT, V1, P323
[7]
MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[8]
An EM algorithm for estimating equations [J].
Elashoff, M ;
Ryan, L .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2004, 13 (01) :48-65
[9]
Robust clusterwise linear regression through trimming [J].
Garcia-Escudero, L. A. ;
Gordaliza, A. ;
Mayo-Iscar, A. ;
San Martin, R. .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2010, 54 (12) :3057-3069
[10]
Robust linear clustering [J].
Garcia-Escudero, L. A. ;
Gordaliza, A. ;
San Martin, R. ;
Van Aelst, S. ;
Zamar, R. .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2009, 71 :301-318