A tutorial on MM algorithms

被引:1315
作者
Hunter, DR [1 ]
Lange, K
机构
[1] Penn State Univ, Dept Stat, University Pk, PA 16802 USA
[2] Univ Calif Los Angeles, David Geffen Sch Med, Dept Biomath, Los Angeles, CA USA
[3] Univ Calif Los Angeles, David Geffen Sch Med, Dept Human Genet, Los Angeles, CA USA
关键词
constrained optimization; EM algorithm; majorization; minorization; Newton-Raphson;
D O I
10.1198/0003130042836
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Most problems in frequentist statistics involve optimization of a function such as a likelihood or a sum of squares. EM algorithms are among the most effective algorithms for maximum likelihood estimation because they consistently drive the likelihood uphill by maximizing a simple surrogate function for the log-likelihood. Iterative optimization of a surrogate function as exemplified by an EM algorithm does not necessarily require missing data. Indeed, every EM algorithm is a special case of the more general class of MM optimization algorithms, which typically exploit convexity rather than missing data in majorizing or minorizing an objective function. In our opinion, MM algorithms deserve to be part of the standard toolkit of professional statisticians. This article explains the principle behind MM algorithms, suggests some methods for constructing them, and discusses some of their attractive features. We include numerous examples throughout the article to illustrate the concepts described. In addition to surveying previous work on MM algorithms, this article introduces some new material on constrained optimization and standard error estimation.
引用
收藏
页码:30 / 37
页数:8
相关论文
共 39 条
[31]  
Maher M. J., 1982, Statistica Neerlandica, V36, P109, DOI [10.1111/j.1467-9574.1982.tb00782.x, DOI 10.1111/J.1467-9574.1982.TB00782.X]
[32]  
Marshall A., 1979, Inequalities: Theory of Majorization and Its Applications
[33]  
McLachlan G. J., 1997, EM ALGORITHM EXTENSI
[34]   USING EM TO OBTAIN ASYMPTOTIC VARIANCE - COVARIANCE MATRICES - THE SEM ALGORITHM [J].
MENG, XL ;
RUBIN, DB .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1991, 86 (416) :899-909
[35]  
MENG XL, 1993, BIOMETRIKA, V80, P267, DOI 10.2307/2337198
[36]   Direct calculation of the information matrix via the EM algorithm [J].
Oakes, D .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1999, 61 :479-482
[37]   Genomewide motif identification using a dictionary model [J].
Sabatti, C ;
Lange, K .
PROCEEDINGS OF THE IEEE, 2002, 90 (11) :1803-1810
[38]   ITERATIVE TECHNIQUE FOR ABSOLUTE DEVIATIONS CURVE FITTING [J].
SCHLOSSMACHER, EJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1973, 68 (344) :857-859
[39]   ON THE CONVERGENCE PROPERTIES OF THE EM ALGORITHM [J].
WU, CFJ .
ANNALS OF STATISTICS, 1983, 11 (01) :95-103