A split-merge Markov chain Monte Carlo procedure for the dirichlet process mixture model

被引:242
作者
Jain, S [1 ]
Neal, RM
机构
[1] Univ Calif San Diego, Div Biostat, Dept Family & Prevent Med, La Jolla, CA 92093 USA
[2] Univ Toronto, Dept Stat, Toronto, ON M5S 3G3, Canada
[3] Univ Toronto, Dept Comp Sci, Toronto, ON M5S 3G3, Canada
关键词
Gibbs sampler; latent class analysis; Metropolis-Hastings algorithm;
D O I
10.1198/1061860043001
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This article proposes a split-merge Markov chain algorithm to address the problem of inefficient sampling for conjugate Dirichlet process mixture models. Traditional Markov chain Monte Carlo methods for Bayesian mixture models, such as Gibbs sampling, can become trapped in isolated modes corresponding to an inappropriate clustering of data points. This article describes a Metropolis-Hastings procedure that can escape such local modes by splitting or merging mixture components. Our algorithm employs a new technique in which an appropriate proposal for splitting or merging components is obtained by using a restricted Gibbs sampling scan. We demonstrate empirically that our method outperforms the Gibbs sampler in situations where two or more components are similar in structure.
引用
收藏
页码:158 / 182
页数:25
相关论文
共 23 条
[1]   MIXTURES OF DIRICHLET PROCESSES WITH APPLICATIONS TO BAYESIAN NONPARAMETRIC PROBLEMS [J].
ANTONIAK, CE .
ANNALS OF STATISTICS, 1974, 2 (06) :1152-1174
[2]   FERGUSON DISTRIBUTIONS VIA POLYA URN SCHEMES [J].
BLACKWELL, D ;
MACQUEEN, JB .
ANNALS OF STATISTICS, 1973, 1 (02) :353-355
[3]   A semiparametric Bayesian model for randomised block designs [J].
Bush, CA ;
MacEachern, SN .
BIOMETRIKA, 1996, 83 (02) :275-285
[4]   Computational and inferential difficulties with mixture posterior distributions. [J].
Celeux, G ;
Hurn, M ;
Robert, CP .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2000, 95 (451) :957-970
[5]   BAYESIAN DENSITY-ESTIMATION AND INFERENCE USING MIXTURES [J].
ESCOBAR, MD ;
WEST, M .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1995, 90 (430) :577-588
[6]   ESTIMATING NORMAL MEANS WITH A DIRICHLET PROCESS PRIOR [J].
ESCOBAR, MD .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1994, 89 (425) :268-277
[7]  
Everitt B. S., 1981, FINITE MIXTURE DISTR
[8]  
Everitt BS., 1984, INTRO LATENT VARIABL
[9]  
Ferguson T.S., 1983, Recent Advances in Statistics: Papers in Honor of Herman Chernojf on His Sixtieth Birthday, P287
[10]  
Gelman A, 1998, STAT SCI, V13, P163