Hierarchical Dirichlet processes

被引:1859
作者
Teh, Yee Whye [1 ]
Jordan, Michael I.
Beal, Matthew J.
Blei, David M.
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] SUNY Buffalo, Buffalo, NY 14260 USA
[3] Princeton Univ, Princeton, NJ 08544 USA
关键词
clustering; hierarchical model; Markov chain Monte Carlo; mixture model; nonparametric Bayesian statistics;
D O I
10.1198/016214506000000302
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider problems involving groups of data where each observation within a group is a draw from a mixture model and where it is desirable to share mixture components between groups. We assume that the number of mixture components is unknown a priori and is to be inferred from the data. In this setting it is natural to consider sets of Dirichlet processes, one for each group, where the well-known clustering property of the Dirichlet process provides a nonparametric prior for the number of mixture components within each group. Given our desire to tie the mixture models in the various groups, we consider a hierarchical model, specifically one in which the base measure for the child Dirichlet processes is itself distributed according to a Dirichlet process. Such a base measure being discrete, the child Dirichlet processes necessarily share atoms. Thus, as desired, the mixture models in the different groups necessarily share mixture components. We discuss representations of hierarchical Dirichlet processes in terms of a stick-breaking process, and a generalization of the Chinese restaurant process that we refer to as the "Chinese restaurant franchise." We present Markov chain Monte Carlo algorithms for posterior inference in hierarchical Dirichlet process mixtures and describe applications to problems in information retrieval and text modeling.
引用
收藏
页码:1566 / 1581
页数:16
相关论文
共 46 条
  • [1] Aldous DJ., 1985, Exchangeability and related topics. Ecole d'ete de probabilites de Saint-Flour XIII, P1, DOI DOI 10.1007/BFB0099421
  • [2] MIXTURES OF DIRICHLET PROCESSES WITH APPLICATIONS TO BAYESIAN NONPARAMETRIC PROBLEMS
    ANTONIAK, CE
    [J]. ANNALS OF STATISTICS, 1974, 2 (06) : 1152 - 1174
  • [3] Beal M.J., 2003, Variational algorithms for approximate Bayesian inference
  • [4] Beal MJ, 2002, ADV NEUR IN, V14, P577
  • [5] FERGUSON DISTRIBUTIONS VIA POLYA URN SCHEMES
    BLACKWELL, D
    MACQUEEN, JB
    [J]. ANNALS OF STATISTICS, 1973, 1 (02) : 353 - 355
  • [6] Variational Inference for Dirichlet Process Mixtures
    Blei, David M.
    Jordan, Michael I.
    [J]. BAYESIAN ANALYSIS, 2006, 1 (01): : 121 - 143
  • [7] Blei DM, 2003, BAYESIAN STATISTICS 7, P25
  • [8] Semiparametric regression for count data
    Carota, C
    Parmigiani, G
    [J]. BIOMETRIKA, 2002, 89 (02) : 265 - 281
  • [9] CIFARELLI D, 1978, PROBLEMI STAT PARAME
  • [10] An ANOVA model for dependent random measures
    De Iorio, M
    Müller, P
    Rosner, GL
    MacEachern, SN
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2004, 99 (465) : 205 - 215