Bayesian inference for finite mixtures of univariate and multivariate skew-normal and skew-t distributions

被引:155
作者
Fruehwirth-Schnatter, Sylvia [1 ]
Pyne, Saumyadipta [2 ]
机构
[1] Johannes Kepler Univ Linz, Dept Appl Stat & Econometr, A-1040 Linz, Austria
[2] Harvard Univ, Sch Med, Dana Farber Canc Inst, Dept Med Oncol, Boston, MA 02115 USA
关键词
Flow cytometry; Gibbs sampling; Kurtosis; Markov chain Monte Carlo; Skewness; Stochastic representation; FLOW-CYTOMETRY; MONTE-CARLO; UNKNOWN NUMBER; MODEL;
D O I
10.1093/biostatistics/kxp062
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We develop Bayesian inference based on data augmentation and Markov chain Monte Carlo (MCMC) sampling. In addition to the latent allocations, data augmentation is based on a stochastic representation of the skew-normal distribution in terms of a random-effects model with truncated normal random effects. For finite mixtures of skew normals, this leads to a Gibbs sampling scheme that draws from standard densities only. This MCMC scheme is extended to mixtures of skew-t distributions based on representing the skew-t distribution as a scale mixture of skew normals. As an important application of our new method, we demonstrate how it provides a new computational framework for automated analysis of high-dimensional flow cytometric data. Using multivariate skew-normal and skew-t mixture models, we could model non-Gaussian cell populations rigorously and directly without transformation or projection to lower dimensions.
引用
收藏
页码:317 / 336
页数:20
相关论文
共 51 条
[1]  
[Anonymous], 2000, Sankhya Ser. A, DOI DOI 10.2307/25051289
[2]   On the unification of families of skew-normal distributions [J].
Arellano-Valle, Reinaldo B. ;
Azzalini, Adelchi .
SCANDINAVIAN JOURNAL OF STATISTICS, 2006, 33 (03) :561-574
[3]   The multivariate skew-normal distribution [J].
Azzalini, A ;
DallaValle, A .
BIOMETRIKA, 1996, 83 (04) :715-726
[4]  
AZZALINI A, 1985, SCAND J STAT, V12, P171
[5]   Distributions generated by perturbation of symmetry with emphasis on a multivariate skew t-distribution [J].
Azzalini, A ;
Capitanio, A .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2003, 65 :367-389
[6]  
Azzalini A., 1986, STATISTICA, V46, P199
[7]   MODEL-BASED GAUSSIAN AND NON-GAUSSIAN CLUSTERING [J].
BANFIELD, JD ;
RAFTERY, AE .
BIOMETRICS, 1993, 49 (03) :803-821
[8]   Bayesian density estimation using Skew Student-t-Normal mixtures [J].
Barbosa Cabral, Celso Romulo ;
Bolfarine, Heleno ;
Gomes Pereira, Jose Raimundo .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2008, 52 (12) :5075-5090
[9]   The Rush Memory and Aging Project: Study design and baseline characteristics of the study cohort [J].
Bennett, DA ;
Schneider, JA ;
Buchman, AS ;
de Leon, CM ;
Bienias, JL ;
Wilson, RS .
NEUROEPIDEMIOLOGY, 2005, 25 (04) :163-175
[10]   Assessing a mixture model for clustering with the integrated completed likelihood [J].
Biernacki, C ;
Celeux, G ;
Govaert, G .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (07) :719-725