Bayesian inference in mixtures-of-experts and hierarchical mixtures-of-experts models with an application to speech recognition

被引:87
作者
Peng, FC
Jacobs, RA
Tanner, MA
机构
[1] UNIV ROCHESTER,DEPT BRAIN & COGNIT SCI,ROCHESTER,NY 14627
[2] NORTHWESTERN UNIV,DEPT STAT,EVANSTON,IL 60208
关键词
D O I
10.2307/2291714
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Machine classification of acoustic waveforms as speech events is often difficult due to context dependencies. Here a vowel recognition task with multiple speakers is studied via the use of a class of modular and hierarchical systems referred to as mixtures-of-experts and hierarchical mixtures-of-experts models. The statistical model underlying the systems is a mixture model in which both the mixture coefficients and the mixture components are generalized linear models. A full Bayesian approach is used as a basis of inference and prediction. Computations are performed using Markov chain Monte Carlo methods. A key benefit of this approach is the ability to obtain a sample from the posterior distribution of any functional of the parameters of the given model. In this way, more information is obtained than can be provided by a point estimate. Also avoided is the need to rely on a normal approximation to the posterior as the basis of inference. This is particularly important in cases where the posterior is skewed or multimodal. Comparisons between a hierarchical mixtures-of-experts model and other pattern classification systems on the vowel recognition task are reported. The results indicate that this model showed good classification performance and also gave the additional benefit of providing for the opportunity to assess the degree of certainty of the model in its classification predictions.
引用
收藏
页码:953 / 960
页数:8
相关论文
共 17 条
[1]  
Breiman L., 1984, Classification and Regression Trees, DOI DOI 10.2307/2530946
[2]  
DIEBOLT J, 1994, J ROY STAT SOC B MET, V56, P363
[3]   MULTIVARIATE ADAPTIVE REGRESSION SPLINES [J].
FRIEDMAN, JH .
ANNALS OF STATISTICS, 1991, 19 (01) :1-67
[4]  
Gelman A., 1992, Stat. Sci., V7, P457, DOI DOI 10.1214/SS/1177011136
[5]   Adaptive Mixtures of Local Experts [J].
Jacobs, Robert A. ;
Jordan, Michael I. ;
Nowlan, Steven J. ;
Hinton, Geoffrey E. .
NEURAL COMPUTATION, 1991, 3 (01) :79-87
[6]   HIERARCHICAL MIXTURES OF EXPERTS AND THE EM ALGORITHM [J].
JORDAN, MI ;
JACOBS, RA .
NEURAL COMPUTATION, 1994, 6 (02) :181-214
[7]  
JORDAN MI, 1993, 9303 MIT DEP BRAIN C
[8]  
McCullagh P., 1989, GEN LINEAR MODELS, DOI [DOI 10.1007/978-1-4899-3242-6, 10.1201/9780203753736, DOI 10.2307/2347392]
[9]  
MULLER P, 1991, 1991 09 PURD U DEP S
[10]  
Neal R. M., 1991, CRGTR912 U TOR DEP C