Heteroscedastic discriminant analysis and reduced rank HMMs for improved speech recognition

Cited by: 218
Authors
Kumar, N [1]
Andreou, AG [1]
Affiliations
[1] Johns Hopkins Univ, Dept Elect & Comp Engn, Ctr Language & Speech Proc, Baltimore, MD 21218 USA
Funding
National Science Foundation (USA);
Keywords
heteroscedastic; discriminant analysis; speech recognition; reduced rank HMMs;
DOI
10.1016/S0167-6393(98)00061-2
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
We present the theory for heteroscedastic discriminant analysis (HDA), a model-based generalization of linear discriminant analysis (LDA) derived in the maximum-likelihood framework to handle heteroscedastic (unequal-variance) classifier models. We show how to estimate the heteroscedastic Gaussian model parameters jointly with the dimensionality-reducing transform, using the EM algorithm. In doing so, we alleviate the need for an a priori ad hoc class assignment. We apply the theoretical results to the problem of speech recognition and observe word-error reduction in systems that employed both diagonal and full covariance heteroscedastic Gaussian models tested on the TI-DIGITS database. © 1998 Elsevier Science B.V. All rights reserved.
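As a rough illustration of why the equal-variance assumption matters, the toy sketch below classifies one point under a shared-variance Gaussian model and under a model with one variance per class. It is not the HDA estimator or the EM procedure described in the abstract; the function names, means, variances, and test point are all hypothetical values chosen for the example, and equal class priors and class sizes are assumed.

```python
import math


def gaussian_log_pdf(x, mean, var):
    """Log density of a univariate Gaussian N(mean, var) evaluated at x."""
    return -0.5 * (math.log(2.0 * math.pi * var) + (x - mean) ** 2 / var)


def classify(x, means, variances):
    """Index of the class with the highest Gaussian log-likelihood (equal priors assumed)."""
    scores = [gaussian_log_pdf(x, m, v) for m, v in zip(means, variances)]
    return scores.index(max(scores))


# Hypothetical toy classes: nearly equal means, very different variances.
means = [0.0, 0.5]
class_vars = [1.0, 16.0]  # heteroscedastic model: one variance per class

# Homoscedastic model: a single pooled variance shared by both classes
# (simple average, which assumes equally sized classes).
pooled_var = sum(class_vars) / len(class_vars)

x = -6.0  # a point far out in the left tail

# With a shared variance the decision reduces to "nearest mean".
homo_label = classify(x, means, [pooled_var, pooled_var])

# With class-specific variances the wide class explains the tail point better.
hetero_label = classify(x, means, class_vars)

print("shared variance   ->", homo_label)
print("per-class variance ->", hetero_label)
```

Under the shared variance the point is assigned to the class with the nearer mean, while the per-class model assigns it to the wide class, since a narrow Gaussian gives a point six standard deviations out a very low likelihood.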
Pages: 283-297
Number of pages: 15
Related papers
43 in total
  • [1] NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION
    AKAIKE, H
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) : 716 - 723
  • [2] [Anonymous], P ICASSP
  • [3] [Anonymous], 1989, SELECTED PAPERS C R
  • [4] [Anonymous], 1984, Multivariate Analysis
  • [5] [Anonymous], [No title captured], DOI 10.1111/J.1467-842X.1984.TB01271.X
  • [6] AUBERT X, 1993, P ICASSP, V2, P648
  • [7] Ayer C.M., 1993, P EUROSPEECH, V1, P583
  • [8] AYER CM, 1992, THESIS U LONDON
  • [9] Bartlett M, 1947, J R STAT SOC B, V9, P176
  • [10] A MAXIMIZATION TECHNIQUE OCCURRING IN STATISTICAL ANALYSIS OF PROBABILISTIC FUNCTIONS OF MARKOV CHAINS
    BAUM, LE
    PETRIE, T
    SOULES, G
    WEISS, N
    [J]. ANNALS OF MATHEMATICAL STATISTICS, 1970, 41 (01) : 164 - &