ML Estimation of a Stochastic Linear System with the EM Algorithm and Its Application to Speech Recognition

被引:146
作者
Digalakis, V. [1 ]
Rohlicek, J. R. [2 ]
Ostendorf, M. [3 ]
机构
[1] SRI Int, Menlo Pk, CA 94025 USA
[2] BBN Labs Inc, Cambridge, MA 02138 USA
[3] Boston Univ, Boston, MA 02215 USA
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1993年 / 1卷 / 04期
关键词
D O I
10.1109/89.242489
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper we present a nontraditional approach to the problem of estimating the parameters of a stochastic linear system. The method is based on the Expectation-Maximization algorithm and can be considered as the continuous analog of the Baum-Welch estimation algorithm for hidden Markov models. We use the algorithm for training the parameters of a dynamical system model that we propose for better representing the spectral dynamics of speech for recognition. We assume that the observed feature vectors of a phone segment are the output of a stochastic linear dynamical system, and we show how the evolution of the dynamics as a function of the segment length can be modeled using alternative assumptions. We show on a phoneme classification task using the TIMIT database that our approach is the first effective use of an explicit model for statistical dependence between frames of speech.
引用
收藏
页码:431 / 442
页数:12
相关论文
共 34 条
[1]  
AKAIKE H, 1973, BIOMETRIKA, V60, P255, DOI 10.2307/2334537
[2]  
[Anonymous], 2016, LINEAR NONLINEAR PRO
[3]  
Astrom K. J., 1965, PROC 2 IFAC S THEORY, P96
[4]   DRAGON SYSTEM - OVERVIEW [J].
BAKER, JK .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1975, AS23 (01) :24-29
[5]   A MAXIMIZATION TECHNIQUE OCCURRING IN STATISTICAL ANALYSIS OF PROBABILISTIC FUNCTIONS OF MARKOV CHAINS [J].
BAUM, LE ;
PETRIE, T ;
SOULES, G ;
WEISS, N .
ANNALS OF MATHEMATICAL STATISTICS, 1970, 41 (01) :164-&
[6]  
BROWN P, 1987, THESIS CMU
[7]  
Caines P. E., 1988, LINEAR STOCHASTIC SY
[8]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[9]  
DIGALAKIS V, 1991, P ICASSP 91, P289
[10]  
Digalakis V., 1989, P WORKSH SPEECH NAT, P332, DOI [10.3115/1075434.1075491., DOI 10.3115/1075434.1075491]