On second-order statistics and linear estimation of cepstral coefficients

被引:40
作者
Ephraim, Y [1 ]
Rahim, M [1 ]
机构
[1] George Mason Univ, Dept Elect & Comp Engn, Fairfax, VA 22030 USA
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1999年 / 7卷 / 02期
关键词
cepstral statistics; hidden Markov model; speech recognition;
D O I
10.1109/89.748121
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Explicit expressions for the second-order statistics of cepstral components representing clean and noisy signal waveforms are derived, The noise is assumed additive to the signal, and the spectral components of each process are assumed statistically independent complex Gaussian random variables. The key result developed here is an explicit expression for the cross-covariance between the log-periodograms of the clean and noisy signals. In the absence of noise, this expression is used to show that the covariance matrix of cepstral components representing N signal samples, is a fixed signal independent matrix, which approaches a diagonal matrix at a rate of 1/N. In addition, the cross-covariance expression is used to develop an explicit linear minimum mean square error estimator for the clean cepstral components given noisy cepstral components. Recognition results on the English digits using the fixed covariance and linear estimator are presented.
引用
收藏
页码:162 / 176
页数:15
相关论文
共 41 条
[1]  
[Anonymous], 1976, LINEAR PREDICTION SP
[2]  
[Anonymous], AUTOMATIC SPEECH SPE
[3]  
Baum L.E., 1972, Inequalities III: Proceedings of the Third Symposium on Inequalities, page, V3, P1
[4]   A MAXIMIZATION TECHNIQUE OCCURRING IN STATISTICAL ANALYSIS OF PROBABILISTIC FUNCTIONS OF MARKOV CHAINS [J].
BAUM, LE ;
PETRIE, T ;
SOULES, G ;
WEISS, N .
ANNALS OF MATHEMATICAL STATISTICS, 1970, 41 (01) :164-&
[5]  
BEATTIE VL, 1991, INT CONF ACOUST SPEE, P917, DOI 10.1109/ICASSP.1991.150489
[6]  
BEATTIE VL, 1992, P ICSLP, P519
[7]  
BERSTEIN AD, 1991, INT CONF ACOUST SPEE, P913, DOI 10.1109/ICASSP.1991.150488
[8]  
Brillinger DR., 1975, TIME SERIES DATA ANA
[9]  
Chen Y., 1987, Proceedings: ICASSP 87. 1987 International Conference on Acoustics, Speech, and Signal Processing (Cat. No.87CH2396-0), P717
[10]   COMPARISON OF PARAMETRIC REPRESENTATIONS FOR MONOSYLLABIC WORD RECOGNITION IN CONTINUOUSLY SPOKEN SENTENCES [J].
DAVIS, SB ;
MERMELSTEIN, P .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (04) :357-366