Iterative and sequential Kalman filter-based speech enhancement algorithms

被引:163
作者
Gannot, S [1 ]
Burshtein, D [1 ]
Weinstein, E [1 ]
机构
[1] Tel Aviv Univ, Dept Elect Engn Syst, IL-69978 Tel Aviv, Israel
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1998年 / 6卷 / 04期
关键词
D O I
10.1109/89.701367
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speech quality and intelligibility might significantly deteriorate in the presence of background noise, especially when the speech signal is subject to subsequent processing. In particular, speech coders and automatic speech recognition (ASR) systems that were designed or trained to act on clean speech signals might be rendered useless in the presence of background noise. Speech enhancement algorithms have therefore attracted a great deal of interest in the past two decades. Zn this paper, we present a class of Kalman filter-based algorithms with some extensions, modifications, and improvements of previous work. The first algorithm employs the estimate-maximize (ER I) method to iteratively estimate the spectral parameters of the speech and noise parameters. The enhanced speech signal is obtained as a byproduct of the parameter estimation algorithm. The second algorithm is a sequential, computationally efficient, gradient descent algorithm. We discuss various topics concerning the practical implementation of these algorithms. Extensive experimental study using real speech and noise signals is provided to compare these algorithms with alternative speech enhancement algorithms, and to compare the performance of the iterative and sequential algorithms.
引用
收藏
页码:373 / 385
页数:13
相关论文
共 30 条
[1]  
Anderson B. D. O., 1979, OPTIMAL FILTERING
[2]  
[Anonymous], 1990, DARPA TIMIT AC PHON
[3]   SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION [J].
BOLL, SF .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02) :113-120
[4]   JOINT MODELING AND MAXIMUM-LIKELIHOOD-ESTIMATION OF PITCH AND LINEAR PREDICTION COEFFICIENT PARAMETERS [J].
BURSHTEIN, D .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1992, 91 (03) :1531-1537
[5]  
Dempster A. P., 1977, JRSSSB, V39, P1
[6]   ON THE APPLICATION OF HIDDEN MARKOV-MODELS FOR ENHANCING NOISY SPEECH [J].
EPHRAIM, Y ;
MALAH, D ;
JUANG, BH .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1989, 37 (12) :1846-1856
[7]   SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR LOG-SPECTRAL AMPLITUDE ESTIMATOR [J].
EPHRAIM, Y ;
MALAH, D .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1985, 33 (02) :443-445
[8]   A BAYESIAN-ESTIMATION APPROACH FOR SPEECH ENHANCEMENT USING HIDDEN MARKOV-MODELS [J].
EPHRAIM, Y .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1992, 40 (04) :725-735
[9]   SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR [J].
EPHRAIM, Y ;
MALAH, D .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (06) :1109-1121
[10]   Theory of statistical estimation. [J].
Fisher, RA .
PROCEEDINGS OF THE CAMBRIDGE PHILOSOPHICAL SOCIETY, 1925, 22 :700-725