Iterative and sequential Kalman filter-based speech enhancement algorithms

Cited by: 163

Authors:

Gannot, S [1]

Burshtein, D [1]

Weinstein, E [1]

Affiliations:

[1] Tel Aviv Univ, Dept Elect Engn Syst, IL-69978 Tel Aviv, Israel

Source:

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1998 / Vol. 6 / Issue 04

DOI:

10.1109/89.701367

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

Speech quality and intelligibility might significantly deteriorate in the presence of background noise, especially when the speech signal is subject to subsequent processing. In particular, speech coders and automatic speech recognition (ASR) systems that were designed or trained to act on clean speech signals might be rendered useless in the presence of background noise. Speech enhancement algorithms have therefore attracted a great deal of interest in the past two decades. In this paper, we present a class of Kalman filter-based algorithms with some extensions, modifications, and improvements of previous work. The first algorithm employs the estimate-maximize (EM) method to iteratively estimate the spectral parameters of the speech and noise. The enhanced speech signal is obtained as a byproduct of the parameter estimation algorithm. The second algorithm is a sequential, computationally efficient, gradient descent algorithm. We discuss various topics concerning the practical implementation of these algorithms. An extensive experimental study using real speech and noise signals is provided to compare these algorithms with alternative speech enhancement algorithms, and to compare the performance of the iterative and sequential algorithms.

Pages: 373-385

Page count: 13
