GSVD-based optimal filtering for single and multimicrophone speech enhancement

被引：260

作者：

Doclo, S ^{[1
]}

Moonen, M ^{[1
]}

机构：

[1] Katholieke Univ Leuven, SISTA, ESAT, Dept Elect Engn, Louvain, Belgium

来源：

IEEE TRANSACTIONS ON SIGNAL PROCESSING | 2002年 / 50卷 / 09期

关键词：

generalized singular value decomposition; optimal filtering; robust beamforming; speech enhancement;

D O I：

10.1109/TSP.2002.801937

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this paper, a generalized singular value decomposition (GSVD) based algorithm is proposed for enhancing multimicrophone speech signals degraded by additive colored noise. This GSVD-based multimicrophone algorithm can be considered to be an extension of the single-microphone signal subspace algorithms for enhancing noisy speech signals and amounts to a specific optimal filtering problem when the desired response signal cannot be observed. The optimal filter can be written as a function of the generalized singular vectors and singular values of a speech and noise data matrix. A number of symmetry properties are derived for the single-microphone and multimicrophone optimal filter, which are valid for the white noise case as well as for the colored noise case. In addition, the averaging step of some single-microphone signal subspace algorithms is examined, leading to the conclusion that this averaging operation is unnecessary and even suboptimal. For simple situations, where we consider localized sources and no multipath propagation, the GSVD-based optimal filtering technique exhibits the spatial directivity pattern of a beamformer. When comparing the noise reduction performance for realistic situations, simulations show that the GSVD-based optimal filtering technique has a better performance than standard fixed and adaptive beamforming techniques for all reverberation times and that it is more robust to deviations from the nominal situation, as, e.g., encountered in uncalibrated microphone arrays.

引用

页码：2230 / 2244

页数：15

共 42 条

[11]

DOCLO S, 2002, IN PRESS IEEE T SPEE

[12]

DOCLO S, 1999, P 1999 IEEE INT WORK, P80

[13] PHYSICAL INTERPRETATION OF SIGNAL RECONSTRUCTION FROM REDUCED RANK MATRICES [J].

DOLOGLOU, I ;

CARAYANNIS, G .

IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1991, 39 (07) :1681-1682

[14] Projection-based rank reduction algorithms for multichannel modelling and image compression [J].

Dologlou, I ;

Pesquet, JC ;

Skowronski, J .

SIGNAL PROCESSING, 1996, 48 (02) :97-109

[15] SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR LOG-SPECTRAL AMPLITUDE ESTIMATOR [J].

EPHRAIM, Y ;

MALAH, D .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1985, 33 (02) :443-445

[16] A SIGNAL SUBSPACE APPROACH FOR SPEECH ENHANCEMENT [J].

EPHRAIM, Y ;

VANTREES, HL .

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (04) :251-266

[17]

Everest F. A., 1989, MASTER HDB ACOUSTICS

[18] PARAMETRIC CODING OF SPEECH SPECTRA [J].

FLANAGAN, JL .

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1980, 68 (02) :412-419

[19] Iterative and sequential Kalman filter-based speech enhancement algorithms [J].

Gannot, S ;

Burshtein, D ;

Weinstein, E .

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (04) :373-385

[20] FILTERING OF COLORED NOISE FOR SPEECH ENHANCEMENT AND CODING [J].

GIBSON, JD ;

KOO, BR ;

GRAY, SD .

IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1991, 39 (08) :1732-1742

← 1 2 3 4 5 →