A generalized subspace approach for enhancing speech corrupted by colored noise

被引:284
作者
Hu, Y [1 ]
Loizou, PC [1 ]
机构
[1] Univ Texas, Dept Elect Engn, Richardson, TX 75083 USA
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2003年 / 11卷 / 04期
基金
美国国家卫生研究院;
关键词
colored noise; KLT; noise reduction; speech enhancement; subspace-based method;
D O I
10.1109/TSA.2003.814458
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A generalized subspace approach is proposed for enhancement of speech corrupted by colored noise. A nonunitary transform, based on the simultaneous diagonalization of the clean speech and noise covariance matrices, is used to project the noisy signal onto a signal-plus-noise subspace and a noise subspace. The clean signal is estimated by nulling the signal components in the noise subspace and retaining the components in the signal subspace. The applied, transform has built-in prewhitening and can therefore be used in general for colored noise. The proposed approach is shown to be a generalization of the approach proposed by Ephraim and Van Trees for white noise. Two estimators were derived based on the nonunitary transform, one based on time-domain constraints and one based on spectral domain constraints. Objective and subjective measures demonstrated improvements over other subspace-based methods when tested with TIMIT sentences corrupted with speech-shaped noise and multi-talker babble.
引用
收藏
页码:334 / 341
页数:8
相关论文
共 17 条
[1]  
[Anonymous], P IEEE INT C AC SPEE
[2]   ALGORITHM - SOLUTION OF MATRIX EQUATION AX+XB = C [J].
BARTELS, RH ;
STEWART, GW .
COMMUNICATIONS OF THE ACM, 1972, 15 (09) :820-&
[3]  
Deller J., 2000, Discrete-Time Processing of Speech Signals
[4]   SPEECH ENHANCEMENT FROM NOISE - A REGENERATIVE APPROACH [J].
DENDRINOS, M ;
BAKAMIDIS, S ;
CARAYANNIS, G .
SPEECH COMMUNICATION, 1991, 10 (01) :45-57
[5]   A SIGNAL SUBSPACE APPROACH FOR SPEECH ENHANCEMENT [J].
EPHRAIM, Y ;
VANTREES, HL .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (04) :251-266
[6]  
GRAY RM, 1972, IEEE T INFORM THEORY, V18, P725, DOI 10.1109/TIT.1972.1054924
[7]  
Hansen J.H. L., 1998, INT C SPEECH LANGUAG, V7, P2819
[8]  
Hu Y, 2002, INT CONF ACOUST SPEE, P573
[9]   REDUCTION OF BROAD-BAND NOISE IN SPEECH BY TRUNCATED QSVD [J].
JENSEN, SH ;
HANSEN, PC ;
HANSEN, SD ;
SORENSEN, JA .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (06) :439-448
[10]  
Lancaster P, 1985, THEORY MATRICES