Signal/noise KLT based approach for enhancing speech degraded by colored noise

被引:113
作者
Mittal, U [1 ]
Phamdo, N [1 ]
机构
[1] SUNY Stony Brook, Dept Elect & Comp Engn, Stony Brook, NY 11794 USA
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2000年 / 8卷 / 02期
关键词
colored noise; Karhunen-Loeve transform (KLT); speech enhancement;
D O I
10.1109/89.824700
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A signal/noise Karhunen-Loeve transform (KLT) based approach for enhancing speech degraded by colored noise is proposed. The noisy speech frames are classified into speech-dominated frames and noise-dominated frames. In the speech-dominated frames, the: signal KLT matrix is used and in the noise dominated frames, the noise KLT matrix is used. The approach does not require noise whitening and hence works well even with narrowband noise, A two-dimensional objective measure which captures both the speech distortion and the noise shaping characteristics of the algorithm is proposed. This measure indicates that the proposed method performs better noise shaping than a modified form of the signal subspace approach proposed by Ephraim and Van Trees and the standard spectral subtraction method. Informal listening tests show that the proposed algorithm does not suffer from the problem of residual musical noise and performs better noise masking than the signal subspace approach.
引用
收藏
页码:159 / 167
页数:9
相关论文
共 13 条
[1]   SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION [J].
BOLL, SF .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02) :113-120
[2]  
EPHRAIM Y, 1995, INT CONF ACOUST SPEE, P804, DOI 10.1109/ICASSP.1995.479816
[3]   A BAYESIAN-ESTIMATION APPROACH FOR SPEECH ENHANCEMENT USING HIDDEN MARKOV-MODELS [J].
EPHRAIM, Y .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1992, 40 (04) :725-735
[4]   A SIGNAL SUBSPACE APPROACH FOR SPEECH ENHANCEMENT [J].
EPHRAIM, Y ;
VANTREES, HL .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (04) :251-266
[5]  
Golub G.H., 2013, MATRIX COMPUTATIONS
[6]   CONSTRAINED ITERATIVE SPEECH ENHANCEMENT WITH APPLICATION TO SPEECH RECOGNITION [J].
HANSEN, JHL ;
CLEMENTS, MA .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1991, 39 (04) :795-805
[7]   REDUCTION OF BROAD-BAND NOISE IN SPEECH BY TRUNCATED QSVD [J].
JENSEN, SH ;
HANSEN, PC ;
HANSEN, SD ;
SORENSEN, JA .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (06) :439-448
[8]   SUBSPACE METHODS FOR THE BLIND IDENTIFICATION OF MULTICHANNEL FIR FILTERS [J].
MOULINES, E ;
DUHAMEL, P ;
CARDOSO, JF ;
MAYRARGUE, S .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1995, 43 (02) :516-525
[9]   RETRIEVAL OF HARMONICS FROM A COVARIANCE FUNCTION [J].
PISARENKO, VF .
GEOPHYSICAL JOURNAL OF THE ROYAL ASTRONOMICAL SOCIETY, 1973, 33 (03) :347-366
[10]  
Poor H. V., 1988, An Introduction to Signal Detection Estimation