On the importance of the Pearson correlation coefficient in noise reduction

被引:275
作者
Benesty, Jacob
Chen, Jingdong [1 ]
Huang, Yiteng [2 ]
机构
[1] Bell Labs, Murray Hill, NJ 07974 USA
[2] WeVoice Inc, Bridgewater, NJ 08807 USA
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2008年 / 16卷 / 04期
关键词
mean-square error (MSE); noise reduction; Pearson correlation coefficient; speech distortion; speech enhancement; Wiener filter;
D O I
10.1109/TASL.2008.919072
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Noise reduction, which aims at estimating a clean speech from noisy observations, has attracted a considerable amount of research and engineering attention over the past few decades. In the single-channel scenario, an estimate of the clean speech can be obtained by passing the noisy signal picked up by the microphone through a linear filter/transformation. The core issue, then, is how to find an optimal filter/transformation such that, after the filtering process, the signal-to-noise ratio (SNR) is improved but the desired speech signal is not noticeably distorted. Most of the existing optimal filters (such as the Wiener filter and subspace transformation) are formulated from the mean-square error (MSE) criterion. However, with the MSE formulation, many desired properties of the optimal noise-reduction filters such as the SNR behavior cannot be seen. In this paper, we present a new criterion based on the Pearson correlation coefficient (PCC). We show that in the context of noise reduction the squared PCC (SPCC) has many appealing properties and. can be used as an optimization cost function to derive many optimal and suboptimal noise-reduction filters. The clear advantage of using the SPCC over the MSE is that the noise-reduction performance (in terms of the SNR improvement and speech distortion) of the resulting optimal filters can be easily analyzed. This shows that, as far as noise reduction is concerned, the SPCC-based cost function serves as a more natural criterion to optimize as compared to the MSE.
引用
收藏
页码:757 / 765
页数:9
相关论文
共 32 条
[1]  
ABUTALED AS, 1998, IEEE T CIRCUITS SYST, V35, P1201
[2]  
[Anonymous], 1896, Phil Trans R Soc A, DOI 10.1098/rsta.1896.0007
[3]  
[Anonymous], AUDIO SIGNAL PROCESS
[4]  
[Anonymous], MICROPHONE ARRAYS SI
[5]  
Benesty J, 2005, SPEECH ENHANCEMENT
[6]   SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION [J].
BOLL, SF .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02) :113-120
[7]  
CARTER GC, 1988, SIGNAL PROCESSING HD
[8]   New insights into the noise reduction Wiener filter [J].
Chen, Jingdong ;
Benesty, Jacob ;
Huang, Yiteng ;
Doclo, Simon .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (04) :1218-1234
[9]   SPEECH ENHANCEMENT FROM NOISE - A REGENERATIVE APPROACH [J].
DENDRINOS, M ;
BAKAMIDIS, S ;
CARAYANNIS, G .
SPEECH COMMUNICATION, 1991, 10 (01) :45-57
[10]   On the output SNR of the speech-distortion weighted multichannel Wiener filter [J].
Doclo, S ;
Moonen, M .
IEEE SIGNAL PROCESSING LETTERS, 2005, 12 (12) :809-811