On the importance of the Pearson correlation coefficient in noise reduction

被引：275

作者：

Benesty, Jacob

Chen, Jingdong ^{[1
]}

Huang, Yiteng ^{[2
]}

机构：

[1] Bell Labs, Murray Hill, NJ 07974 USA

[2] WeVoice Inc, Bridgewater, NJ 08807 USA

来源：

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2008年 / 16卷 / 04期

关键词：

mean-square error (MSE); noise reduction; Pearson correlation coefficient; speech distortion; speech enhancement; Wiener filter;

D O I：

10.1109/TASL.2008.919072

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Noise reduction, which aims at estimating a clean speech from noisy observations, has attracted a considerable amount of research and engineering attention over the past few decades. In the single-channel scenario, an estimate of the clean speech can be obtained by passing the noisy signal picked up by the microphone through a linear filter/transformation. The core issue, then, is how to find an optimal filter/transformation such that, after the filtering process, the signal-to-noise ratio (SNR) is improved but the desired speech signal is not noticeably distorted. Most of the existing optimal filters (such as the Wiener filter and subspace transformation) are formulated from the mean-square error (MSE) criterion. However, with the MSE formulation, many desired properties of the optimal noise-reduction filters such as the SNR behavior cannot be seen. In this paper, we present a new criterion based on the Pearson correlation coefficient (PCC). We show that in the context of noise reduction the squared PCC (SPCC) has many appealing properties and. can be used as an optimization cost function to derive many optimal and suboptimal noise-reduction filters. The clear advantage of using the SPCC over the MSE is that the noise-reduction performance (in terms of the SNR improvement and speech distortion) of the resulting optimal filters can be easily analyzed. This shows that, as far as noise reduction is concerned, the SPCC-based cost function serves as a more natural criterion to optimize as compared to the MSE.

引用

页码：757 / 765

页数：9

共 32 条

[1]

ABUTALED AS, 1998, IEEE T CIRCUITS SYST, V35, P1201

[2]

[Anonymous], 1896, Phil Trans R Soc A, DOI 10.1098/rsta.1896.0007

[3]

[Anonymous], AUDIO SIGNAL PROCESS

[4]

[Anonymous], MICROPHONE ARRAYS SI

[5]

Benesty J, 2005, SPEECH ENHANCEMENT

[6] SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION [J].

BOLL, SF .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02) :113-120

[7]

CARTER GC, 1988, SIGNAL PROCESSING HD

[8] New insights into the noise reduction Wiener filter [J].

Chen, Jingdong ;

Benesty, Jacob ;

Huang, Yiteng ;

Doclo, Simon .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (04) :1218-1234

[9] SPEECH ENHANCEMENT FROM NOISE - A REGENERATIVE APPROACH [J].

DENDRINOS, M ;

BAKAMIDIS, S ;

CARAYANNIS, G .

SPEECH COMMUNICATION, 1991, 10 (01) :45-57

[10] On the output SNR of the speech-distortion weighted multichannel Wiener filter [J].

Doclo, S ;

Moonen, M .

IEEE SIGNAL PROCESSING LETTERS, 2005, 12 (12) :809-811

← 1 2 3 4 →