Binaural Noise Reduction in the Time Domain With a Stereo Setup

被引:15
作者
Benesty, Jacob [1 ]
Chen, Jingdong [2 ]
Huang, Yiteng [3 ]
机构
[1] Univ Quebec, INRS EMT, Montreal, PQ H5A 1K6, Canada
[2] Northwestern Polytech Univ, Xian 710072, Shaanxi, Peoples R China
[3] WeVoice Inc, Bridgewater, NJ 08807 USA
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2011年 / 19卷 / 08期
关键词
Binaural noise reduction; maximum signal-to-noise ratio (SNR) filter; minimum variance distortionless response (MVDR) filter; noncircularity; speech enhancement; stereo sound system; time domain; tradeoff filter; widely linear estimation; Wiener filter; ARRAY HEARING-AIDS; COMPLEX-VARIABLES; WIENER FILTER; SPEECH; ENHANCEMENT; SUPPRESSION; MICROPHONES; STATISTICS; SIGNALS; OUTPUT;
D O I
10.1109/TASL.2011.2119313
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Binaural noise reduction with a stereophonic (or simply stereo) setup has become a very important problem as stereo sound systems and devices are being more and more deployed in modern voice communications. This problem is very challenging since it requires not only the reduction of the noise at the stereo inputs, but also the preservation of the spatial information embodied in the two channels so that after noise reduction the listener can still localize the sound source from the binaural outputs. As a result, simply applying a traditional single-channel noise reduction technique to each channel individually may not work as the spatial effects may be destroyed. In this paper, we present a new formulation of the binaural noise reduction problem in stereo systems. We first form a complex signal from the stereo inputs with one channel being its real part and the other being its imaginary part. By doing so, the binaural noise reduction problem can be processed by a single-channel widely linear filter. The widely linear estimation theory is then used to derive optimal noise reduction filters that can fully take advantage of the noncircularity of the complex speech signal to achieve noise reduction while preserving the desired signal (speech) and spatial information. With this new formulation, the Wiener, minimum variance distortionless response (MVDR), maximum signal-to-noise ratio (SNR), and tradeoff filters are derived. Experiments are provided to justify the effectiveness of these filters.
引用
收藏
页码:2260 / 2272
页数:13
相关论文
共 36 条
[1]   Statistics for complex variables and signals .1. Variables [J].
Amblard, PO ;
Gaeta, M ;
Lacoume, JL .
SIGNAL PROCESSING, 1996, 53 (01) :1-13
[2]   Statistics for complex variables and signals .2. Signals [J].
Amblard, PO ;
Gaeta, M ;
Lacoume, JL .
SIGNAL PROCESSING, 1996, 53 (01) :15-25
[3]  
[Anonymous], 2001, MICROPHONE ARRAYS SI
[4]  
Benesty J, 2005, SIG COM TEC, P9, DOI 10.1007/3-540-27489-8_2
[5]  
Benesty J, 2009, SPRINGER TOP SIGN PR, V2, P1, DOI 10.1007/978-3-642-00296-0_1
[6]  
Benesty J, 2008, SPRINGER TOP SIGN PR, V1, P1
[7]   SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION [J].
BOLL, SF .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02) :113-120
[8]   HIGH-RESOLUTION FREQUENCY-WAVENUMBER SPECTRUM ANALYSIS [J].
CAPON, J .
PROCEEDINGS OF THE IEEE, 1969, 57 (08) :1408-&
[9]  
CHEN J, 2007, SPRINGER HDB SPEECH
[10]   A minimum distortion noise reduction algorithm with multiple microphones [J].
Chen, Jingdong ;
Benesty, Jacob ;
Huang, Yiteng .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (03) :481-493