Blind Spatial Subtraction Array for Speech Enhancement in Noisy Environment

被引:85
作者
Takahashi, Yu [1 ]
Takatani, Tomoya [1 ]
Osako, Keiichi [1 ]
Saruwatari, Hiroshi [1 ]
Shikano, Kiyohiro [1 ]
机构
[1] Nara Inst Sci & Technol, Grad Sch Informat Sci, Nara 6300192, Japan
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2009年 / 17卷 / 04期
关键词
Blind source separation (BSS); independent component analysis (ICA); microphone array; speech enhancement; INDEPENDENT COMPONENT ANALYSIS; SEPARATION; ALGORITHM;
D O I
10.1109/TASL.2008.2011517
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We propose a new blind spatial subtraction array (BSSA) consisting of a noise estimator based on independent component analysis (ICA) for efficient speech enhancement. In this paper, first, we theoretically and experimentally point out that ICA is proficient in noise estimation under a non-point-source noise condition rather than in speech estimation. Therefore, we propose BSSA that utilizes ICA as a noise estimator. In BSSA, speech extraction is achieved by subtracting the power spectrum of noise signals estimated using ICA from the power spectrum of the partly enhanced target speech signal with a delay-and-sum beamformer. This "power-spectrum-domain subtraction" procedure enables better noise reduction than the conventional ICA with estimation-error robustness. Another benefit of BSSA architecture is "permutation robustness." Although the ICA part in BSSA suffers from a source permutation problem, the BSSA architecture can reduce the negative affection when permutation arises. The results of various speech enhancement test reveal that the noise reduction and speech recognition performance of the proposed BSSA are superior to those of conventional methods.
引用
收藏
页码:650 / 664
页数:15
相关论文
共 31 条
[1]  
[Anonymous], 1998, Independent Component Analysis: Theory and Applications
[2]   Equivalence between frequency-domain blind source separation and frequency-domain adaptive beamforming for convolutive mixtures [J].
Araki, S ;
Makino, S ;
Hinamoto, Y ;
Mukai, R ;
Nishikawa, T ;
Saruwatari, H .
EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2003, 2003 (11) :1157-1166
[3]   SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION [J].
BOLL, SF .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02) :113-120
[4]  
Brandstein M., 2001, MICROPHONE ARRAYS SI
[5]  
Cardoso J.F., 1989, P IEEE INT C ACOUSTI, P2109
[6]   INDEPENDENT COMPONENT ANALYSIS, A NEW CONCEPT [J].
COMON, P .
SIGNAL PROCESSING, 1994, 36 (03) :287-314
[7]   COMPARISON OF PARAMETRIC REPRESENTATIONS FOR MONOSYLLABIC WORD RECOGNITION IN CONTINUOUSLY SPOKEN SENTENCES [J].
DAVIS, SB ;
MERMELSTEIN, P .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (04) :357-366
[8]   Microphone array systems for hands-free telecommunication [J].
Elko, GW .
SPEECH COMMUNICATION, 1996, 20 (3-4) :229-240
[9]   COMPUTER-STEERED MICROPHONE ARRAYS FOR SOUND TRANSDUCTION IN LARGE ROOMS [J].
FLANAGAN, JL ;
JOHNSTON, JD ;
ZAHN, R ;
ELKO, GW .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1985, 78 (05) :1508-1518
[10]   ALGORITHM FOR LINEARLY CONSTRAINED ADAPTIVE ARRAY PROCESSING [J].
FROST, OL .
PROCEEDINGS OF THE INSTITUTE OF ELECTRICAL AND ELECTRONICS ENGINEERS, 1972, 60 (08) :926-&