Combined approach of array processing and independent component analysis for blind separation of acoustic signals

被引:71
作者
Asano, F [1 ]
Ikeda, S
Ogawa, M
Asoh, H
Kitawaki, N
机构
[1] Natl Inst Adv Ind Sci & Technol, Tsukuba, Ibaraki, Japan
[2] Kyushu Inst Technol, Kitakyushu, Fukuoka, Japan
[3] Univ Tsukuba, Tsukuba, Ibaraki 305, Japan
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2003年 / 11卷 / 03期
关键词
array signal processing; blind signal separation; independent component analysis; permutation; room reflection;
D O I
10.1109/TSA.2003.809191
中图分类号
O42 [声学];
学科分类号
070206 [声学]; 082403 [水声工程];
摘要
In this paper, two array signal processing techniques are combined with independent component analysis (ICA) to enhance the performance of blind separation of acoustic signals in a reflective environment. The first technique is the subspace method which reduces the effect of room reflection, when the system is used in a room. Room reflection is one of the biggest problems in blind source separation (BSS) in acoustic environments. The second technique is a method of solving permutation. For employing the subspace method, ICA must be used in the frequency domain, and precise permutation is necessary for all frequencies. In this method, a physical property of the mixing matrix, i.e., the coherency in adjacent frequencies, is utilized to solve the permutation. The experiments in a meeting room showed that the subspace method improved the rate of automatic speech recognition from 50% to 68% and that the method of solving permutation achieves performance that closely approaches that of the correct permutation, differing by only 4% in recognition rate.
引用
收藏
页码:204 / 215
页数:12
相关论文
共 22 条
[1]
Stability analysis of learning algorithms for blind source separation [J].
Amari, S ;
Chen, TP ;
Cichocki, A .
NEURAL NETWORKS, 1997, 10 (08) :1345-1351
[2]
Speech enhancement based on the subspace method [J].
Asano, F ;
Hayamizu, S ;
Yamada, T ;
Nakamura, S .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (05) :497-507
[3]
Asano F., 2000, Second International Workshop on Independent Component Analysis and Blind Signal Separation. Proceedings, P411
[4]
Asano F., 2000, P ICA2000 JUN, P57
[5]
AN INFORMATION MAXIMIZATION APPROACH TO BLIND SEPARATION AND BLIND DECONVOLUTION [J].
BELL, AJ ;
SEJNOWSKI, TJ .
NEURAL COMPUTATION, 1995, 7 (06) :1129-1159
[6]
A blind source separation technique using second-order statistics [J].
Belouchrani, A ;
AbedMeraim, K ;
Cardoso, JF ;
Moulines, E .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1997, 45 (02) :434-444
[7]
RESOLVING THE DIRECTIONS OF SOURCES IN A CORRELATED FIELD INCIDENT ON AN ARRAY [J].
CANTONI, A ;
GODARA, LC .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1980, 67 (04) :1247-1255
[8]
A robust adaptive beamformer for microphone arrays with a blocking matrix using constrained adaptive filters [J].
Hoshuyama, O ;
Sugiyama, A ;
Hirano, A .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1999, 47 (10) :2677-2684
[9]
Hung H., 1985, IEEE T ACOUST SPEECH, V33, P823
[10]
FOCUSING MATRICES FOR COHERENT SIGNAL-SUBSPACE PROCESSING [J].
HUNG, HS ;
KAVEH, M .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1988, 36 (08) :1272-1281