A generalization of blind source separation algorithms for convolutive mixtures based on second-order statistics

被引:151
作者
Buchner, H [1 ]
Aichner, R [1 ]
Kellermann, W [1 ]
机构
[1] Univ Erlangen Nurnberg, D-91058 Erlangen, Germany
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2005年 / 13卷 / 01期
关键词
adaptive filtering; blind source separation (BSS); convolutive mixtures; multiple-input-multiple-output (MIMO) systems; second-order statistics;
D O I
10.1109/TSA.2004.838775
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we present a general broadband approach to blind source separation (BSS) for convolutive mixtures based on second-order statistics. This avoids several known limitations of the conventional narrowband approximation, such as the internal permutation problem. In contrast to traditional narrowband approaches, the new framework simultaneously exploits the nonwhiteness property and nonstationarity property of the source signals. Using a novel matrix formulation, we rigorously derive the corresponding time-domain and frequency-domain broadband algorithms by generalizing a known cost-function which inherently allows joint optimization for several time-lags of the correlations. Based on the broadband approach time-domain, constraints are obtained which provide a deeper understanding of the internal permutation problem in traditional narrowband frequency-domain BSS. For both the time-domain and the frequency-domain versions, we discuss links to well-known, and also, to novel algorithms that constitute special cases. Moreover, using the so-called generalized coherence, links between the time-domain and the frequency-domain algorithms can be established, showing that our cost function leads to an update equation with an inherent normalization ensuring a robust adaptation behaviour. The concept is applicable to offline, online, and block-online algorithms by introducing a general weighting function allowing for tracking of time-varying real acoustic environments.
引用
收藏
页码:120 / 134
页数:15
相关论文
共 43 条
[1]  
Aichner R, 2002, NEURAL NETWORKS FOR SIGNAL PROCESSING XII, PROCEEDINGS, P445, DOI 10.1109/NNSP.2002.1030056
[2]  
AICHNER R, 2004, P JOINT M GERM FRENC, P583
[3]  
AICHNER R, 2004, P INT S ICA GRAN SPA, P833
[4]  
AICHNER R, 2003, P 4 INT S IND COMP A, P987
[5]  
Amari S, 1996, ADV NEUR IN, V8, P757
[6]  
ANEMULLER J, 2001, THESIS OLDENBURG U I
[7]  
[Anonymous], 2002, Adaptive Blind Signal and Image Processing: Learning Algorithms and Applications
[8]  
[Anonymous], J LOND MATH SOC
[9]   The fundamental limitation of frequency domain blind source separation for convolutive mixtures of speech [J].
Araki, S ;
Mukai, R ;
Makino, S ;
Nishikawa, T ;
Saruwatari, H .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (02) :109-116
[10]  
Buchner H, 2003, SIG COM TEC, P95