Single-Channel Source Separation Using EMD-Subband Variable Regularized Sparse Features

被引:95
作者
Gao, Bin [1 ]
Woo, W. L. [1 ]
Dlay, S. S. [1 ]
机构
[1] Newcastle Univ, Sch Elect Elect & Comp Engn, Newcastle Upon Tyne NE1 7RU, Tyne & Wear, England
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2011年 / 19卷 / 04期
关键词
Audio processing; blind source separation (BSS); empirical mode decomposition (EMD); non-negative matrix factorization (NMF); single-channel source separation (SCSS); sparse features; BLIND SOURCE SEPARATION; NONNEGATIVE MATRIX FACTORIZATION; SIGNAL SEPARATION; MONAURAL SPEECH;
D O I
10.1109/TASL.2010.2072500
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A novel approach to solve the single-channel source separation (SCSS) problem is presented. Most existing supervised SCSS methods resort exclusively to the independence waveform criteria as exemplified by training the prior information before the separation process. This poses a significant limiting factor to the applicability of these methods to real problem. Our proposed method does not require training knowledge for separating the mixture and it is based on decomposing the mixture into a series of oscillatory components termed as the intrinsic mode functions (IMFs). We show, in this paper, that the IMFs have several desirable properties unique to SCSS problem and how these properties can be advantaged to relax the constraints posed by the problem. In addition, we have derived a novel sparse non-negative matrix factorization to estimate the spectral bases and temporal codes of the sources. The proposed algorithm is a more complete and efficient approach to matrix factorization where a generalized criterion for variable sparseness is imposed onto the solution. Experimental testing has been conducted to show that the proposed method gives superior performance over other existing approaches.
引用
收藏
页码:961 / 976
页数:16
相关论文
共 44 条
[41]   Neural network approach to blind signal separation of mono-nonlinearly mixed sources [J].
Woo, WL ;
Dlay, SS .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2005, 52 (06) :1236-1247
[42]   A study of the characteristics of white noise using the empirical mode decomposition method [J].
Wu, ZH ;
Huang, NE .
PROCEEDINGS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2004, 460 (2046) :1597-1611
[43]   Blind separation of speech mixtures via time-frequency masking [J].
Yilmaz, Ö ;
Rickard, S .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2004, 52 (07) :1830-1847
[44]   Blind source separation of postnonlinear convolutive mixture [J].
Zhang, Jingyi ;
Woo, W. L. ;
Dlay, S. S. .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (08) :2311-2330