An Improved Permutation Solver for Blind Signal Separation Based Front-Ends in Robot Audition

Cited by: 9
Authors
Even, Jani [1]
Saruwatari, Hiroshi [1]
Shikano, Kiyohiro [1]
Affiliations
[1] Nara Inst Sci & Technol Ikoma, Grad Sch Informat Sci, Nara, Japan
Source
2008 IEEE/RSJ INTERNATIONAL CONFERENCE ON ROBOTS AND INTELLIGENT SYSTEMS, VOLS 1-3, CONFERENCE PROCEEDINGS | 2008
Keywords
DOI
10.1109/IROS.2008.4650602
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The model of the human/machine hands-free speech interface is defined as a point source (the user's voice) and diffuse background noise. This situation is very different from the usual cocktail party model, that is, the separation of a mixture of speech signals, which is usually treated in frequency-domain blind signal separation (FD-BSS). In particular, the fast permutation solvers proposed for the cocktail party model result in poor separation performance in this case. To resolve the permutation more efficiently, this paper proposes a new approach that exploits the statistical discrepancy between the target speech and the diffuse background noise.
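The abstract does not say which statistic the authors use, so the following is only a minimal sketch of the general idea, not the paper's method. It assumes that in each frequency bin the speech output is "peakier" (more sparse in time) than the diffuse-noise output, and it uses a normalized fourth moment as a stand-in measure of that discrepancy. The function names `peakiness` and `align_by_peakiness` and the array layout are hypothetical.

```python
import numpy as np

def peakiness(x):
    """Normalized fourth moment of |x| for a complex 1-D series.

    Assumption: a higher value indicates a sparser, more speech-like signal.
    A circular complex Gaussian gives a value close to 2.
    """
    power = np.abs(x) ** 2
    return np.mean(power ** 2) / (np.mean(power) ** 2 + 1e-12)

def align_by_peakiness(Y):
    """Reorder the separated outputs of every frequency bin.

    Y has shape (n_bins, n_outputs, n_frames) and holds the FD-BSS outputs
    with an unknown output order in each bin. After alignment, output 0 of
    every bin is the peakiest one (assumed here to be the target speech).
    Returns the aligned array and the original index chosen in each bin.
    """
    n_bins, n_outputs, _ = Y.shape
    aligned = np.empty_like(Y)
    chosen = np.empty(n_bins, dtype=int)
    for f in range(n_bins):
        scores = np.array([peakiness(Y[f, k]) for k in range(n_outputs)])
        order = np.argsort(-scores)   # descending peakiness
        aligned[f] = Y[f, order]
        chosen[f] = order[0]
    return aligned, chosen
```

Because each bin is scored independently, a rule of this kind needs no comparison between neighbouring bins, which is one plausible reason a statistics-based solver could remain fast; whether it is robust on real recordings depends on how well the chosen statistic separates speech from noise.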
Pages: 2172-2177
Page count: 6
Related papers
13 in total
[1]   AN INFORMATION MAXIMIZATION APPROACH TO BLIND SEPARATION AND BLIND DECONVOLUTION [J].
BELL, AJ ;
SEJNOWSKI, TJ .
NEURAL COMPUTATION, 1995, 7 (06) :1129-1159
[2]  
CAPDEVIELLE V, 1995, INT CONF ACOUST SPEE, P2080, DOI 10.1109/ICASSP.1995.478484
[3]   INDEPENDENT COMPONENT ANALYSIS, A NEW CONCEPT [J].
COMON, P .
SIGNAL PROCESSING, 1994, 36 (03) :287-314
[4]  
Ito K., 1999, J ACOUST SOC JPN, V20, P196
[5]  
Kim T, 2006, LECT NOTES COMPUT SC, V3889, P165
[6]  
MEJUTO C, 2000, P INT WORKSH ICA SIG, P315
[7]   Convolutive blind separation of non-stationary sources [J].
Parra, L ;
Spence, C .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (03) :320-327
[8]  
Pedersen M. S., 2007, Springer handbook on Speech Processing and Speech Communication
[9]   Blind source separation combining independent component analysis and beamforming [J].
Saruwatari, H ;
Kurita, S ;
Takeda, K ;
Itakura, F ;
Nishikawa, T ;
Shikano, K .
EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2003, 2003 (11) :1135-1146
[10]   A robust and precise method for solving the permutation problem of frequency-domain blind source separation [J].
Sawada, H ;
Mukai, R ;
Araki, S ;
Makino, S .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2004, 12 (05) :530-538