The 2010 Signal Separation Evaluation Campaign (SiSEC2010): Audio Source Separation

被引:21
作者
Araki, Shoko
Ozerov, Alexey
Gowreesunker, Vikrham
Sawada, Hiroshi
Theis, Fabian
Nolte, Guido
Lutter, Dominik
Duong, Ngoc Q. K.
机构
来源
LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION | 2010年 / 6365卷
关键词
CONVOLUTIVE MIXTURES;
D O I
10.1007/978-3-642-15995-4_15
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper introduces the audio part of the 2010 community-based Signal Separation Evaluation Campaign (SiSEC2010). Seven speech and music datasets were contributed, which include datasets recorded in noisy or dynamic environments, in addition to the SiSEC2008 datasets. The source separation problems were split into five tasks, and the results for each task were evaluated using different objective performance criteria. We provide an overview of the audio datasets, tasks and criteria. We also report the results achieved with the submitted systems, and discuss organization strategies for future campaigns.
引用
收藏
页码:114 / 122
页数:9
相关论文
共 31 条
[1]  
Araki S., 2007, P IEEE INT C AC SPEE, P41
[2]  
Arberet S, 2010, P ISSPA
[3]  
Bonada J., 2006, P AES
[4]   Monaural speech separation and recognition challenge [J].
Cooke, Martin ;
Hershey, John R. ;
Rennie, Steven J. .
COMPUTER SPEECH AND LANGUAGE, 2010, 24 (01) :1-15
[5]  
Dang HTV, 2010, INT CONF ACOUST SPEE, P241, DOI 10.1109/ICASSP.2010.5495994
[6]   Under-Determined Reverberant Audio Source Separation Using Local Observed Covariance and Auditory-Motivated Time-Frequency Representation [J].
Duong, Ngoc Q. K. ;
Vincent, Emmanuel ;
Gribonval, Remi .
LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION, 2010, 6365 :73-80
[7]  
Emiya V., IEEE T AUDIO S UNPUB
[8]   SPEECH ENHANCEMENT IN PRESENCE OF DIFFUSE BACKGROUND NOISE: WHY USING BLIND SIGNAL EXTRACTION? [J].
Even, Jani ;
Saruwatari, Hiroshi ;
Shikano, Kiyorhiro ;
Takatani, Tomoya .
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, :4770-4773
[9]   PEMO-Q - A new method for objective: Audio quality assessment using a model of auditory perception [J].
Huber, Rainer ;
Kollmeier, Birger .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (06) :1902-1911
[10]   Time-Domain Blind Audio Source Separation Method Producing Separating Filters of Generalized Feedforward Structure [J].
Koldovsky, Zbynek ;
Tichavsky, Petr ;
Malek, Jiri .
LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION, 2010, 6365 :17-+