Robust voice activity detection using perceptual wavelet-packet transform and Teager energy operator

被引:23
作者
Chen, Shi-Huang [1 ]
Wu, Hsin-Te
Chang, Yukon
Truong, T. K.
机构
[1] Shu Te Univ, Dept Comp Sci & Informat Engn, Kaohsiung 824, Taiwan
[2] I Shou Univ, Dept Informat Engn, Kaohsiung 840, Taiwan
关键词
voice activity detection (VAD); perceptual wavelet-packet transform (PWPT); Teager energy operator (TEO);
D O I
10.1016/j.patrec.2006.11.023
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this letter, a robust voice activity detection (VAD) algorithm is presented. This proposed VAD algorithm makes use of the perceptual wavelet-packet transform and the Teager energy operator to compute a robust parameter called voice activity shape for VAD. The main advantage of this algorithm is that the preset threshold values or a priori knowledge of the SNR usually needed in conventional VAD methods can be completely avoided. Various experimental results show that the proposed VAD algorithm is capable of outperforming the VAD of Adaptive Multi Rate (AMR) speech codec in both additive noisy and real noisy environments. (c) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:1327 / 1332
页数:6
相关论文
共 17 条
[1]  
Addison P.S., 2002, ILLUSTRATED WAVELET, V1st ed.
[2]  
[Anonymous], 1993, Ten Lectures of Wavelets
[3]   Wavelet speech enhancement based on the Teager Energy operator [J].
Bahoura, M ;
Rouat, J .
IEEE SIGNAL PROCESSING LETTERS, 2001, 8 (01) :10-12
[4]  
Burrus C.S., 1998, introduction to Wavelets and Wavelet Transforms-A Primer
[5]   Perceptual speech coding and enhancement using frame-synchronized fast wavelet packet transform algorithms [J].
Carnero, B ;
Drygajlo, A .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1999, 47 (06) :1622-1635
[6]   SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR [J].
EPHRAIM, Y ;
MALAH, D .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (06) :1109-1121
[7]  
*ETSI, 301708 ETSI EN
[8]  
*ITU, 1996, G729 ITU T REC
[9]   Teager energy based feature parameters for speech recognition in car noise [J].
Jabloun, F ;
Çetin, AE ;
Erzin, E .
IEEE SIGNAL PROCESSING LETTERS, 1999, 6 (10) :259-261
[10]   Wavelet threshold estimators for data with correlated noise [J].
Johnstone, IM ;
Silverman, BW .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1997, 59 (02) :319-351