Performance evaluation and comparison of G.729/AMR/fuzzy voice activity detectors

被引:46
作者
Beritelli, F [1 ]
Casale, S [1 ]
Ruggeri, G [1 ]
Serrano, S [1 ]
机构
[1] Univ Catania, Dept Informat & Telecommun Engn, I-95125 Catania, Italy
关键词
discontinuous transmission; speech quality evaluation; voice activity detector;
D O I
10.1109/97.995824
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The paper proposes a performance evaluation and comparison of G.729, AMR, and fuzzy voice activity detection (FVAD) algorithms. The comparison was made using objective, psychoacoustic, and subjective parameters. A highly varied speech database was also set up to evaluate the extent to which VADs depend on language, the signal-to-noise ratio (SNR), or the power level.
引用
收藏
页码:85 / 88
页数:4
相关论文
共 9 条
[1]  
[Anonymous], 1996, P800 ITUT
[2]  
[Anonymous], 1999, 0694 GSM
[3]   ITU-T recommendation G.729 Annex B: A silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications [J].
Benyassine, A ;
Shlomot, E ;
Su, HY ;
Massaloux, D ;
Lamblin, C ;
Petit, JP .
IEEE COMMUNICATIONS MAGAZINE, 1997, 35 (09) :64-73
[4]   A robust voice activity detector for wireless communications using soft computing [J].
Beritelli, F ;
Casale, S ;
Cavallaro, A .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 1998, 16 (09) :1818-1829
[5]   A psychoacoustic auditory model to evaluate the performance of a voice activity detector [J].
Beritelli, F ;
Casale, S ;
Ruggeri, G .
SIGNAL PROCESSING, 2000, 80 (07) :1393-1397
[6]  
BERITELLI F, 1998, IEEE INT C TEL ICT 9, V1, P223
[7]  
BERITELLI F, IN PRESS INT J SPEEC
[8]  
CAVALLARO A, 1999, THESIS U PALERMO PAL
[9]   Low bit-rate speech coders for multimedia communication [J].
Cox, RV ;
Kroon, P .
IEEE COMMUNICATIONS MAGAZINE, 1996, 34 (12) :34-41