A psychoacoustic auditory model to evaluate the performance of a voice activity detector

被引:2
作者
Beritelli, F [1 ]
Casale, S [1 ]
Ruggeri, G [1 ]
机构
[1] Univ Catania, Fac Engn, Dipartimento Ingn Informat & Telecomunicazioni, I-92125 Catania, Italy
关键词
D O I
10.1016/S0165-1684(00)00111-0
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A new psychoacoustic auditory model to evaluate the subjective performance of a voice activity detector (VAD) is presented in this letter. The mathematical model proposed makes it possible to pass from the power spectral density of the speech signal processed by a VAD to analysis of the subjective loudness density and thus to subjective measures expressed in terms of comparison mean opinion scores (CMOS). In case studies, the correlation between the measured and predicted CMOS values always remained above 0.9, using traditional analytical methods such as regression curves. (C) 2000 Published by Elsevier Science B.V. All rights reserved.
引用
收藏
页码:1393 / 1397
页数:5
相关论文
共 7 条
[1]   ITU-T recommendation G.729 Annex B: A silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications [J].
Benyassine, A ;
Shlomot, E ;
Su, HY ;
Massaloux, D ;
Lamblin, C ;
Petit, JP .
IEEE COMMUNICATIONS MAGAZINE, 1997, 35 (09) :64-73
[2]   A robust voice activity detector for wireless communications using soft computing [J].
Beritelli, F ;
Casale, S ;
Cavallaro, A .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 1998, 16 (09) :1818-1829
[3]  
BERITELLI F, 1998, 1916 ITUT
[4]   SUBJECTIVE EFFECTS OF VARIABLE DELAY AND SPEECH CLIPPING IN DYNAMICALLY MANAGED VOICE SYSTEMS [J].
GRUBER, JG ;
STRAWCZYNSKI, L .
IEEE TRANSACTIONS ON COMMUNICATIONS, 1985, 33 (08) :801-808
[5]  
KROON P, 1996, IEEE COMM MAG, V34, P34
[6]   Objective estimation of perceived speech quality - Part I: Development of the measuring normalizing block technique [J].
Voran, S .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (04) :371-382
[7]   AN OBJECTIVE-MEASURE FOR PREDICTING SUBJECTIVE QUALITY OF SPEECH CODERS [J].
WANG, SH ;
SEKEY, A ;
GERSHO, A .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 1992, 10 (05) :819-829