OBJECTIVE SPEECH QUALITY ASSESSMENT AND THE RPE-LTP CODING ALGORITHM IN DIFFERENT NOISE AND LANGUAGE CONDITIONS

Cited by: 8
Authors
HANSEN, JHL
NANDKUMAR, S
Affiliation
[1] Robust Speech Processing Laboratory, Department of Electrical Engineering, Duke University, Durham, North Carolina 27708-0291
DOI
10.1121/1.412283
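Chinese Library Classification (CLC) number
O42 [Acoustics];
Subject classification code
070206 [Acoustics]; 082403 [Underwater Acoustic Engineering];
Abstract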
The formulation of reliable signal processing algorithms for speech coding and synthesis requires the selection of a prior criterion of performance. Though coding efficiency (bits/second) or computational requirements can be used, a final performance measure must always include speech quality. In this paper, three objective speech quality measures are considered with respect to quality assessment for American English, noisy American English, and noise-free versions of seven languages. The purpose is to determine whether objective quality measures can be used to quantify changes in quality for a given voice coding method, with a known subjective performance level, as background noise or language conditions are changed. The speech coding algorithm chosen is regular-pulse excitation with long-term prediction (RPE-LTP), which has been adopted as the standard voice compression algorithm for the European Digital Mobile Radio system. Three areas are considered for objective quality assessment: (i) vocoder performance for American English in a noise-free environment, (ii) speech quality variation for three additive background noise sources, and (iii) noise-free performance for seven languages: English, Japanese, Finnish, German, Hindi, Spanish, and French. It is suggested that although existing objective quality measures will never replace subjective testing, they can be a useful means of assessing changes in performance, identifying areas for improvement in algorithm design, and augmenting subjective quality tests for voice coding/compression algorithms in noise-free, noisy, and/or non-English applications. © 1995, Acoustical Society of America. All rights reserved.
Pages: 609-627
Number of pages: 19