Low bit-rate speech coders for multimedia communication

Cited by: 46
Authors
Cox, RV [1 ]
Kroon, P [1 ]
Affiliation
[1] AT&T Bell Labs, Lucent Technologies, Murray Hill, NJ 07974
DOI
10.1109/35.556484
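Chinese Library Classification number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract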
The International Telecommunication Union (ITU) has recently standardized three speech coders which are applicable to low-bit-rate multimedia communications. ITU Rec. G.729 8 kb/s CS-ACELP has a 15 ms algorithmic codec delay and provides network-quality speech. It was originally designed for wireless applications, but is applicable to multimedia communications as well. Annex A of Rec. G.729 is a reduced-complexity version of the CS-ACELP coder. It was designed explicitly for simultaneous voice and data applications that are prevalent in low-bit-rate multimedia communications. These two coders use the same bitstream format and can interoperate. The ITU Rec. G.723.1 6.3 and 5.3 kb/s speech coder for multimedia communications was designed originally for low-bit-rate videophones. Its frame size of 30 ms and one-way algorithmic codec delay of 37.5 ms allow for a further reduction in bit rate compared to the G.729 coder. In applications where low delay is important, the delay of G.723.1 may be too large. However, if the delay is acceptable, G.723.1 provides a lower-complexity alternative to G.729 at the expense of a slight degradation in quality. This article describes the attributes of speech coders such as bit rate, complexity, delay, and quality. Then it discusses the basic concepts of the three new ITU coders by comparing their specific attributes. The second part of this article describes the standardization process for each of these coders.
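The G.723.1 figures quoted in the abstract can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the common model in which algorithmic delay equals frame length plus look-ahead, and the function names are hypothetical, not taken from the article. The quoted bit rates are nominal (rounded), so the exact per-frame bit allocation in the Recommendation may differ slightly from the products computed here. The G.729 frame length is not stated in the abstract, so it is left out rather than guessed.

```python
# Nominal G.723.1 figures as quoted in the abstract.
G7231_FRAME_MS = 30.0
G7231_DELAY_MS = 37.5
G7231_RATES_KBPS = (6.3, 5.3)


def lookahead_ms(algorithmic_delay_ms: float, frame_ms: float) -> float:
    """Look-ahead implied by the assumed model: delay = frame + look-ahead."""
    return algorithmic_delay_ms - frame_ms


def nominal_bits_per_frame(rate_kbps: float, frame_ms: float) -> float:
    """kb/s times ms gives bits, because the two factors of 1000 cancel."""
    return rate_kbps * frame_ms


if __name__ == "__main__":
    print("implied look-ahead (ms):", lookahead_ms(G7231_DELAY_MS, G7231_FRAME_MS))
    for rate in G7231_RATES_KBPS:
        bits = nominal_bits_per_frame(rate, G7231_FRAME_MS)
        print(f"{rate} kb/s -> about {round(bits)} bits per {G7231_FRAME_MS:g} ms frame")
```

Under these assumptions the 37.5 ms delay corresponds to a 30 ms frame plus 7.5 ms of look-ahead, which is why the abstract flags G.723.1 as potentially too slow for delay-sensitive applications.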
Pages: 34-41
Page count: 8
Related papers
8 in total
[1] Adoul J., 1987, Proceedings: ICASSP 87. 1987 International Conference on Acoustics, Speech, and Signal Processing (Cat. No.87CH2396-0), P1957
[2] Atal B. S., 1982, Proceedings of ICASSP 82. IEEE International Conference on Acoustics, Speech and Signal Processing, P614
[3] Atal B. S., Schroeder M. R. Predictive coding of speech signals and subjective error criteria [J]. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1979, 27(3): 247-254
[4] Gersho A. Advances in speech and audio compression [J]. Proceedings of the IEEE, 1994, 82(6): 900-918
[5] Kleijn W. B., 1995, Speech Coding and Synthesis
[6] Rabiner L. R., 1978, DIGITAL PROCESSING S
[7] Schroeder M. R., 1985, P IEEE INT C AC SPEE, V10, P937, DOI 10.1109/ICASSP.1985.1168147
[8] Spanias A. S. Speech coding: a tutorial review [J]. Proceedings of the IEEE, 1994, 82(10): 1541-1582