ITU-T recommendation G.729 Annex B: A silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications

Cited by: 217
Authors
Benyassine, A
Shlomot, E
Su, HY
Massaloux, D
Lamblin, C
Petit, JP
Affiliations
[1] Rockwell International, Newport Beach, CA
[2] University of Texas, Austin, TX
[3] Technion, Israel Inst. of Technology
[4] Efrat Future Technology, Tel-Aviv
[5] University of California, Santa Barbara, CA
[6] Chinese Univ. of Sci. and Technology, Hefei
[7] IRISA, Rennes
[8] Bell Northern Research, Montreal, Que.
[9] Ecl. Natl. Sup. des Telecom., Paris
[10] Image Laboratory, ENST
[11] ENST, Paris
[12] CNET, Lannion
DOI
10.1109/35.620527
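Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809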
Abstract
This article describes the recently adopted Annex B to ITU-T Recommendation G.729. Annex B defines a low-bit-rate silence compression scheme designed and optimized to work in conjunction with both the full version of G.729 and its low-complexity Annex A. To achieve good-quality low-bit-rate silence compression, a robust frame-based voice activity detector module is essential to detect inactive voice frames, also called silence or background noise frames. For these detected inactive voice frames, a discontinuous transmission module measures the changes over time of the inactive voice signal characteristics and decides whether a new silence information descriptor frame should be sent to maintain the reproduction quality of the background noise at the receiving end. If such a frame is needed, the spectrum and energy parameters describing the perceptual characteristics of the background noise are efficiently coded and transmitted using 15 bits per frame. At the receiving end, the comfort noise generation module regenerates the output background noise using transmitted updates or previously available parameters. The synthesized background noise is obtained by linear predictive filtering of a locally generated pseudo-white excitation signal of a controlled level. This method of coding the background noise yields bit-rate savings for coded speech, with average rates as low as 4 kb/s during normal speech conversation, while maintaining reproduction quality.
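The comfort noise step described in the abstract (linear predictive filtering of a pseudo-white excitation of controlled level) can be illustrated with a minimal sketch. It is not the Annex B algorithm: the function name `comfort_noise`, the plain Gaussian excitation, the first-order coefficient, and the gain value are all assumptions chosen for illustration. Only the 80-sample frame (10 ms at 8 kHz) matches G.729 framing.

```python
import random


def comfort_noise(lpc, gain, n, seed=0):
    """Illustrative all-pole synthesis (hypothetical helper, not from Annex B).

    Computes y[t] = gain * e[t] - sum_k lpc[k] * y[t-1-k],
    where e[t] is pseudo-white Gaussian noise with unit variance.
    """
    rng = random.Random(seed)          # local generator, reproducible per seed
    history = [0.0] * len(lpc)         # past outputs, most recent first
    out = []
    for _ in range(n):
        excitation = gain * rng.gauss(0.0, 1.0)   # controlled-level excitation
        y = excitation - sum(a * h for a, h in zip(lpc, history))
        out.append(y)
        if history:
            history = [y] + history[:-1]          # shift the filter memory
    return out


# One 10 ms frame at 8 kHz; the coefficient and gain are assumed values.
frame = comfort_noise(lpc=[-0.9], gain=100.0, n=80)
```

In this sketch the `lpc` list stands in for the transmitted spectrum parameters and `gain` for the transmitted energy parameter, so a receiver would reuse the last values until a new descriptor frame arrives.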
Pages: 64-73
Page count: 10