DIGITAL AUDIO CODING FOR VISUAL COMMUNICATIONS

被引:15
作者
NOLL, P
机构
[1] Technische Universität Berlin, Institut für Fernmeldetechnik
关键词
D O I
10.1109/5.387093
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Current and future visual communications for applications such as broadcasting, videotelephony, video- and audiographic-conferencing, and interactive multimedia services assume a substantial audio component. Even text, graphics, fax, still images, email documents, etc. will gain from voice annotation and audio clips. A wide range of speech, wideband speech, and wideband audio coders is available for such applications. In the context of audiovisual communications, the quality of telephone-bandwidth speech is acceptable for some videotelephony and videoconferencing services. Higher bandwidths (wideband speech) may be necessary to improve the intelligibility and naturalness of speech. High quality audio coding including multichannel audio will be necessary in advanced digital TV and multimedia services. This paper explains basic approaches to speech, wideband speech, and audio bit rate compressions in audiovisual communications. These signal classes differ in bandwidth, dynamic range, and in listener expectation of offered quality. It will become obvious that the use of our knowledge of auditory perception helps minimizing perception of coding artifacts and leads to efficient low bit rate coding algorithms which can achieve substantially more compression than was thought possible only a few years ago. The paper concentrates on worldwide source coding standards beneficial for consumers, service providers, and manufacturers.
引用
收藏
页码:925 / 943
页数:19
相关论文
共 76 条
[1]  
Aizawa K., 1989, Signal Processing: Image Communication, V1, P139, DOI 10.1016/0923-5965(89)90006-4
[2]   DIGITAL TELEVISION [J].
ANASTASSIOU, D .
PROCEEDINGS OF THE IEEE, 1994, 82 (04) :510-519
[3]  
ATAL B, 1993, SPEECH AUDIO CODING
[4]  
Atal B. S., 1982, Proceedings of ICASSP 82. IEEE International Conference on Acoustics, Speech and Signal Processing, P614
[5]  
BONICEL P, 1993, 94ND AUD ENG SOC CON
[6]  
BRANDENBURG K, 1992, 92ND AUD ENG SOC CON
[7]  
BRANDENBURG K, 1993, 95TH AUD ENG SOC CON
[8]   EMERGING RESIDENTIAL BROAD-BAND TELECOMMUNICATIONS [J].
BURPEE, DS ;
SHUMATE, PW .
PROCEEDINGS OF THE IEEE, 1994, 82 (04) :604-614
[9]  
Campbell J. P. Jr., 1991, Digital Signal Processing, V1, P145, DOI 10.1016/1051-2004(91)90106-U
[10]  
CHEN H, 1994, ISOIECWG11 DOC