DIGITAL AUDIO CODING FOR VISUAL COMMUNICATIONS

被引:15
作者
NOLL, P
机构
[1] Technische Universität Berlin, Institut für Fernmeldetechnik
关键词
D O I
10.1109/5.387093
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Current and future visual communications for applications such as broadcasting, videotelephony, video- and audiographic-conferencing, and interactive multimedia services assume a substantial audio component. Even text, graphics, fax, still images, email documents, etc. will gain from voice annotation and audio clips. A wide range of speech, wideband speech, and wideband audio coders is available for such applications. In the context of audiovisual communications, the quality of telephone-bandwidth speech is acceptable for some videotelephony and videoconferencing services. Higher bandwidths (wideband speech) may be necessary to improve the intelligibility and naturalness of speech. High quality audio coding including multichannel audio will be necessary in advanced digital TV and multimedia services. This paper explains basic approaches to speech, wideband speech, and audio bit rate compressions in audiovisual communications. These signal classes differ in bandwidth, dynamic range, and in listener expectation of offered quality. It will become obvious that the use of our knowledge of auditory perception helps minimizing perception of coding artifacts and leads to efficient low bit rate coding algorithms which can achieve substantially more compression than was thought possible only a few years ago. The paper concentrates on worldwide source coding standards beneficial for consumers, service providers, and manufacturers.
引用
收藏
页码:925 / 943
页数:19
相关论文
共 76 条
[21]   ADVANCES IN SPEECH AND AUDIO COMPRESSION [J].
GERSHO, A .
PROCEEDINGS OF THE IEEE, 1994, 82 (06) :900-918
[22]  
Gersho A., 1991, VECTOR QUANTIZATION
[23]  
GERSON IA, 1990, APR INT C AC SPEECH, P461
[24]   TRENDS IN CELLULAR AND CORDLESS COMMUNICATIONS [J].
GOODMAN, DJ .
IEEE COMMUNICATIONS MAGAZINE, 1991, 29 (06) :31-40
[25]  
GRILL B, 1994, 96TH AUD ENG SOC CON
[26]  
HATHAWAY GT, 1992, AUDIVISUAL TELECOMMU, P74
[27]  
HERRE J, 1994, 96TH AUD ENG SOC CON
[28]   DIGITAL COMPACT CASSETTE [J].
HOOGENDOORN, A .
PROCEEDINGS OF THE IEEE, 1994, 82 (10) :1479-1489
[29]   CHOOSING AN AMERICAN DIGITAL HDTV TERRESTRIAL BROADCASTING SYSTEM [J].
HOPKINS, R .
PROCEEDINGS OF THE IEEE, 1994, 82 (04) :554-563
[30]   A 128KB/S HI-FI AUDIO CODEC BASED ON ADAPTIVE TRANSFORM CODING WITH ADAPTIVE BLOCK SIZE MDCT [J].
IWADARE, M ;
SUGIYAMA, A ;
HAZU, F ;
HIRANO, A ;
NISHITANI, T .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 1992, 10 (01) :138-144