ADVANCES IN SPEECH AND AUDIO COMPRESSION

被引:91
作者
GERSHO, A
机构
[1] Center for Information Processing Research, Department of Electrical and Computer Engineering, University of California, Santa Barbara
基金
美国国家科学基金会;
关键词
D O I
10.1109/5.286194
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Speech and audio compression has advanced rapidly in recent years spurred on by cost-effective digital technology and diverse commercial applications. Recent activity in speech compression is dominated by research and development of a family of techniques commonly described as code-excited linear prediction (CELP) coding. These algorithms exploit models of speech production and auditory perception and offer a quality versus bit rate tradeoff that significantly exceeds most prior compression techniques for rates in the range of 4 to 16 kb/s. Techniques have also been emerging in recent years that offer enhanced quality in the neighborhood of 2.4 kb/s over traditional vocoder methods. Wideband audio compression is generally aimed at a quality that is nearly indistinguishable from consumer compact-disc audio. Subband and transform coding methods combined with sophisticated perceptual coding techniques dominate in this arena with nearly transparent quality achieved at bit rates in the neighborhood of 128 kb/s per channel.
引用
收藏
页码:900 / 918
页数:19
相关论文
共 268 条
[1]  
Adoul J., 1987, Proceedings: ICASSP 87. 1987 International Conference on Acoustics, Speech, and Signal Processing (Cat. No.87CH2396-0), P1957
[2]  
AETA BS, 1982, MAY IEEE INT C AC SP, V1, P614
[3]  
AKAMINE M, 1991, 1991 P IEEE INT S CI, V1, P586
[4]  
AKAMINE M, 1990, P IEEE INT C AC SPEE, V1, P29
[5]  
Almeida L. B., 1982, Proceedings of ICASSP 82. IEEE International Conference on Acoustics, Speech and Signal Processing, P1664
[6]  
ANDREOTTI FG, 1991, P IEEE INT C ACOUSTI, V1, P621
[7]  
ATAL B, 1993, SPEECH AUDIO CODING
[8]  
Atal B. S., 1991, ADV SPEECH CODING
[9]   SPEECH ANALYSIS AND SYNTHESIS BY LINEAR PREDICTION OF SPEECH WAVE [J].
ATAL, BS ;
HANAUER, SL .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1971, 50 (02) :637-+
[10]  
ATAL BS, 1991, ADV SPEECH CODING, P29