A perceptually based embedded subband speech coder

Cited by: 11
Authors
Tang, B
Shen, A
Alwan, A
Pottie, G
Affiliations
[1] Intel Corp., Hillsboro, OR 97124
[2] Univ. of California, Los Angeles, Dept. of Electrical Engineering, Los Angeles, CA 90095
Source
IEEE Transactions on Speech and Audio Processing | 1997, Vol. 5, No. 2
Funding
National Science Foundation (USA);
Keywords
speech coding; subband coding; embedded coding; perceptual metrics;
DOI
10.1109/89.554775
Chinese Library Classification number
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
A new scheme for robust, high-quality, embedded speech coding based on subband decomposition and perceptually optimized bit allocation and prioritization is presented. An infinite impulse response (IIR) quadrature mirror filterbank (QMF) performs subband decomposition. A perceptual model, computed using subband spectral analysis, optimizes the coder's perceptual quality. Dynamic bit allocation and prioritization is combined with embedded quantization, resulting in little performance degradation relative to a nonembedded implementation. The coder output is scalable from high quality at higher bit rates to lower quality at lower bit rates, supporting a wide range of service and resource utilization. The lower bit-rate representation is obtained simply through truncation of the higher bit-rate representation. Since source-rate adaptation is performed through truncation of the encoded stream, interaction with the coder is not required, making the embedded coder ideally suited for rate-adaptive communication systems. Performance for both speech and music was verified through subjective listening tests.
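The truncation property described in the abstract can be illustrated with a minimal sketch. This is not the authors' coder: it is a generic successive-approximation (embedded) scalar quantizer for a single sample, and the names `encode` and `decode`, the input range [-1, 1), and the 8-bit depth are all assumptions made for illustration. It shows only that a shorter code is a prefix of a longer one, so a lower-rate representation can be obtained by dropping trailing bits without re-running the encoder.

```python
def encode(x, bits):
    """Hypothetical embedded quantizer: code x in [-1, 1), most significant bit first."""
    lo, hi = -1.0, 1.0
    code = []
    for _ in range(bits):
        mid = (lo + hi) / 2
        if x >= mid:
            code.append(1)   # x lies in the upper half of the current interval
            lo = mid
        else:
            code.append(0)   # x lies in the lower half of the current interval
            hi = mid
    return code


def decode(code):
    """Reconstruct from any prefix of the code; fewer bits give a coarser value."""
    lo, hi = -1.0, 1.0
    for bit in code:
        mid = (lo + hi) / 2
        if bit:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2     # midpoint of the interval that is still possible


full = encode(0.3, 8)        # "high-rate" representation
for k in (8, 4, 2):          # truncate the same stream to lower rates
    print(k, decode(full[:k]))
```

With k retained bits the reconstruction error of this sketch is at most 2^-k, so quality degrades gradually as the stream is truncated. A full coder in the spirit of the abstract would additionally order bits across subbands by perceptual priority, which this sketch does not attempt.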
Pages: 131-140
Number of pages: 10