An efficient, low-complexity audio coder delivering multiple levels of quality for interactive applications

被引:25
作者
Lu, ZT [1 ]
Pearlman, WA [1 ]
机构
[1] Rensselaer Polytech Inst, Dept Elect Comp & Syst Engn, Troy, NY 12180 USA
来源
1998 IEEE SECOND WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING | 1998年
关键词
D O I
10.1109/MMSP.1998.739035
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes an efficient, low complexity audio coder based on the SPIHT (set partitioning in hierarchical trees) coding algorithm [5], which has achieved notable success in still image coding. A wavelet packet transform is used to decompose the audio signal into 29 frequency subbands corresponding roughly to the critical subbands of the human auditory system. A psychoacoustic model,which, for simplicity, is based on MPEG model I, is used to calculate the signal to mask ratio, and then calculate the bit rate allocation among subbands. We distinguish the subbands into two groups: the low frequency group which contains the first 17 subbands corresponding to 0-3.4 KHz, and the high frequency group which contains the remaining high frequency subbands. The SPIHT algorithm is used to encode and decode the low frequency group and a reverse sorting process plus arithmetic coding algorithm is used to encode and decode the high frequency group. The experiment shows that this coder yields nearly transparent quality at bit rates 55-68 Kbits/sec, and degrades only gradually at lower rates. The low complexity of this coding system shows its potential for interactive applications with levels of quality from good to perceptually transparent.
引用
收藏
页码:529 / 534
页数:6
相关论文
共 7 条
[1]  
BOLAND M, 1996, P IEEE INT C AC SPEE, V2, P1041
[2]   ORTHONORMAL BASES OF COMPACTLY SUPPORTED WAVELETS [J].
DAUBECHIES, I .
COMMUNICATIONS ON PURE AND APPLIED MATHEMATICS, 1988, 41 (07) :909-996
[3]  
HAMDY KN, 1996, P IEEE INT C AC SPEE, V2, P1045
[4]  
PURAT M, 1996, P IEEE INT C ASSP MA, V2, P1021
[5]   A new, fast, and efficient image codec based on set partitioning in hierarchical trees [J].
Said, A ;
Pearlman, WA .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1996, 6 (03) :243-250
[6]   LOW BIT-RATE TRANSPARENT AUDIO COMPRESSION USING ADAPTED WAVELETS [J].
SINHA, DP ;
TEWFIK, AH .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1993, 41 (12) :3463-3479
[7]  
TRINKAUS TR, 1995, THESIS RENSSELAER PO