Vector quantization based on Gaussian mixture models

被引:115
作者
Hedelin, P [1 ]
Skoglund, J [1 ]
机构
[1] Chalmers, Dept Signals & Syst, Informat Theory Grp, SE-41296 Gothenburg, Sweden
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2000年 / 8卷 / 04期
关键词
D O I
10.1109/89.848220
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we model the underlying probability density function of vectors in a database as a Gaussian mixture (GM) model. The model is employed for high rate vector quantization analysis and for design of vector quantizers. It is shown that the high rate formulas accurately predict the performance of model-based quantizers. We propose a novel method for optimizing GM model parameters for high rate performance, and an extension to the EM algorithm for densities having bounded support is also presented. The methods are applied to quantization of LPC parameters in speech coding and we present new high rate analysis results for band-limited spectral distortion and outlier statistics. In practical terms, we find that an optimal single-stage VQ can operate at approximately 3 bits less than a state-of-the-art LSF-based 2-split VQ.
引用
收藏
页码:385 / 401
页数:17
相关论文
共 33 条
[1]  
CELEUX G, 1995, 2514 I NAT RECH INF
[2]  
Collura J., 1995, PROC INT C ACOUST SP, V1, P744
[3]  
Cover T. M., 2005, ELEM INF THEORY, DOI 10.1002/047174882X
[4]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[5]   THEORETICAL-ANALYSIS OF THE HIGH-RATE VECTOR QUANTIZATION OF LPC PARAMETERS [J].
GARDNER, WR ;
RAO, BD .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (05) :367-381
[6]  
GERSHO A, 1979, IEEE T INFORM THEORY, V25, P373, DOI 10.1109/TIT.1979.1056067
[7]  
GERSHO A, 1991, VECTOR ORG SIGNAL CO
[8]   DISTANCE MEASURES FOR SPEECH PROCESSING [J].
GRAY, AH ;
MARKEL, JD .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1976, 24 (05) :380-391
[9]  
Gray R. M., 1990, Source Coding Theory
[10]   Quantization, classification, and density estimation for Kohonen's Gaussian mixture [J].
Gray, RM ;
Perlmutter, KO ;
Olshen, RA .
DCC '98 - DATA COMPRESSION CONFERENCE, 1998, :63-72