THEORY AND PRACTICE OF VECTOR QUANTIZERS TRAINED ON SMALL TRAINING SETS

被引:16
作者
COHN, D
RISKIN, EA
LADNER, R
机构
[1] UNIV WASHINGTON, DEPT COMP SCI & ENGN, SEATTLE, WA 98195 USA
[2] UNIV WASHINGTON, DEPT ELECT ENGN, SEATTLE, WA 98195 USA
基金
美国国家科学基金会;
关键词
IMAGE CODING; LEARNING THEORY; VAPNIK-CHERVONENKIS DIMENSION; VECTOR QUANTIZATION;
D O I
10.1109/34.273717
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We examine how the performance of a memoryless vector quantizer changes as a function of its training set size. Specifically, we study how well the training set distortion predicts test distortion when the training set is a randomly drawn subset of blocks from the test or training image(s). Using the Vapnik-Chervonenkis (VC) dimension, we derive formal bounds for the difference of test and training distortion of vector quantizer codebooks. We then describe extensive empirical simulations that test these bounds for a variety of codebook sizes and vector dimensions, and give practical suggestions for determining the training set size necessary to achieve good generalization from a codebook. We conclude that, by using training sets comprised of only a small fraction of the available data, one can produce results that are close to the results obtainable when all available data are used.
引用
收藏
页码:54 / 65
页数:12
相关论文
共 20 条
[1]   LEARNABILITY AND THE VAPNIK-CHERVONENKIS DIMENSION [J].
BLUMER, A ;
EHRENFEUCHT, A ;
HAUSSLER, D ;
WARMUTH, MK .
JOURNAL OF THE ACM, 1989, 36 (04) :929-965
[2]   HOW TIGHT ARE THE VAPNIK-CHERVONENKIS BOUNDS [J].
COHN, D ;
TESAURO, G .
NEURAL COMPUTATION, 1992, 4 (02) :249-269
[3]  
COHN D, 1992, THESIS U WASHINGTON
[4]  
COSMAN P, 1991, 25 AS C SIGN SYST CO, P434
[5]  
Crutchfield J. P., 1990, COMPLEXITY ENTROPY P, P223
[6]  
Floyd. R. W., 1975, SID DIGEST, P36
[7]  
Gersho A., 1992, VECTOR QUANTIZATION
[8]  
Gray R. M., 1984, IEEE ASSP Magazine, V1, P4, DOI 10.1109/MASSP.1984.1162229
[9]  
ITAKURA F, 1968, 6TH P INT C AC TOK, pC17
[10]   ALGORITHM FOR VECTOR QUANTIZER DESIGN [J].
LINDE, Y ;
BUZO, A ;
GRAY, RM .
IEEE TRANSACTIONS ON COMMUNICATIONS, 1980, 28 (01) :84-95