Compression of facial images using the K-SVD algorithm

被引:253
作者
Bryt, Ori [2 ]
Elad, Michael [1 ]
机构
[1] Technion Israel Inst Technol, Dept Comp Sci, IL-32000 Technion, Haifa, Israel
[2] Technion Israel Inst Technol, Dept Elect Engn, IL-32000 Haifa, Israel
基金
以色列科学基金会;
关键词
image compression; sparse representations; redundancy; K-SVD; OMP; facial images; PCA; JPEG; JPEG2000; VQ;
D O I
10.1016/j.jvcir.2008.03.001
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The use of sparse representations in signal and image processing is gradually increasing in the past several years. Obtaining an overcomplete dictionary from a set of signals allows us to represent them as a sparse linear combination of dictionary atoms. Pursuit algorithms are then used for signal decomposition. A recent work introduced the K-SVD algorithm, which is a novel method for training overcomplete dictionaries that lead to sparse signal representation. In this work we propose a new method for compressing facial images, based on the K-SVD algorithm. We train K-SVD dictionaries for predefined image patches, and compress each new image according to these dictionaries. The encoding is based on sparse coding of each image patch using the relevant trained dictionary, and the decoding is a simple reconstruction of the patches by linear combination of atoms. An essential pre-process stage for this method is an image alignment procedure, where several facial features are detected and geometrically warped into a canonical spatial location. We present this new method, analyze its results and compare it to several competing compression techniques. (c) 2008 Published by Elsevier Inc.
引用
收藏
页码:270 / 282
页数:13
相关论文
共 34 条
[1]   On the uniqueness of overcomplete dictionaries, and a practical way to retrieve them [J].
Aharon, Michal ;
Elad, Michael ;
Bruckstein, Alfred M. .
LINEAR ALGEBRA AND ITS APPLICATIONS, 2006, 416 (01) :48-67
[2]   K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation [J].
Aharon, Michal ;
Elad, Michael ;
Bruckstein, Alfred .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (11) :4311-4322
[3]   THE LAPLACIAN PYRAMID AS A COMPACT IMAGE CODE [J].
BURT, PJ ;
ADELSON, EH .
IEEE TRANSACTIONS ON COMMUNICATIONS, 1983, 31 (04) :532-540
[4]  
Chen SSB, 2001, SIAM REV, V43, P129, DOI [10.1137/S003614450037906X, 10.1137/S1064827596304010]
[5]   Vector quantization of image subbands: A survey [J].
Cosman, PC ;
Gray, RM ;
Vetterli, M .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 1996, 5 (02) :202-225
[6]   Optimally sparse representation in general (nonorthogonal) dictionaries via l1 minimization [J].
Donoho, DL ;
Elad, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2003, 100 (05) :2197-2202
[7]   Uncertainty principles and ideal atomic decomposition [J].
Donoho, DL ;
Huo, XM .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2001, 47 (07) :2845-2862
[8]  
DONOHO DL, 2007, IEEE T INFORM THEORY, V147, P185
[9]   Low bit-rate compression of facial images [J].
Elad, Michael ;
Goldenberg, Roman ;
Kimmel, Ron .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2007, 16 (09) :2379-2383
[10]   Image denoising via sparse and redundant representations over learned dictionaries [J].
Elad, Michael ;
Aharon, Michal .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2006, 15 (12) :3736-3745