Scanned Compound Document Encoding Using Multiscale Recurrent Patterns

被引:17
作者
Francisco, Nelson C. [1 ,2 ]
Rodrigues, Nuno M. M. [2 ,3 ]
da Silva, Eduardo A. B. [1 ]
de Carvalho, Murilo Bresciani [4 ]
de Faria, Sergio M. M. [2 ,3 ]
Silva, Vitor M. M. [5 ,6 ]
机构
[1] Univ Fed Rio de Janeiro, PEE, COPPE, DEL Poli, BR-21945970 Rio De Janeiro, Brazil
[2] Inst Telecomunicacoes, P-2411901 Leiria, Portugal
[3] ESTG, Inst Polytech Leiria, P-2411901 Leiria, Portugal
[4] Univ Fed Fluminense, TET, CTC, BR-24210240 Niteroi, RJ, Brazil
[5] Univ Coimbra Polo II, Inst Telecomunicacoes, P-3030290 Coimbra, Portugal
[6] Univ Coimbra Polo II, Dep Engn Electrotecn & Comp, P-3030290 Coimbra, Portugal
关键词
Adaptive pattern matching; compound images; dictionary based coding; image coding; scanned document compression; vector quantization; IMAGE COMPRESSION; MATCHING IMAGE; SEGMENTATION;
D O I
10.1109/TIP.2010.2049181
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a new encoder for scanned compound documents, based upon a recently introduced coding paradigm called multidimensional multiscale parser (MMP). MMP uses approximate pattern matching, with adaptive multiscale dictionaries that contain concatenations of scaled versions of previously encoded image blocks. These features give MMP the ability to adjust to the input image's characteristics, resulting in high coding efficiencies for a wide range of image types. This versatility makes MMP a good candidate for compound digital document encoding. The proposed algorithm first classifies the image blocks as smooth (texture) and nonsmooth (text and graphics). Smooth and nonsmooth blocks are then compressed using different MMP-based encoders, adapted for encoding either type of blocks. The adaptive use of these two types of encoders resulted in performance gains over the original MMP algorithm, further increasing the performance advantage over the current state-of-the-art image encoders for scanned compound images, without compromising the performance for other image types.
引用
收藏
页码:2712 / 2724
页数:13
相关论文
共 42 条
[1]   2D-pattern matching image and video compression: Theory, algorithms, and experiments [J].
Alzina, M ;
Szpankowski, W ;
Grama, A .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2002, 11 (03) :318-331
[2]  
[Anonymous], 2012, VECTOR QUANTIZATION
[3]   Pattern matching image compression:: Algorithmic and empirical results [J].
Atallah, M ;
Génin, Y ;
Szpankowski, W .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1999, 21 (07) :614-627
[4]  
BOTTOU L, 1998, P DCC98 SNOWB MAR
[5]   Grayscale true two-dimensional dictionary-based image compression [J].
Brittain, Nathanael J. ;
El-Sakka, Mahmoud R. .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2007, 18 (01) :35-44
[6]  
CHAN C, 1995, P IEEE INT C AC SPEE, V4, P2491
[7]  
CHENG D, 2001, J ELECT IMAG, V10
[8]   Multidimensional signal compression using multiscale recurrent patterns [J].
de Carvalho, MB ;
da Silva, EAB ;
Finamore, WA .
SIGNAL PROCESSING, 2002, 82 (11) :1559-1580
[9]   Universal image compression using multiscale recurrent patterns with adaptive probability model [J].
de Lima Filho, Eddie Batista ;
da Silva, Eduardo A. B. ;
de Carvalho, Murilo Bresciani ;
Pinage, Frederico Silva .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2008, 17 (04) :512-527
[10]   Optimizing block-thresholding segmentation for multilayer compression of compound images [J].
de Queiroz, RL ;
Fan, ZG ;
Tran, TD .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2000, 9 (09) :1461-1471