Fast segmentation of the JPEG compressed documents

被引:11
作者
de Queiroz, RL [1 ]
Eschbach, R [1 ]
机构
[1] Xerox Corp, Webster, NY 14580 USA
关键词
D O I
10.1117/1.482607
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We present a novel technique for segmentation of a JPEG-compressed document based on block activity. The activity is measured as the number of bits spent to encode each block. Each number is mapped to a pixel brightness value in an auxiliary image which is then used for segmentation. We introduce the use of such an image and show an example of a simple segmentation algorithm, which was successfully applied to test documents. The document is segmented into characteristics regions labeled as background, halftones, text, graphics, and continuous tone images. The key feature of the proposed framework is that the desired region can be identified and cropped (or removed) from the compressed data without decompressing the image. (C) 1998 SPIE and IS&T. [S1017-9909(98)01002-2].
引用
收藏
页码:367 / 377
页数:11
相关论文
共 13 条
[1]  
[Anonymous], 1993, JPEG still image compression standard
[2]  
DEQUEIROZ R, IN PRESS IEEE T IMAG
[3]  
Dougherty ER, 1992, INTRO MORPHOLOGIC TT, VTT9
[4]  
DUNN D, 1996, P INT C IM PROC LAUS, V2, P225
[5]  
ESCHBACH R, Patent No. 5521718
[6]  
FAN Z, Patent No. 5495538
[7]   Segmentation of scanned documents for efficient compression [J].
Fung, HT ;
Parker, KJ .
VISUAL COMMUNICATIONS AND IMAGE PROCESSING '96, 1996, 2727 :701-712
[8]  
MURATA K, Patent No. 5535013
[9]   PAGE SEGMENTATION AND CLASSIFICATION [J].
PAVLIDIS, T ;
ZHOU, JY .
CVGIP-GRAPHICAL MODELS AND IMAGE PROCESSING, 1992, 54 (06) :484-496
[10]  
Rao K. R., 2014, Discrete cosine transform: algorithms, advantages, applications