High quality document image compression with "DjVU"

被引:160
作者
Bottou, L [1 ]
Haffner, P [1 ]
Howard, PG [1 ]
Simard, P [1 ]
Bengio, Y [1 ]
LeCun, Y [1 ]
机构
[1] AT&T Labs Res, Red Bank, NJ 07701 USA
关键词
D O I
10.1117/1.482609
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We present a new image compression technique called "DjVu" that is specifically geared towards the compression of high-resolution, high-quality images of scanned documents in color. This enables fast transmission of document images over low-speed connections, while faithfully reproducing the visual aspect of the document, including color, fonts, pictures, and paper texture. The DjVu compressor separates the text and drawings, which need a high spatial resolution, from the pictures and backgrounds, which are smoother and can be coded at a lower spatial resolution. Then, several novel techniques are used to maximize the compression ratio: the bi-level foreground image is encoded with AT&T's proposal to the new JBIG2 fax standard and a new wavelet-based compression method is used for the backgrounds and pictures; Both techniques use a new adaptive binary arithmetic coder called the ZP-coder. A typical magazine page in color at 300 dpi (dots per inch) can be compressed down to between 40 and 60 kbytes, approximately 5-10 times better than JPEG for a similar level of subjective quality. A real-time, memory efficient version of the decoder was implemented, and is available as a plug-in for popular web browsers. (C) 1998 SPIE and IS&T. [S1017-9909(98)02803-7].
引用
收藏
页码:410 / 425
页数:16
相关论文
共 27 条
[1]  
Adelson E. H., 1987, Proceedings of the SPIE - The International Society for Optical Engineering, V845, P50, DOI 10.1117/12.976485
[2]  
[Anonymous], P INT C COMP SYST SI
[3]  
[Anonymous], 1994, MANAGING GIGABYTES C
[4]   MEANS FOR ACHIEVING A HIGH DEGREE OF COMPACTION ON SCAN-DIGITIZED PRINTED TEXT [J].
ASCHER, RN ;
NAGY, G .
IEEE TRANSACTIONS ON COMPUTERS, 1974, C 23 (11) :1174-1179
[5]   The Z-Coder adaptive binary coder [J].
Bottou, L ;
Howard, PG ;
Bengio, Y .
DCC '98 - DATA COMPRESSION CONFERENCE, 1998, :13-22
[6]  
BOTTOU L, 1995, ADV NEURAL INFORMATI, V7
[7]   RUN-LENGTH ENCODINGS [J].
GOLOMB, SW .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1966, 12 (03) :399-+
[8]  
HOLT MJ, 1986, ICL TECH J
[9]   Text image compression using soft pattern matching [J].
Howard, PG .
COMPUTER JOURNAL, 1997, 40 (2-3) :146-156
[10]   ARITHMETIC CODING FOR DATA-COMPRESSION [J].
HOWARD, PG ;
VITTER, JS .
PROCEEDINGS OF THE IEEE, 1994, 82 (06) :857-865