Unbounded length contexts for PPM

被引：98

作者：

Cleary, JG

Teahan, WJ

机构：

[1] Department of Computer Science, University of Waikato, Hamilton

来源：

COMPUTER JOURNAL | 1997年 / 40卷 / 2-3期

关键词：

D O I：

10.1093/comjnl/40.2_and_3.67

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

The PPM data compression scheme has set the performance standard in lossless compression of text throughout the past decade, PPM is a finite context statistical modelling technique that can be viewed as blending together several fixed-order context models to predict the next character in the input sequence, This paper gives a brief introduction to PPM, and describes a variant of the algorithm, called PPM*, which exploits contexts of unbounded length. Although requiring considerably greater computational resources (in both time and space), this reliably achieves compression superior to the benchmark PPMC version, Its major contribution is that it shows that the full. information available by considering all substrings of the input string can be used effectively to generate high-quality predictions, Hence, it provides a useful tool for exploring the bounds of compression.

引用

页码：67 / 75

页数：9

共 25 条

[1] ABERG J, 1997, P DAT COMPR C LOS AL
[2] A NOTE ON THE DMC DATA-COMPRESSION SCHEME
BELL, T
MOFFAT, A
[J]. COMPUTER JOURNAL, 1989, 32 (01) : 16 - 20
[3] Bell T. C., 1990, TEXT COMPRESSION
[4] BLOOM C, 1996, PPMZ
[5] Bunton S., 1996, THESIS U WASHINGTON
[6] BUNTON S, 1997, IN PRESS COMP J, V40
[7] Burrows M., 1994, BLOCK SORTING LOSSLE, DOI 10.1.1.37.6774
[8] Cleary J. G., 1995, Proceedings. DCC '95 Data Compression Conference (Cat. No.95TH8037), P52, DOI 10.1109/DCC.1995.515495
[9] DATA-COMPRESSION USING ADAPTIVE CODING AND PARTIAL STRING MATCHING
CLEARY, JG
WITTEN, IH
[J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 1984, 32 (04) : 396 - 402
[10] DATA-COMPRESSION WITH FINITE WINDOWS
FIALA, ER
GREENE, DH
[J]. COMMUNICATIONS OF THE ACM, 1989, 32 (04) : 490 - 505

← 1 2 3 →