Unbounded length contexts for PPM

被引:98
作者
Cleary, JG
Teahan, WJ
机构
[1] Department of Computer Science, University of Waikato, Hamilton
关键词
D O I
10.1093/comjnl/40.2_and_3.67
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The PPM data compression scheme has set the performance standard in lossless compression of text throughout the past decade, PPM is a finite context statistical modelling technique that can be viewed as blending together several fixed-order context models to predict the next character in the input sequence, This paper gives a brief introduction to PPM, and describes a variant of the algorithm, called PPM*, which exploits contexts of unbounded length. Although requiring considerably greater computational resources (in both time and space), this reliably achieves compression superior to the benchmark PPMC version, Its major contribution is that it shows that the full. information available by considering all substrings of the input string can be used effectively to generate high-quality predictions, Hence, it provides a useful tool for exploring the bounds of compression.
引用
收藏
页码:67 / 75
页数:9
相关论文
共 25 条