The Burrows-Wheeler transform for block sorting text compression: Principles and improvements

被引:69
作者
Fenwick, PM
机构
[1] Department of Computer Science, University of Auckland, Auckland
关键词
D O I
10.1093/comjnl/39.9.731
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
A recent development in text compression is a 'block sorting' algorithm which permutes the input text according to a special sort procedure and then processes the permuted text with Move-To-Front (MTF) and a final statistical compressor, The technique combines good speed with excellent compression performance, This paper investigates the fundamental operation of the algorithm and presents some improvements based on that analysis, Although block sorting is clearly related to previous compression techniques, it appears that it is best described by techniques derived from work by Shannon on the prediction and entropy of English text, A simple model is developed which relates the compression to the proportion of zeros after the MTF stage.
引用
收藏
页码:731 / 740
页数:10
相关论文
共 14 条
[1]  
ARNAVUT Z, 1996, IEEE DAT COMPR C DCC
[2]   A LOCALLY ADAPTIVE DATA-COMPRESSION SCHEME [J].
BENTLEY, JL ;
SLEATOR, DD ;
TARJAN, RE ;
WEI, VK .
COMMUNICATIONS OF THE ACM, 1986, 29 (04) :320-330
[3]  
BLOOM C, 1996, IEEE DAT COMPR C DCC
[4]  
Burrows M., 1994, BLOCK SORTING LOSSLE, DOI 10.1.1.37.6774
[5]  
CLEARY JG, 1995, IEEE DATA COMPRESSIO, P52
[6]  
FENWICK PM, 1996, ACSC 96 P 19 AUSTR C, P193
[7]  
HOWARD P, 1993, IEEE P DATA COMPRESS, P98
[8]  
LELEWER DA, 1991, IEEE DAT COMPR C DCC, P313
[9]   IMPLEMENTING THE PPM DATA-COMPRESSION SCHEME [J].
MOFFAT, A .
IEEE TRANSACTIONS ON COMMUNICATIONS, 1990, 38 (11) :1917-1921
[10]  
NELSON N, 1996, DR DOBBS J SEP, P46