Universal data compression based on the Burrows-Wheeler transformation: Theory and practice

被引:33
作者
Balkenhol, B
Kurtz, S
机构
[1] Univ Bielefeld, Fak Math, D-33501 Bielefeld, Germany
[2] Univ Bielefeld, Tech Fak, D-33501 Bielefeld, Germany
关键词
lossless data compression; Burrows-Wheeler Transformation; context trees; suffix trees;
D O I
10.1109/12.888040
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
A very interesting recent development in data compression is the Burrows-Wheeler Transformation [1]. The idea is to permute the input sequence in such a way that characters with a similar context are grouped together. We provide a thorough analysis of the Burrows-Wheeler Transformation from an information theoretic point of view. Based on this analysis, the main part of the paper systematically considers techniques to efficiently implement a practical data compression program based on the transformation. We show that our program achieves a better compression rate than other programs that have similar requirements in space and time.
引用
收藏
页码:1043 / 1053
页数:11
相关论文
共 36 条
[11]  
FENWICK P, 1996, 130 U AUCKL DEP COMP
[12]  
GAILLY J, 1993, GZIP PROGRAM VERSION
[13]   A comparison of imperative and purely functional suffix tree constructions [J].
Giegerich, R ;
Kurtz, S .
SCIENCE OF COMPUTER PROGRAMMING, 1995, 25 (2-3) :187-218
[14]   From Ukkonen to McCreight and Weiner: A unifying view of linear-time suffix tree construction [J].
Giegerich, R ;
Kurtz, S .
ALGORITHMICA, 1997, 19 (03) :331-353
[15]  
IRVING RW, 1996, SUFFIX BINARY SEARCH
[16]   THE PERFORMANCE OF UNIVERSAL ENCODING [J].
KRICHEVSKY, RE ;
TROFIMOV, VK .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1981, 27 (02) :199-207
[17]  
Kurtz S, 1999, SOFTWARE PRACT EXPER, V29, P1149, DOI 10.1002/(SICI)1097-024X(199911)29:13<1149::AID-SPE274>3.0.CO
[18]  
2-O
[19]   The context trees of block sorting compression [J].
Larsson, NJ .
DCC '98 - DATA COMPRESSION CONFERENCE, 1998, :189-198
[20]  
Levenstein V.E., 1968, Problems of Cybernetics, V20, P173