On the average redundancy rate of the Lempel-Ziv code

被引:55
作者
Louchard, G [1 ]
Szpankowski, W [1 ]
机构
[1] PURDUE UNIV, DEPT COMP SCI, W LAFAYETTE, IN 47907 USA
基金
美国国家科学基金会;
关键词
average redundancy rate; Lempel-Ziv parsing scheme; data compression; generalized Lempel-Ziv scheme; digital search trees; pattern matching; analytical techniques of the precise analysis of algorithms;
D O I
10.1109/18.567640
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we settle a long-standing open problem concerning the average redundancy r(n) of the Lempel-Ziv'78 (LZ78) code, We prove that for a memoryless source the average redundancy rate attains asymptotically Er-n=(A+delta(n))/log N+ O(log log n/log(2) n), where A is an explicitly given constant that depends on source characteristics, and delta(x) is a fluctuating function with a small amplitude, We also derive the leading term for the kth moment of the number of phrases, We conclude by conjecturing a precise formula on the expected redundancy for a Markovian source, The main result of this paper is a consequence of recently obtained second-order properties of the Lempel-Ziv algorithm by Jacquet and Szpankowski, These findings have been established by analytical techniques of the precise analysis of algorithms, We give a brief survey of these results since they are interesting in their own right, and shed some light on the probabilistic behavior of pattern matching based data compression.
引用
收藏
页码:2 / 8
页数:7
相关论文
共 19 条
[1]   A DIFFUSION LIMIT FOR A CLASS OF RANDOMLY-GROWING BINARY-TREES [J].
ALDOUS, D ;
SHIELDS, P .
PROBABILITY THEORY AND RELATED FIELDS, 1988, 79 (04) :509-542
[2]  
[Anonymous], 1980, Gos. Izd-vo Fiz.-Mat. Literatury
[3]  
Billingsley P, 1968, CONVERGE PROBAB MEAS
[4]  
Cover T. M., 2005, ELEM INF THEORY, DOI 10.1002/047174882X
[5]   THE LEMPEL ZIV ALGORITHM AND MESSAGE COMPLEXITY [J].
GILBERT, EN ;
KADOTA, TT .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1992, 38 (06) :1839-1842
[6]   ASYMPTOTIC-BEHAVIOR OF THE LEMPEL-ZIV PARSING SCHEME AND DIGITAL SEARCH-TREES [J].
JACQUET, P ;
SZPANKOWSKI, W .
THEORETICAL COMPUTER SCIENCE, 1995, 144 (1-2) :161-197
[7]   AUTOCORRELATION ON WORDS AND ITS APPLICATIONS - ANALYSIS OF SUFFIX TREES BY STRING-RULER APPROACH [J].
JACQUET, P ;
SZPANKOWSKI, W .
JOURNAL OF COMBINATORIAL THEORY SERIES A, 1994, 66 (02) :237-269
[8]  
Knuth Donald E., 1973, ART COMPUTER PROGRAM, V1
[9]   AVERAGE PROFILE AND LIMITING DISTRIBUTION FOR A PHRASE SIZE IN THE LEMPEL-ZIV PARSING ALGORITHM [J].
LOUCHARD, G ;
SZPANKOWSKI, W .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1995, 41 (02) :478-488
[10]  
LOUCHARD G, IN PRESS SIAM J COMP