Entropy estimation of symbol sequences

被引:198
作者
Schurmann, T
Grassberger, P
机构
[1] Department of Theoretical Physics, University of Wuppertal
关键词
D O I
10.1063/1.166191
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
We discuss algorithms for estimating the Shannon entropy h of finite symbol sequences with long range correlations. In particular, we consider algorithms which estimate h from the code lengths produced by some compression algorithm. Our interest is in describing their convergence with sequence length, assuming no limits for the space and time complexities of the compression algorithms. A scaling law is proposed for extrapolation from finite sample lengths. This is applied to sequences of dynamical systems in non-trivial chaotic regimes, a 1-D cellular automaton, and to written English texts. (C) 1996 American Institute of Physics.
引用
收藏
页码:414 / 427
页数:14
相关论文
共 54 条
[1]   UNIVERSAL SCHEMES FOR PREDICTION, GAMBLING AND PORTFOLIO SELECTION [J].
ALGOET, P .
ANNALS OF PROBABILITY, 1992, 20 (02) :901-941
[2]   LANGUAGE AND CODIFICATION DEPENDENCE OF LONG-RANGE CORRELATIONS IN TEXTS [J].
Amit, M. ;
Shmerler, Y. ;
Eisenberg, E. ;
Abraham, M. ;
Shnerb, N. .
FRACTALS-COMPLEX GEOMETRY PATTERNS AND SCALING IN NATURE AND SOCIETY, 1994, 2 (01) :7-13
[3]  
Bell T. C., 1990, TEXT COMPRESSION
[4]  
Billingsley P., 1965, ERGODIC THEORY INFOR
[5]  
BURTON N G, 1955, Am J Psychol, V68, P650, DOI 10.2307/1418794
[6]  
CASWELL WE, 1986, DIMENSIONS ENTROPIES
[8]   ON LENGTH OF PROGRAMS FOR COMPUTING FINITE BINARY SEQUENCES [J].
CHAITIN, GJ .
JOURNAL OF THE ACM, 1966, 13 (04) :547-+
[9]   CONVERGENT GAMBLING ESTIMATE OF ENTROPY OF ENGLISH [J].
COVER, TM ;
KING, RC .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1978, 24 (04) :413-421
[10]  
COVER TM, 1974, 12 STANF U STAT DEP