Large-scale typicality of Markov sample paths and consistency of MDL order estimators

被引:51
作者
Csiszár, I [1 ]
机构
[1] Hungarian Acad Sci, A Renyi Inst Math, H-1364 Budapest, Hungary
基金
匈牙利科学研究基金会;
关键词
conditional typicality; empirical distribution; iterated logarithm; Markov chain; minimum description length (MDL); normalized maximum likelihood (NML); order estimation;
D O I
10.1109/TIT.2002.1003842
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
For Markov chains of arbitrary order, with finite alphabet A, almost sure sense limit theorems are proved on relative frequencies of k-blocks, and of symbols preceded by a given k-block, when k is permitted to grow as the sample size n grows. As an application, the consistency of two kinds of minimum description length (MDL) Markov order estimators is proved, with upper bound o(log n), respectively, alpha log n with alpha < 1/log \A\, on the permissible value of the estimated order. It was shown by Csiszar and Shields that in the absence of any bound, or with bound alpha log n with large alpha, consistency falls.
引用
收藏
页码:1616 / 1628
页数:13
相关论文
共 12 条
[1]  
[Anonymous], 1990, THESIS U MARYLAND CO
[2]   The minimum description length principle in coding and modeling [J].
Barron, A ;
Rissanen, J ;
Yu, B .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1998, 44 (06) :2743-2760
[3]  
Barron A. R., 1985, THESIS STANFORD U ST
[4]  
Bühlmann P, 1999, ANN STAT, V27, P480
[5]  
Csiszár I, 2000, ANN STAT, V28, P1601
[6]   DEVIATIONS FROM UNIFORMITY IN RANDOM STRINGS [J].
FLAJOLET, P ;
KIRSCHENHOFER, P ;
TICHY, RF .
PROBABILITY THEORY AND RELATED FIELDS, 1988, 80 (01) :139-150
[7]   THE PERFORMANCE OF UNIVERSAL ENCODING [J].
KRICHEVSKY, RE ;
TROFIMOV, VK .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1981, 27 (02) :199-207
[8]  
Neveu J., 1975, Discrete Parameter Martingales
[9]  
Rissanen J., 1989, STOCHASTIC COMPLEXIT
[10]  
SHTARKOV J, 1977, TOPICS INFORMATION T, P559