Stochastic complexity in learning

被引:20
作者
Rissanen, J
机构
[1] IBM Almaden Research Center, San Jose
关键词
D O I
10.1006/jcss.1997.1501
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This is an expository paper on the latest results in the theory of stochastic complexity and the associated MDL principle with special interest in modeling problems arising in machine learning. As an illustration we discuss the problem of designing MDL decision trees, which are meant to improve the earlier designs in two ways: First, by use of the sharper formula for the stochastic complexity at the nodes the earlier found tendency of getting too small trees appears to be overcome. Second, a dynamic programming-based pruning algorithm is described for finding the optimal trees, which generalizes an algorithm described in R. Nohre (Ph.D, thesis Linkoping University, 1994). (C) 1997 Academic Press.
引用
收藏
页码:89 / 95
页数:7
相关论文
共 20 条
[2]   ON LENGTH OF PROGRAMS FOR COMPUTING FINITE BINARY SEQUENCES [J].
CHAITIN, GJ .
JOURNAL OF THE ACM, 1966, 13 (04) :547-+
[3]   UNIVERSAL NOISELESS CODING [J].
DAVISSON, LD .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1973, 19 (06) :783-795
[4]   3 APPROACHES TO QUANTITATIVE DEFINITION OF INFORMATION [J].
KOLMOGOROV, AN .
INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 1968, 2 (02) :157-+
[5]   THE PERFORMANCE OF UNIVERSAL ENCODING [J].
KRICHEVSKY, RE ;
TROFIMOV, VK .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1981, 27 (02) :199-207
[6]  
LEHTOKANGAS M, P INT JOINT C NEUR N
[7]  
Li M., 1993, An introduction to Kolmogorov complexity and its applications
[8]  
NOHRE R, 1994, THESIS LINKOPING U
[9]  
QUINLAN JR, 1989, INFORM COMPUT, V80, P227, DOI 10.1016/0890-5401(89)90010-2
[10]   UNIVERSAL CODING, INFORMATION, PREDICTION, AND ESTIMATION [J].
RISSANEN, J .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1984, 30 (04) :629-636