Nonmonotonic generalization bias of gaussian mixture models

被引:12
作者
Akaho, S [1 ]
Kappen, HJ
机构
[1] Informat Sci Div, Electrotech Lab, Tsukuba, Ibaraki 3058568, Japan
[2] Catholic Univ Nijmegen, Dept Med Phys & Biophys, RWCP Theoret Fdn SNN, NL-6525 EZ Nijmegen, Netherlands
关键词
D O I
10.1162/089976600300015439
中图分类号
TP18 [人工智能理论];
学科分类号
081104 [模式识别与智能系统]; 0812 [计算机科学与技术]; 0835 [软件工程]; 1405 [智能科学与技术];
摘要
Theories of learning and generalization hold that the generalization bias, defined as the difference between the training error and the generalization error, increases on average with the number of adaptive parameters. This article, however, shows that this general tendency is violated for a gaussian mixture model. For temperatures just below the first symmetry breaking point, the effective number of adaptive parameters increases and the generalization bias decreases. We compute the dependence of the neural information criterion on temperature around the symmetry breaking. Our results are confirmed by numerical cross-validation experiments.
引用
收藏
页码:1411 / 1427
页数:17
相关论文
共 17 条
[1]
A UNIVERSAL THEOREM ON LEARNING-CURVES [J].
AMARI, SI .
NEURAL NETWORKS, 1993, 6 (02) :161-166
[2]
SCALING LAWS IN LEARNING OF CLASSIFICATION TASKS [J].
BARKAI, N ;
SEUNG, HS ;
SOMPOLINSKY, H .
PHYSICAL REVIEW LETTERS, 1993, 70 (20) :3167-3170
[4]
MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[5]
Jordan M.I., 1991, ADV NEURAL INFORMATI, V3, P767
[6]
HIERARCHICAL MIXTURES OF EXPERTS AND THE EM ALGORITHM [J].
JORDAN, MI ;
JACOBS, RA .
NEURAL COMPUTATION, 1994, 6 (02) :181-214
[7]
KAPPEN B, 1993, P INT C ART NEUR NET, P521
[8]
DETERMINISTIC LEARNING RULES FOR BOLTZMANN MACHINES [J].
KAPPEN, HJ .
NEURAL NETWORKS, 1995, 8 (04) :537-548
[9]
MOODY JE, 1992, ADV NEUR IN, V4, P847
[10]
NETWORK INFORMATION CRITERION - DETERMINING THE NUMBER OF HIDDEN UNITS FOR AN ARTIFICIAL NEURAL-NETWORK MODEL [J].
MURATA, N ;
YOSHIZAWA, S ;
AMARI, S .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (06) :865-872