Learning from uniformly ergodic Markov chains

被引:30
作者
Zou, Bin [1 ,2 ]
Zhang, Hai [2 ]
Xu, Zongben [2 ]
机构
[1] Hubei Univ, Fac Math & Comp Sci, Wuhan 430062, Peoples R China
[2] Xi An Jiao Tong Univ, Fac Sci, Inst Informat & Syst Sci, Inst Informat & Syst Sci, Xian 710049, Peoples R China
关键词
ERM algorithms; Uniform ergodic Markov chain samples; Generalization bound; Uniform convergence; Relative uniform convergence; DEPENDENT OBSERVATIONS; MIXING SEQUENCES;
D O I
10.1016/j.jco.2009.01.001
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Evaluation for generalization performance of learning algorithms has been the main thread of machine learning theoretical research. The previous bounds describing the generalization performance of the empirical risk minimization (ERM) algorithm are usually established based on independent and identically distributed (i.i.d.) samples. In this paper we go far beyond this classical framework by establishing the generalization bounds of the ERM algorithm with uniformly ergodic Markov chain (u.e.M.c.) samples. We prove the bounds on the rate of uniform convergence/relative uniform convergence of the ERM algorithm with u.e.M.c. samples, and show that the ERM algorithm with u.e.M.c. samples is consistent. The established theory underlies application of ERM type of learning algorithms. (C) 2009 Elsevier Inc. All rights reserved.
引用
收藏
页码:188 / 200
页数:13
相关论文
共 24 条
[1]   Mixing times for uniformly ergodic Markov chains [J].
Aldous, D ;
Lovasz, L ;
Winkler, P .
STOCHASTIC PROCESSES AND THEIR APPLICATIONS, 1997, 71 (02) :165-185
[2]   An introduction to MCMC for machine learning [J].
Andrieu, C ;
de Freitas, N ;
Doucet, A ;
Jordan, MI .
MACHINE LEARNING, 2003, 50 (1-2) :5-43
[3]  
[Anonymous], APPL STAT, DOI DOI 10.2307/2347565
[4]  
[Anonymous], 2013, A Probabilistic Theory of Pattern Recognition
[5]  
Asmussen S., 1992, ACM Transactions on Modeling and Computer Simulation, V2, P130, DOI 10.1145/137926.137932
[6]   New approaches to statistical learning theory [J].
Bousquet, O .
ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2003, 55 (02) :371-389
[7]  
Chen DR, 2004, J MACH LEARN RES, V5, P1143
[8]  
Cucker F, 2007, C MO AP C M, P1, DOI 10.1017/CBO9780511618796
[9]   Best choices for regularization parameters in learning theory: On the bias-variance problem [J].
Cucker, F ;
Smale, S .
FOUNDATIONS OF COMPUTATIONAL MATHEMATICS, 2002, 2 (04) :413-428
[10]  
Cucker F, 2002, B AM MATH SOC, V39, P1