Reinforcement learning for robot soccer

被引:151
作者
Riedmiller, Martin [1 ]
Gabel, Thomas [1 ]
Hafner, Roland [1 ]
Lange, Sascha [1 ]
机构
[1] Univ Freiburg, Dept Comp Sci, Freiburg, Germany
关键词
Learning mobile robots; Autonomous learning robots; Neural control; RoboCup; Batch reinforcement learning;
D O I
10.1007/s10514-009-9120-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Batch reinforcement learning methods provide a powerful framework for learning efficiently and effectively in autonomous robots. The paper reviews some recent work of the authors aiming at the successful application of reinforcement learning in a challenging and complex domain. It discusses several variants of the general batch learning framework, particularly tailored to the use of multilayer perceptrons to approximate value functions over continuous state spaces. The batch learning framework is successfully used to learn crucial skills in our soccer-playing robots participating in the RoboCup competitions. This is demonstrated on three different case studies.
引用
收藏
页码:55 / 73
页数:19
相关论文
共 45 条
  • [1] [Anonymous], 1957, Dynamic Programming
  • [2] [Anonymous], P FBIT 2007 C JEJ KO
  • [3] Cooperative behavior acquisition for mobile robots in dynamically changing real worlds via vision-based reinforcement learning and development
    Asada, M
    Uchibe, E
    Hosoda, K
    [J]. ARTIFICIAL INTELLIGENCE, 1999, 110 (02) : 275 - 292
  • [4] Bagnell JA, 2001, IEEE INT CONF ROBOT, P1615, DOI 10.1109/ROBOT.2001.932842
  • [5] BEHNKE S, 2003, LNCS, P712
  • [6] Bertsekas Dimitri P, 1996, Neuro-Dynamic Programming
  • [7] CHERNOVA S, 2004, P 2004 IEEE RSJ INT
  • [8] CRITES RH, 1995, ADV NEURAL INFORMATI, V8, P1017
  • [9] Ernst D, 2005, J MACH LEARN RES, V6, P503
  • [10] GABEL T, 2006, P 7 PORT C AUT CONTR