Layered approach to learning client behaviors in the RoboCup soccer server

被引:64
作者
Stone, P [1 ]
Veloso, M [1 ]
机构
[1] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA
关键词
D O I
10.1080/088395198117811
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the past few years, multiagent systems (MAS) have emerged as an active subfield of artificial intelligence (Al). Because of the inherent complexity of MAS, there is much interest in using machine learning (ML) techniques to help build multiagent learning. Our approach to using ML as a tool for building Soccer Server clients involves layering increasingly complex learned behaviors. In this article, we describe two levels of learned behaviors. First, the clients learn a low-level individual skill that allows them to control the ball effectively. Then, using this learning skill, they learn a higher level skill that involves multiple players. For both skills, we describe the learning method in detail and report on our extensive empirical testing. We also verify empirically that the learned skills are applicable to game situations.
引用
收藏
页码:165 / 188
页数:24
相关论文
共 26 条
[1]  
*AAAI, 1996, 1996 AAAI SPRING S
[2]  
Asada M, 1996, IROS 96 - PROCEEDINGS OF THE 1996 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS - ROBOTIC INTELLIGENCE INTERACTING WITH DYNAMIC WORLDS, VOLS 1-3, P1502, DOI 10.1109/IROS.1996.569012
[3]   Purposive behavior acquisition for a real robot by vision-based reinforcement learning [J].
Asada, M ;
Noda, S ;
Tawaratsumida, S ;
Hosoda, K .
MACHINE LEARNING, 1996, 23 (2-3) :279-303
[4]   A ROBUST LAYERED CONTROL-SYSTEM FOR A MOBILE ROBOT [J].
BROOKS, RA .
IEEE JOURNAL OF ROBOTICS AND AUTOMATION, 1986, 2 (01) :14-23
[5]  
FORD R, 1994, EXPLOITING NATURAL S
[6]  
GREFENSTETTE J, 1996, 1996 AAAI SPRING S, P45
[7]  
Haynes T., 1996, Adaption and Learning in Multi-Agent Systems. IJCAI '95 Workshop. Proceedings, P113
[8]  
HUBER MJ, 1995, P 1 INT C MULT SYST, P163
[9]  
Kitano H, 1997, AI MAG, V18, P73
[10]  
MATARIC MJ, 1995, ADAPTIVE BEHAV, V4