Action-based sensor space segmentation for soccer robot learning

被引:9
作者
Asada, M [1 ]
Noda, S [1 ]
Hosoda, K [1 ]
机构
[1] Osaka Univ, Dept Mech Engn Comp Controlled Machinery, Suita, Osaka 565, Japan
关键词
D O I
10.1080/088395198117802
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Robot learning, such as reinforcement learning, generally needs a well-defined state space in order to converge. However, building such a state space is one of the main issue of robot learning because of the interdependence between state and action spaces, which resembles the well-known "chicken and egg" problem. This article proposes a method of action-based state space construction for vision-based mobile robots. Basic ideas to cope with the interdependence are that we define a state as a cluster of input vectors from which the robot can research the goal state or the state already obtained by a sequence of one kind of action primitive regardless of its length, and that this sequence is defined as one action. To realize these ideas, we need many data (experiences) of the robot and we must cluster the input vectors as hyper ellipsoids so that the whole state space is segmented into a state transition map in terms of action from which the optimal action sequence is obtained. To show the validity of the method, we apply it to a soccer robot that tries to shoot a ball into a goal. The simulation and real experiments are shown.
引用
收藏
页码:149 / 164
页数:16
相关论文
共 15 条
[1]  
ASADA M, 1995, IEEE INT CONF ROBOT, P146, DOI 10.1109/ROBOT.1995.525277
[2]  
Chapman D, 1991, IJCAI, V91, P726
[3]  
CONNEL JH, 1993, ROBOT LEARNING
[4]  
Cramer H., 1951, MATH METHODS STAT
[5]  
DUBRAWSKI A, 1994, P IEEE RSJ INT C INT, V2, P1272
[6]  
INOUE H, 1996, ROB RES 7 INT S, P162
[7]  
Ishiguro H, 1996, IROS 96 - PROCEEDINGS OF THE 1996 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS - ROBOTIC INTELLIGENCE INTERACTING WITH DYNAMIC WORLDS, VOLS 1-3, P1496, DOI 10.1109/IROS.1996.569011
[8]  
KITANO H, 1995, IJCAI 95 WORKSH ENT
[9]  
KROSE BJA, 1992, P IEEE RSJ INT C INT, P1327
[10]  
MATARIC MJ, 1994, P 11 INT C MACH LEAR, P181