Action-based sensor space segmentation for soccer robot learning

Cited by: 9
Authors
Asada, M [1 ]
Noda, S [1 ]
Hosoda, K [1 ]
Affiliations
[1] Osaka Univ, Dept Mech Engn Comp Controlled Machinery, Suita, Osaka 565, Japan
DOI
10.1080/088395198117802
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Robot learning, such as reinforcement learning, generally needs a well-defined state space in order to converge. However, building such a state space is one of the main issues in robot learning because of the interdependence between state and action spaces, which resembles the well-known "chicken and egg" problem. This article proposes a method of action-based state space construction for vision-based mobile robots. The basic ideas for coping with the interdependence are that we define a state as a cluster of input vectors from which the robot can reach the goal state, or a state already obtained, by a sequence of one kind of action primitive regardless of its length, and that this sequence is defined as one action. To realize these ideas, we need a large amount of data (experiences) from the robot, and we must cluster the input vectors as hyper-ellipsoids so that the whole state space is segmented into a state transition map in terms of actions, from which the optimal action sequence is obtained. To show the validity of the method, we apply it to a soccer robot that tries to shoot a ball into a goal. Simulation and real experiments are shown.
Pages: 149-164
Number of pages: 16