Path planning for mobile robots using an improved reinforcement learning scheme

被引:9
作者
Fujisawa, S [1 ]
Kurozumi, R [1 ]
Yamamoto, T [1 ]
Suita, Y [1 ]
机构
[1] Takamatsu Natl Coll Technol, Dept Electro Mech Syst Engn, Kagawa, Japan
来源
PROCEEDINGS OF THE 2002 IEEE INTERNATIONAL SYMPOSIUM ON INTELLIGENT CONTROL | 2002年
关键词
reinfocement learning; CMAC; mobile robot; path planning; on-line learning;
D O I
10.1109/ISIC.2002.1157740
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The current method for establishing travel routes provides modeled environmental information. However, it is difficult to create an environment model for the environments in which mobile robot travel because the environment changes constantly due to the existence of moving objects, including pedestrians. In this study, we propose a path planning system for mobile robots using reinforcement-learning systems and Cerebellar Model Articulation Controllers (CMACs). We selected the best travel route utilizing these reinforcement-learning systems. When a CMAC learns the value function of Q-Learning, it improves learning speed by utilizing the generalizing action. CMACs enable us to reduce the time needed to select the best travel route. Using simulation and real robots, we performed a path-planning experiment. We report the results of simulation and experiment on traveling by on-line learning.
引用
收藏
页码:67 / 74
页数:8
相关论文
共 14 条
[1]  
Albus J. S., 1975, Transactions of the ASME. Series G, Journal of Dynamic Systems, Measurement and Control, V97, P220, DOI 10.1115/1.3426922
[2]  
Albus J. S., 1975, Transactions of the ASME. Series G, Journal of Dynamic Systems, Measurement and Control, V97, P228, DOI 10.1115/1.3426923
[3]  
ASADA M, 1995, J ROBOTICS SOC JAPAN, V13, P68
[4]  
HOSOKAWA D, 2000, P SICE SYST INT DIV, P23
[5]  
KAEBLING LP, 1996, RECENT ADV REINFORCE
[6]  
KIMURA H, 1997, P 6 EUR WORKSH LEARN, P144
[7]  
KIMURA H, 1995, P 12 INT C MACH LEAR, P295
[8]  
KUROZUMI R, 2000, P 9 SICE CHUG BRANCH, P264
[9]  
KUROZUMI R, 2002, 2002 IEEE WORLD C CO, P1690
[10]  
Miyazaki K., 1997, Journal of Japanese Society for Artificial Intelligence, V12, P811