基于强化学习的机器人路径规划算法

被引：138

作者：

张福海

李宁

袁儒鹏

付宜利

机构：

[1] 哈尔滨工业大学机器人技术与系统国家重点实验室

来源：

华中科技大学学报(自然科学版) | 2018年 / 46卷 / 12期

基金：

黑龙江省自然科学基金;

关键词：

移动机器人; 强化学习; 路径规划; 连续报酬函数; 导航实验;

D O I：

10.13245/j.hust.181212

中图分类号：

TP242 [机器人];

学科分类号：

140102 [集成电路设计与设计自动化];

摘要：

提出了一种基于强化学习的机器人路径规划算法,该算法将激光雷达所获取的移动机器人周围障碍物信息与目标点所在方位信息离散成有限个状态,进而合理地设计环境模型与状态空间数目;设计了一种连续的报酬函数,使得机器人采取的每一个动作都能获得相应的报酬,提高了算法训练效率.最后在Gazebo中建立仿真环境,对该智能体进行学习训练,训练结果验证了算法的有效性;同时在实际机器人上进行导航实验,实验结果表明该算法在实际环境中也能够完成导航任务.

引用

页码：65 / 70

页数：6

共 6 条

[1]

Neural networks based reinforcement learning for mobile robots obstacle avoidance.[J].Mihai Duguleana;Gheorghe Mogan.Expert Systems With Applications.2016,

[2]

An adaptive genetic algorithm for robot motion planning in 2D complex environments.[J].Amir Hossein Karami;Maryam Hasanzadeh.Computers and Electrical Engineering.2015,

[3]

Spatial semantic hybrid map building and application of mobile service robot.[J].Hao Wu;Guo-hui Tian;Yan Li;Feng-yu Zhou;Peng Duan.Robotics and Autonomous Systems.2013,

[4]

Dynamic path planning of mobile robots with improved genetic algorithm [J].

Tuncer, Adem ;

Yildirim, Mehmet .

COMPUTERS & ELECTRICAL ENGINEERING, 2012, 38 (06) :1564-1572

[5]

Modifications of VFH Navigation Methods for Mobile Robots.[J]..Procedia Engineering.2012,

[6]

Reinforcement based mobile robot navigation in dynamic environment.[J].Mohammad Abdel Kareem Jaradat;Mohammad Al-Rousan;Lara Quadan.Robotics and Computer Integrated Manufacturing.2010, 1

← 1 →