Robot docking based on omnidirectional vision and reinforcement learning

Cited by: 10

Authors:

Muse, David [1]

Weber, Cornelius [1]

Wermter, Stefan [1]

Affiliations:

[1] Univ Sunderland, Sch Comp & Technol, Sunderland SR2 7EE, Durham, England

Source:

KNOWLEDGE-BASED SYSTEMS | 2006 / Vol. 19 / Issue 05

Keywords:

reinforcement learning; robot control; robotics; neural networks

DOI:

10.1016/j.knosys.2005.11.018

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

We present a system for visual robotic docking using an omnidirectional camera coupled with the actor-critic reinforcement learning algorithm. The system enables a PeopleBot robot to locate and approach a table so that it can pick an object from it using the pan-tilt camera mounted on the robot. We use a staged approach to solve this problem, as there are distinct subtasks and different sensors are used. The robot first wanders randomly until the table is located via a landmark; a network trained via reinforcement learning then allows the robot to turn towards and approach the table. Once at the table, the robot is to pick the object from it. We argue that our approach has considerable potential, as it allows robot control for navigation to be learned and removes the need for internal maps of the environment. This is achieved by allowing the robot to learn couplings between motor actions and the position of a landmark. (c) 2006 Elsevier B.V. All rights reserved.
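To illustrate the kind of coupling between motor actions and landmark position that the abstract mentions, the following is a minimal tabular actor-critic sketch. It is not the authors' implementation: the abstract describes a trained network, whereas this sketch assumes a small discretised set of landmark positions, and the action names, state count, and learning rates (`alpha`, `beta`, `gamma`) are all hypothetical choices made for illustration.

```python
import math
import random

# Hypothetical motor actions; the record does not list the robot's action set.
ACTIONS = ("turn_left", "turn_right", "forward")


def softmax_choice(prefs, rng):
    """Sample an action index with probability proportional to exp(preference)."""
    m = max(prefs)  # subtract the maximum for numerical stability
    weights = [math.exp(p - m) for p in prefs]
    return rng.choices(range(len(prefs)), weights=weights)[0]


def actor_critic_step(values, prefs, s, a, reward, s_next, done,
                      alpha=0.1, beta=0.1, gamma=0.9):
    """Apply one tabular actor-critic update in place and return the TD error."""
    target = reward if done else reward + gamma * values[s_next]
    delta = target - values[s]
    values[s] += alpha * delta    # critic: move the state value towards the target
    prefs[s][a] += beta * delta   # actor: reinforce the action that was taken
    return delta


# Assumed toy setup: 5 discretised landmark positions, state 2 = landmark centred.
n_states = 5
values = [0.0] * n_states
prefs = [[0.0] * len(ACTIONS) for _ in range(n_states)]
rng = random.Random(0)

a = softmax_choice(prefs[2], rng)
delta = actor_critic_step(values, prefs, s=2, a=a, reward=1.0, s_next=2, done=True)
```

In this sketch a positive TD error raises both the value of the observed landmark position and the preference for the action taken there, so rewarded actions become more likely to be selected from that position on later visits.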

Citation

Pages: 324-332

Page count: 9

References: 21 in total
