Incremental online sparsification for model learning in real-time robot control

被引：52

作者：

Nguyen-Tuong, Duy ^{[1
]}

Peters, Jan ^{[1
]}

机构：

[1] Max Planck Inst Biol Cybernet, D-72076 Tubingen, Germany

来源：

NEUROCOMPUTING | 2011年 / 74卷 / 11期

关键词：

Sparse data; Machine learning; Real-time online model learning; Inverse dynamics; Robot control;

D O I：

10.1016/j.neucom.2010.06.033

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

For many applications such as compliant, accurate robot tracking control, dynamics models learned from data can help to achieve both compliant control performance as well as high tracking quality. Online learning of these dynamics models allows the robot controller to adapt itself to changes in the dynamics (e.g., due to time-variant nonlinearities or unforeseen loads). However, online learning in real-time applications - as required in control - cannot be realized by straightforward usage of off-the-shelf machine learning methods such as Gaussian process regression or support vector regression. In this paper, we propose a framework for online, incremental sparsification with a fixed budget designed for fast real-time model learning. The proposed approach employs a sparsification method based on an independence measure. In combination with an incremental learning approach such as incremental Gaussian process regression, we obtain a model approximation method which is applicable in real-time online learning. It exhibits competitive learning accuracy when compared with standard regression techniques. Implementation on a real Barrett WAM robot demonstrates the applicability of the approach in real-time online model learning for real world systems. (C) 2011 Elsevier B.V. All rights reserved.

引用

页码：1859 / 1867

页数：9

共 19 条

[1] [Anonymous], ADV NEURAL INFORM PR
[2] [Anonymous], 2006, ROBOT DYNAMICS CONTR
[3] [Anonymous], 2007, Robotics: Science and Systems
[4] Evaluation of parametric and nonparametric nonlinear adaptive controllers
Burdet, E
Codourey, A
[J]. ROBOTICA, 1998, 16 : 59 - 73
[5] The kernel recursive least-squares algorithm
Engel, Y
Mannor, S
Meir, R
[J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2004, 52 (08) : 2275 - 2285
[6] Hastie T., 2001, ELEMENTS STAT LEARNI
[7] LIU Y, 2009, J PROCESS CONTR, P181
[8] Feedback error learning and nonlinear adaptive control
Nakanishi, J
Schaal, S
[J]. NEURAL NETWORKS, 2004, 17 (10) : 1453 - 1465
[9] Nguyen-Tuong D., 2008, ADV NEURAL INFORM PR
[10] Nguyen-Tuong D., 2008, P 2008 AM CONTR C AC

← 1 2 →