Incremental online sparsification for model learning in real-time robot control

被引:52
作者
Nguyen-Tuong, Duy [1 ]
Peters, Jan [1 ]
机构
[1] Max Planck Inst Biol Cybernet, D-72076 Tubingen, Germany
关键词
Sparse data; Machine learning; Real-time online model learning; Inverse dynamics; Robot control;
D O I
10.1016/j.neucom.2010.06.033
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For many applications such as compliant, accurate robot tracking control, dynamics models learned from data can help to achieve both compliant control performance as well as high tracking quality. Online learning of these dynamics models allows the robot controller to adapt itself to changes in the dynamics (e.g., due to time-variant nonlinearities or unforeseen loads). However, online learning in real-time applications - as required in control - cannot be realized by straightforward usage of off-the-shelf machine learning methods such as Gaussian process regression or support vector regression. In this paper, we propose a framework for online, incremental sparsification with a fixed budget designed for fast real-time model learning. The proposed approach employs a sparsification method based on an independence measure. In combination with an incremental learning approach such as incremental Gaussian process regression, we obtain a model approximation method which is applicable in real-time online learning. It exhibits competitive learning accuracy when compared with standard regression techniques. Implementation on a real Barrett WAM robot demonstrates the applicability of the approach in real-time online model learning for real world systems. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:1859 / 1867
页数:9
相关论文
共 19 条
  • [1] [Anonymous], ADV NEURAL INFORM PR
  • [2] [Anonymous], 2006, ROBOT DYNAMICS CONTR
  • [3] [Anonymous], 2007, Robotics: Science and Systems
  • [4] Evaluation of parametric and nonparametric nonlinear adaptive controllers
    Burdet, E
    Codourey, A
    [J]. ROBOTICA, 1998, 16 : 59 - 73
  • [5] The kernel recursive least-squares algorithm
    Engel, Y
    Mannor, S
    Meir, R
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2004, 52 (08) : 2275 - 2285
  • [6] Hastie T., 2001, ELEMENTS STAT LEARNI
  • [7] LIU Y, 2009, J PROCESS CONTR, P181
  • [8] Feedback error learning and nonlinear adaptive control
    Nakanishi, J
    Schaal, S
    [J]. NEURAL NETWORKS, 2004, 17 (10) : 1453 - 1465
  • [9] Nguyen-Tuong D., 2008, ADV NEURAL INFORM PR
  • [10] Nguyen-Tuong D., 2008, P 2008 AM CONTR C AC