Provably safe and robust learning-based model predictive control

被引:365
作者
Aswani, Anil [1 ]
Gonzalez, Humberto [2 ]
Sastry, S. Shankar [1 ]
Tomlin, Claire [1 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] Washington Univ, St Louis, MO 63130 USA
基金
美国国家科学基金会;
关键词
Predictive control; Statistics; Robustness; Safety analysis; Learning control; PIECEWISE-CONSTANT REFERENCES; DISCRETE-TIME-SYSTEMS; TO-STATE STABILITY; REACHABILITY ANALYSIS; ADAPTIVE-CONTROL; INVARIANT-SETS; MPC; COMPUTATION; CONSISTENCY; ALGORITHM;
D O I
10.1016/j.automatica.2013.02.003
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Controller design faces a trade-off between robustness and performance, and the reliability of linear controllers has caused many practitioners to focus on the former. However, there is renewed interest in improving system performance to deal with growing energy constraints. This paper describes a learning-based model predictive control (LBMPC) scheme that provides deterministic guarantees on robustness, while statistical identification tools are used to identify richer models of the system in order to improve performance; the benefits of this framework are that it handles state and input constraints, optimizes system performance with respect to a cost function, and can be designed to use a wide variety of parametric or nonparametric statistical tools. The main insight of LBMPC is that safety and performance can be decoupled under reasonable conditions in an optimization framework by maintaining two models of the system. The first is an approximate model with bounds on its uncertainty, and the second model is updated by statistical methods. LBMPC improves performance by choosing inputs that minimize a cost subject to the learned dynamics, and it ensures safety and robustness by checking whether these same inputs keep the approximate model stable when it is subject to uncertainty. Furthermore, we show that if the system is sufficiently excited, then the LBMPC control action probabilistically converges to that of an MPC computed using the true dynamics. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1216 / 1226
页数:11
相关论文
共 75 条
[1]   Autonomous Helicopter Aerobatics through Apprenticeship Learning [J].
Abbeel, Pieter ;
Coates, Adam ;
Ng, Andrew Y. .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2010, 29 (13) :1608-1639
[2]   Robust adaptive MPC for constrained uncertain nonlinear systems [J].
Adetola, Veronica ;
Guay, Martin .
INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2011, 25 (02) :155-167
[3]   Robust reinforcement learning control using integral quadratic constraints for recurrent neural networks [J].
Anderson, Charles W. ;
Young, Peter Michael ;
Buehner, Michael R. ;
Knight, James N. ;
Bush, Keith A. ;
Hittle, Douglas C. .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2007, 18 (04) :993-1002
[4]  
[Anonymous], 2008 AIAA GUID NAV C
[5]  
[Anonymous], 1998, Variational Analysis
[6]  
[Anonymous], 2004, MULTIPARAMETRIC TOOL
[7]  
[Anonymous], 1977, SOLUTION ILL POSED P
[8]  
[Anonymous], TECHNOMETRICS
[9]  
Asarin E, 2003, LECT NOTES COMPUT SC, V2623, P20
[10]  
Asarin E, 2000, LECT NOTES COMPUT SC, V1790, P20