Probabilistic Model Predictive Safety Certification for Learning-Based Control

被引：54

作者：

Wabersich, Kim J. ^{[1
]}

Hewing, Lukas ^{[1
]}

Carron, Andrea ^{[1
]}

Zeilinger, Melanie N. ^{[1
]}

机构：

[1] Swiss Fed Inst Technol, Inst Dynam Syst & Control, CH-8053 Zurich, Switzerland

来源：

IEEE TRANSACTIONS ON AUTOMATIC CONTROL | 2022年 / 67卷 / 01期

基金：

瑞士国家科学基金会;

关键词：

Safety; Predictive models; Probabilistic logic; Predictive control; Prediction algorithms; Optimization; Uncertainty; reinforcement learning (RL); safety; stochastic systems;

D O I：

10.1109/TAC.2021.3049335

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Reinforcement learning (RL) methods have demonstrated their efficiency in simulation. However, many of the applications for which RL offers great potential, such as autonomous driving, are also safety critical and require a certified closed-loop behavior in order to meet the safety specifications in the presence of physical constraints. This article introduces a concept called probabilistic model predictive safety certification (PMPSC), which can be combined with any RL algorithm and provides provable safety certificates in terms of state and input chance constraints for potentially large-scale systems. The certificate is realized through a stochastic tube that safely connects the current system state with a terminal set of states that is known to be safe. A novel formulation allows a recursively feasible real-time computation of such probabilistic tubes, despite the presence of possibly unbounded disturbances. A design procedure for PMPSC relying on Bayesian inference and recent advances in probabilistic set invariance is presented. Using a numerical car simulation, the method and its design procedure are illustrated by enhancing an RL algorithm with safety certificates.

引用

页码：176 / 188

页数：13

共 64 条

[1] Probabilistic reachability and safety for controlled discrete time stochastic hybrid systems
Abate, Alessandro
Prandini, Maria
Lygeros, John
Sastry, Shankar
[J]. AUTOMATICA, 2008, 44 (11) : 2724 - 2734
[2] Achiam J, 2017, PR MACH LEARN RES, V70
[3] Agrawal A., 2019, Advances in Neural Information Processing Systems, P9562
[4] Akametalu AK, 2014, IEEE DECIS CONTR P, P1424, DOI 10.1109/CDC.2014.7039601
[5] Amodei D, 2016, ARXIV
[6] PID control system analysis, design, and technology
Ang, KH
Chong, G
Li, Y
[J]. IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2005, 13 (04) : 559 - 576
[7] Provably safe and robust learning-based model predictive control
Aswani, Anil
Gonzalez, Humberto
Sastry, S. Shankar
Tomlin, Claire
[J]. AUTOMATICA, 2013, 49 (05) : 1216 - 1226
[8] Berkenkamp F, 2016, IEEE DECIS CONTR P, P4661, DOI 10.1109/CDC.2016.7798979
[9] Berkenkamp F, 2015, 2015 EUROPEAN CONTROL CONFERENCE (ECC), P2496, DOI 10.1109/ECC.2015.7330913
[10] Berkenkamp Felix., 2017, Advances in neural information processing systems, P908

← 1 2 3 4 5 6 7 →