A supervised Actor–Critic approach for adaptive cruise control

被引:3
作者
Dongbin Zhao
Bin Wang
Derong Liu
机构
[1] Chinese Academy of Sciences,The State Key Laboratory of Management and Control for Complex Systems, Institute of Automation
来源
Soft Computing | 2013年 / 17卷
关键词
Supervised reinforcement learning; Actor–Critic; Adaptive cruise control; Uniformly ultimate bounded; Neural networks;
D O I
暂无
中图分类号
学科分类号
摘要
A novel supervised Actor–Critic (SAC) approach for adaptive cruise control (ACC) problem is proposed in this paper. The key elements required by the SAC algorithm namely Actor and Critic, are approximated by feed-forward neural networks respectively. The output of Actor and the state are input to Critic to approximate the performance index function. A Lyapunov stability analysis approach has been presented to prove the uniformly ultimate bounded property of the estimation errors of the neural networks. Moreover, we use the supervisory controller to pre-train Actor to achieve a basic control policy, which can improve the training convergence and success rate. We apply this method to learn an approximate optimal control policy for the ACC problem. Experimental results in several driving scenarios demonstrate that the SAC algorithm performs well, so it is feasible and effective for the ACC problem.
引用
收藏
页码:2089 / 2099
页数:10
相关论文
共 54 条
  • [1] Andreas T(2012)Vehicle trajectory effects of adaptive cruise control J Intell Trans Syst 16 36-44
  • [2] Dierks T(2009)Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence Neural Netw 22 851-860
  • [3] Thumati B(2001)Nonlinear acc in simulation and measurement Veh Syst Dyn 36 159-177
  • [4] Jagannathan S(2006)Adaptive cruise control simulator: a low-cost, multiple-driver-in-the-loop simulator IEEE Control Syst Mag 26 42-55
  • [5] Fritz A(2008)Neural network adaptive control for a class of nonlinear uncertain dynamical systems with asymptotic stability guarantees IEEE Trans Neural Netw 19 80-89
  • [6] Schiehlen W(2005)Reinforcement learning-based output feedback control of nonlinear systems with input constraints IEEE Trans Syst Man Cybern Part B Cybern 35 150-154
  • [7] Guvenc B(2011)Adaptive cruise control based on reinforcement leaning with shaping rewards J Adv Comput Intell Intell Info 15 4645-4650
  • [8] Kural E(2011)Model predictive multi-objective vehicular adaptive cruise control IEEE Trans Control Syst Technol 19 556-566
  • [9] Hayakawa T(2012)A boundedness result for the direct heuristic dynamic programming Neural Netw 32 229-235
  • [10] Haddad W(2007)A safe longitudinal control for adaptive cruise control and stop-and-go scenarios IEEE Trans Control Syst Technol 15 246-258