Continuous action reinforcement learning applied to vehicle suspension control

被引:76
作者
Howell, MN [1 ]
Frost, GP [1 ]
Gordon, TJ [1 ]
Wu, QH [1 ]
机构
[1] UNIV LIVERPOOL,DEPT ELECT ENGN & ELECT,LIVERPOOL L69 3BX,MERSEYSIDE,ENGLAND
关键词
D O I
10.1016/S0957-4158(97)00003-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A new reinforcement learning algorithm is introduced which can be applied over a continuous range of actions. The learning algorithm is reward-inaction based, with a set of probability density functions being used to determine the action set. An experimental study is presented, based on the control of a semi-active suspension system on a road-going, four wheeled, passenger vehicle. The control objective is to minimise the mean square acceleration of the vehicle body, thus improving the ride isolation qualities of the vehicle. This represents a difficult class of learning problems, owing to the stochastic nature of the road input disturbance together with unknown high order dynamics, sensor noise and the non-linear (semi-active) control actuators. The learning algorithm described here operates over a bounded continuous action set, is robust to high levels of noise and is ideally suited to operating in a parallel computing environment. (C) 1997 Elsevier Science Ltd.
引用
收藏
页码:263 / 276
页数:14
相关论文
共 10 条
[1]  
[Anonymous], 1993, P CONN MOD SUMM SCH
[2]  
Best M.C., 1995, Ph.D. Thesis
[3]  
BEST MC, P FISITA C CHIN 1994, V2, P16
[4]  
BOYAN J, 1992, THESIS CAMBRIDGE
[5]  
FROST GP, 1994, CONTROL VIBRATION
[6]   Stochastic optimal control of active vehicle suspensions using learning automata [J].
Gordon, T.J. ;
Marsh, C. ;
Wu, Q.H. .
Proceedings of the Institution of Mechanical Engineers. Part I, Journal of systems and control engineering, 1993, 207 (03) :143-152
[7]   DESIGN PRINCIPLES FOR VIBRATION CONTROL-SYSTEMS USING SEMI-ACTIVE DAMPERS [J].
KARNOPP, D .
JOURNAL OF DYNAMIC SYSTEMS MEASUREMENT AND CONTROL-TRANSACTIONS OF THE ASME, 1990, 112 (03) :448-655
[8]  
LIN LJ, 1991, 9TH P NAT C ART INT, P781
[9]  
NARENDRA K, 1989, LEARNING AUTOMATA IN
[10]  
WU QH, 1993, P IEEE SYST MAN CYB, V3, P728