Continuous action reinforcement learning applied to vehicle suspension control

被引：76

作者：

Howell, MN ^{[1
]}

Frost, GP ^{[1
]}

Gordon, TJ ^{[1
]}

Wu, QH ^{[1
]}

机构：

[1] UNIV LIVERPOOL,DEPT ELECT ENGN & ELECT,LIVERPOOL L69 3BX,MERSEYSIDE,ENGLAND

来源：

MECHATRONICS | 1997年 / 7卷 / 03期

关键词：

D O I：

10.1016/S0957-4158(97)00003-2

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

A new reinforcement learning algorithm is introduced which can be applied over a continuous range of actions. The learning algorithm is reward-inaction based, with a set of probability density functions being used to determine the action set. An experimental study is presented, based on the control of a semi-active suspension system on a road-going, four wheeled, passenger vehicle. The control objective is to minimise the mean square acceleration of the vehicle body, thus improving the ride isolation qualities of the vehicle. This represents a difficult class of learning problems, owing to the stochastic nature of the road input disturbance together with unknown high order dynamics, sensor noise and the non-linear (semi-active) control actuators. The learning algorithm described here operates over a bounded continuous action set, is robust to high levels of noise and is ideally suited to operating in a parallel computing environment. (C) 1997 Elsevier Science Ltd.

引用

页码：263 / 276

页数：14

共 10 条

[1]

[Anonymous], 1993, P CONN MOD SUMM SCH

[2]

Best M.C., 1995, Ph.D. Thesis

[3]

BEST MC, P FISITA C CHIN 1994, V2, P16

[4]

BOYAN J, 1992, THESIS CAMBRIDGE

[5]

FROST GP, 1994, CONTROL VIBRATION

[6] Stochastic optimal control of active vehicle suspensions using learning automata [J].

Gordon, T.J. ;

Marsh, C. ;

Wu, Q.H. .

Proceedings of the Institution of Mechanical Engineers. Part I, Journal of systems and control engineering, 1993, 207 (03) :143-152

[7] DESIGN PRINCIPLES FOR VIBRATION CONTROL-SYSTEMS USING SEMI-ACTIVE DAMPERS [J].

KARNOPP, D .

JOURNAL OF DYNAMIC SYSTEMS MEASUREMENT AND CONTROL-TRANSACTIONS OF THE ASME, 1990, 112 (03) :448-655

[8]

LIN LJ, 1991, 9TH P NAT C ART INT, P781

[9]

NARENDRA K, 1989, LEARNING AUTOMATA IN

[10]

WU QH, 1993, P IEEE SYST MAN CYB, V3, P728

← 1 →