Reinforcement learning to adaptive control of nonlinear systems

被引：29

作者：

Hwang, KS ^{[1
]}

Tan, SW ^{[1
]}

Tsai, MC ^{[1
]}

机构：

[1] Natl Chung Cheng Univ, Dept Elect Engn, Chiayi 62117, Taiwan

来源：

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 2003年 / 33卷 / 03期

关键词：

linearization; neural networks; reinforcement learning; system identification;

D O I：

10.1109/TSMCB.2003.811112

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Based on the feedback linearization theory, this paper presents how a reinforcement learning scheme that is adopted to construct artificial neural networks (ANNs) can linearize a nonlinear system effectively. The proposed reinforcement linearization learning system (RLLS) consists of two sub-systems: The evaluation predictor (EP) is a long-term policy selector, and the other is a short-term action selector composed of linearizing control (LC) and reinforce predictor (RP) elements. In addition, a reference model plays the role of the environment, which provides, the reinforcement signal to the linearizing process. The RLLS thus receives reinforcement signals to accomplish the linearizing behavior to control a nonlinear system such that it can behave similarly to the reference model. Eventually, the RILLS performs identification and -linearization concurrently. Simulation results demonstrate that the proposed learning scheme, which is applied to linearizing a pendulum system, provides better control reliability and robustness than conventional ANN schemes. Furthermore, a PI controller is used to control the linearized plant where the affine system behaves like a linear system.

引用

页码：514 / 521

页数：8

共 11 条

[1]

Anderson C. W., 1989, IEEE Control Systems Magazine, V9, P31, DOI 10.1109/37.24809

[2]

DELGADO A, 1994, P 2 INT C INT SYST E, P113

[3] A STOCHASTIC REINFORCEMENT LEARNING ALGORITHM FOR LEARNING REAL-VALUED FUNCTIONS [J].

GULLAPALLI, V .

NEURAL NETWORKS, 1990, 3 (06) :671-692

[4] Adaptive reinforcement learning system for linearization control [J].

Hwang, KS ;

Chao, HJ .

IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2000, 47 (05) :1185-1188

[5]

Narendra K S, 1990, IEEE Trans Neural Netw, V1, P4, DOI 10.1109/72.80202

[6] Neural network-based model reference adaptive control system [J].

Patiño, HD ;

Liu, DR .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2000, 30 (01) :198-204

[7] Neural network approach for linearizing control of nonlinear process plants [J].

Rahman, MHRF ;

Devanathan, R ;

Zhu, KY .

IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2000, 47 (02) :470-477

[8]

SOULING H, 1998, IEEE T NEURAL NETWOR, V9, P1409

[9]

Sutton R. S., 1988, Machine Learning, V3, P9, DOI 10.1023/A:1022633531479

[10] Reinforcement learning to train a cooperative network with both discrete and continuous output neurons [J].

Yamada, S ;

Nakashima, M ;

Shiono, S .

IEEE TRANSACTIONS ON NEURAL NETWORKS, 1998, 9 (06) :1502-1508

← 1 2 →