Reinforcement learning to adaptive control of nonlinear systems

被引:29
作者
Hwang, KS [1 ]
Tan, SW [1 ]
Tsai, MC [1 ]
机构
[1] Natl Chung Cheng Univ, Dept Elect Engn, Chiayi 62117, Taiwan
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 2003年 / 33卷 / 03期
关键词
linearization; neural networks; reinforcement learning; system identification;
D O I
10.1109/TSMCB.2003.811112
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Based on the feedback linearization theory, this paper presents how a reinforcement learning scheme that is adopted to construct artificial neural networks (ANNs) can linearize a nonlinear system effectively. The proposed reinforcement linearization learning system (RLLS) consists of two sub-systems: The evaluation predictor (EP) is a long-term policy selector, and the other is a short-term action selector composed of linearizing control (LC) and reinforce predictor (RP) elements. In addition, a reference model plays the role of the environment, which provides, the reinforcement signal to the linearizing process. The RLLS thus receives reinforcement signals to accomplish the linearizing behavior to control a nonlinear system such that it can behave similarly to the reference model. Eventually, the RILLS performs identification and -linearization concurrently. Simulation results demonstrate that the proposed learning scheme, which is applied to linearizing a pendulum system, provides better control reliability and robustness than conventional ANN schemes. Furthermore, a PI controller is used to control the linearized plant where the affine system behaves like a linear system.
引用
收藏
页码:514 / 521
页数:8
相关论文
共 11 条
[1]  
Anderson C. W., 1989, IEEE Control Systems Magazine, V9, P31, DOI 10.1109/37.24809
[2]  
DELGADO A, 1994, P 2 INT C INT SYST E, P113
[3]   A STOCHASTIC REINFORCEMENT LEARNING ALGORITHM FOR LEARNING REAL-VALUED FUNCTIONS [J].
GULLAPALLI, V .
NEURAL NETWORKS, 1990, 3 (06) :671-692
[4]   Adaptive reinforcement learning system for linearization control [J].
Hwang, KS ;
Chao, HJ .
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2000, 47 (05) :1185-1188
[5]  
Narendra K S, 1990, IEEE Trans Neural Netw, V1, P4, DOI 10.1109/72.80202
[6]   Neural network-based model reference adaptive control system [J].
Patiño, HD ;
Liu, DR .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2000, 30 (01) :198-204
[7]   Neural network approach for linearizing control of nonlinear process plants [J].
Rahman, MHRF ;
Devanathan, R ;
Zhu, KY .
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2000, 47 (02) :470-477
[8]  
SOULING H, 1998, IEEE T NEURAL NETWOR, V9, P1409
[9]  
Sutton R. S., 1988, Machine Learning, V3, P9, DOI 10.1023/A:1022633531479
[10]   Reinforcement learning to train a cooperative network with both discrete and continuous output neurons [J].
Yamada, S ;
Nakashima, M ;
Shiono, S .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1998, 9 (06) :1502-1508