SVM-based tree-type neural networks as a critic in adaptive critic designs for control

被引:26
作者
Deb, Alok Kanti
Jayadeva
Gopal, Madan
Chandra, Suresh
机构
[1] Indian Inst Technol, Dept Elect Engn, New Delhi 110016, India
[2] Indian Inst Technol, Dept Math, New Delhi 110016, India
来源
IEEE TRANSACTIONS ON NEURAL NETWORKS | 2007年 / 18卷 / 04期
关键词
adaptive control; adaptive critic designs (ACDs); intelligent control; inverted pendulum; linear programming; neural network (NN) applications; support vector machines (SVMs);
D O I
10.1109/TNN.2007.899255
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we use the approach of adaptive critic design (ACD) for control, specifically, the action-dependent heuristic dynamic programming (ADHDP) method. A least squares support vector machine (SVM) regressor has been used for generating the control actions, while an SVM-based tree-type neural network (NN) is used as the critic. After a failure occurs, the critic and action are retrained in tandem using the failure data. Failure data is binary classification data, where the number of failure states are very few as compared to the number of no-failure states. The difficulty of conventional multilayer feedforward NNs in learning this type of classification data has been overcome by using the SVM-based tree-type NN, which due to its feature to add neurons to learn misclassified data, has the capability to learn any binary classification data without a priori choice of the number of neurons or the structure of the network. The capability of the trained controller to handle unforeseen situations is demonstrated.
引用
收藏
页码:1016 / 1030
页数:15
相关论文
共 34 条
[1]  
[Anonymous], [No title captured]
[2]  
[Anonymous], 1990, NEURAL NETWORKS CONT
[3]  
Bennett K. P., 1994, OPTIMIZATION METHODS, V3, P27, DOI DOI 10.1080/10556789408805554
[4]   LEARNING AND TUNING FUZZY-LOGIC CONTROLLERS THROUGH REINFORCEMENTS [J].
BERENJI, HR ;
KHEDKAR, P .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1992, 3 (05) :724-740
[5]   A dual neural network architecture for linear and nonlinear control of inverted pendulum on a cart [J].
Biega, V ;
Balakrishnan, N .
PROCEEDINGS OF THE 1996 IEEE INTERNATIONAL CONFERENCE ON CONTROL APPLICATIONS, 1996, :614-619
[6]  
BOSE NK, 1998, NEURAL NETWORK FUNDA
[7]   Evolving efficient learning algorithms for binary mappings [J].
Bullinaria, JA .
NEURAL NETWORKS, 2003, 16 (5-6) :793-800
[8]  
Cristianini N., 2000, Intelligent Data Analysis: An Introduction
[9]  
CROWBOROUGH E, FEEDBACK MATLAB BASE
[10]  
Cybenko G., 1989, Mathematics of Control, Signals, and Systems, V2, P303, DOI 10.1007/BF02551274