Linear quadratic optimal learning control (LQL)

被引:74
作者
Frueh, JA [1 ]
Phan, MQ [1 ]
机构
[1] Princeton Univ, Dept Mech & Aerosp Engn, Princeton, NJ 08544 USA
关键词
D O I
10.1080/002071700405815
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A learning control solution to the problem of finding a finite-time optimal control history that minimizes a quadratic cost is presented. Learning achieves optimization without requiring detailed knowledge of the system, which may be affected by unknown but repetitive disturbances. The optimal solution is synthesized one basis function at a time, reaching optimality in a finite number of trials. These system-dependent basis functions are special in that (1) each newly added basis function is learned without interfering with the previously optimized ones, and (2) it is extracted using data from previous learning trials. Numerical and experimental results are used to illustrate the algorithm.
引用
收藏
页码:832 / 839
页数:8
相关论文
共 21 条
[1]   BETTERING OPERATION OF ROBOTS BY LEARNING [J].
ARIMOTO, S ;
KAWAMURA, S ;
MIYAZAKI, F .
JOURNAL OF ROBOTIC SYSTEMS, 1984, 1 (02) :123-140
[2]  
BIEN Z, 1998, RECENT ADV ITERATIVE
[3]  
Craig J. J., 1984, Proceedings of the 1984 American Control Conference (IEEE Cat. No. 84CH2024-8), P1566
[4]  
FRUEH JA, 1997, P 2 AS CONTR C SEOUL, V2, P251
[5]  
Hara S., 1985, Proceedings of the 24th IEEE Conference on Decision and Control (Cat. No.85CH2245-9), P326
[6]   LEARNING CONTROL OF ROBOT MANIPULATORS [J].
HOROWITZ, R .
JOURNAL OF DYNAMIC SYSTEMS MEASUREMENT AND CONTROL-TRANSACTIONS OF THE ASME, 1993, 115 (2B) :402-411
[7]  
INOUE T, 1981, P 8 WORLD C IFAC, P216
[8]   IDENTIFICATION OF OBSERVER KALMAN FILTER MARKOV PARAMETERS - THEORY AND EXPERIMENTS [J].
JUANG, JN ;
PHAN, M ;
HORTA, LG ;
LONGMAN, RW .
JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 1993, 16 (02) :320-329
[9]  
LONGMAN RW, 1991, IEEE C INT CONTR ARL
[10]  
LONGMAN RW, 1990, AIAA AAS ASTR C COLL, V2, P530