REINFORCEMENT LEARNING OR TRACKING OF INPUT-OUTPUT MAPS

被引：2

作者：

HEISS, M

机构：

[1] Institut fur Allgemeine Elektrotechnik und Elektronik University of Technology Vienna, Vienna

来源：

APPLIED ARTIFICIAL INTELLIGENCE | 1994年 / 8卷 / 04期

关键词：

D O I：

10.1080/08839519408945456

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Three approaches are discussed for learning a slowly time-varying and smooth input-output map in a real-world environment: query smoothing, update smoothing, and radial basis functions. The main restrictions in a real-world environment are (1) some points of the map are more frequently updated than other points, and (2) the updating information may be noisy. A new algorithm is presented called update smoothing with linear interpolation. This learning algorithm allows immediate response (flash map) to a query of the map and is also very fast in updating the map. The advantages and limitations of the algorithm are analyzed. The presented numerical example and discussion of industrial applications show the relevance of these self-learning flash maps for the industry.

引用

页码：483 / 496

页数：14

共 45 条

[1] ABE K, 1988, Patent No. 4733357
[2] ABE K, 1990, Patent No. 4001494
[3] ATKESON CG, 1986, APR P IEEE INT C ROB, P1737
[4] ATKESON CG, 1991, IEEE INT C ROBOTICS
[5] AUGESKY C, 1988, Patent No. 3822582
[6] AUGESKY C, 1989, Patent No. 3822582
[7] FURUYAMA MH, 1990, Patent No. 3928585
[8] FURUYAMA MH, 1988, Patent No. 3819016
[9] GIROSI F, 1989, MIT AI1164 MEM
[10] INVERSE PASSIVE LEARNING OF AN INPUT OUTPUT-MAP THROUGH UPDATE-SPLINE-SMOOTHING
HEISS, M
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1994, 39 (02) : 259 - 268

← 1 2 3 4 5 →