LEARNING AND TUNING FUZZY-LOGIC CONTROLLERS THROUGH REINFORCEMENTS

被引：478

作者：

BERENJI, HR ^{[1
]}

KHEDKAR, P ^{[1
]}

机构：

[1] UNIV CALIF BERKELEY,DEPT ELECT ENGN & COMP SCI,BERKELEY,CA 94720

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS | 1992年 / 3卷 / 05期

关键词：

D O I：

10.1109/72.159061

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper presents a new method for learning and tuning a fuzzy logic controller based on reinforcements from a dynamic system. In particular, our generalized approximate reasoning-based intelligent control (GARIC) architecture (a) learns and tunes a fuzzy logic controller even when only weak reinforcement, such as a binary failure signal, is available; (b) introduces a new conjunction operator in computing the rule strengths of fuzzy control rules; (c) introduces a new localized mean of maximum (LMOM) method in combining the conclusions of several firing control rules; and (d) learns to produce real-valued control actions. Learning is achieved by integrating fuzzy inference into a feedforward neural network, which can then adaptively improve performance by using gradient descent methods. We extend the AHC algorithm of Barto, Sutton, and Anderson to include the prior control knowledge of human operators. The GARIC architecture is applied to a cart-pole balancing system and demonstrates significant improvements in terms of the speed of learning and robustness to changes in the dynamic system's parameters over previous schemes for cart-pole balancing.

引用

页码：724 / 740

页数：17

共 31 条

[1] Anderson C. W., 1986, THESIS U MASSACHUSET
[2] ANDERSON CW, 1988, TR875093 GTE LAB INC
[3] [Anonymous], 1987, LEARNING INTERNAL RE
[4] Atkinson R. C., 1965, INTRO MATH LEARNING
[5] NEURONLIKE ADAPTIVE ELEMENTS THAT CAN SOLVE DIFFICULT LEARNING CONTROL-PROBLEMS
BARTO, AG
SUTTON, RS
ANDERSON, CW
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1983, 13 (05): : 834 - 846
[6] BARTO AG, 1987, 1ST P IEEE INT C NEU, V2, P629
[7] BARTO AG, 1990, ADV NEURAL INFORMATI, V2, P686
[8] BARTO AG, 1989, COINS8989 U MASS TEC
[9] BERENJI H, 1990, 6TH P C UNC ART INT, P362
[10] Berenji H. R., 1992, International Journal of Approximate Reasoning, V6, P267, DOI 10.1016/0888-613X(92)90020-Z

← 1 2 3 4 →