LEARNING AND TUNING FUZZY-LOGIC CONTROLLERS THROUGH REINFORCEMENTS

被引:478
作者
BERENJI, HR [1 ]
KHEDKAR, P [1 ]
机构
[1] UNIV CALIF BERKELEY,DEPT ELECT ENGN & COMP SCI,BERKELEY,CA 94720
来源
IEEE TRANSACTIONS ON NEURAL NETWORKS | 1992年 / 3卷 / 05期
关键词
D O I
10.1109/72.159061
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a new method for learning and tuning a fuzzy logic controller based on reinforcements from a dynamic system. In particular, our generalized approximate reasoning-based intelligent control (GARIC) architecture (a) learns and tunes a fuzzy logic controller even when only weak reinforcement, such as a binary failure signal, is available; (b) introduces a new conjunction operator in computing the rule strengths of fuzzy control rules; (c) introduces a new localized mean of maximum (LMOM) method in combining the conclusions of several firing control rules; and (d) learns to produce real-valued control actions. Learning is achieved by integrating fuzzy inference into a feedforward neural network, which can then adaptively improve performance by using gradient descent methods. We extend the AHC algorithm of Barto, Sutton, and Anderson to include the prior control knowledge of human operators. The GARIC architecture is applied to a cart-pole balancing system and demonstrates significant improvements in terms of the speed of learning and robustness to changes in the dynamic system's parameters over previous schemes for cart-pole balancing.
引用
收藏
页码:724 / 740
页数:17
相关论文
共 31 条
  • [1] Anderson C. W., 1986, THESIS U MASSACHUSET
  • [2] ANDERSON CW, 1988, TR875093 GTE LAB INC
  • [3] [Anonymous], 1987, LEARNING INTERNAL RE
  • [4] Atkinson R. C., 1965, INTRO MATH LEARNING
  • [5] NEURONLIKE ADAPTIVE ELEMENTS THAT CAN SOLVE DIFFICULT LEARNING CONTROL-PROBLEMS
    BARTO, AG
    SUTTON, RS
    ANDERSON, CW
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1983, 13 (05): : 834 - 846
  • [6] BARTO AG, 1987, 1ST P IEEE INT C NEU, V2, P629
  • [7] BARTO AG, 1990, ADV NEURAL INFORMATI, V2, P686
  • [8] BARTO AG, 1989, COINS8989 U MASS TEC
  • [9] BERENJI H, 1990, 6TH P C UNC ART INT, P362
  • [10] Berenji H. R., 1992, International Journal of Approximate Reasoning, V6, P267, DOI 10.1016/0888-613X(92)90020-Z