TEMPORAL DIFFERENCE LEARNING AND TD-GAMMON

Cited by: 853
Author
TESAURO, G
DOI
10.1145/203330.203343
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
[No abstract available]
Pages: 58-68
Page count: 11
Related papers
16 in total
  • [1] [Anonymous], 1990, ADV NEURAL INF PROCE
  • [2] COMPUTER BACKGAMMON
    BERLINER, H
    [J]. SCIENTIFIC AMERICAN, 1980, 242 (06) : 64 - &
  • [3] Dayan P, 1994, ADV NEURAL INFORM PR, P817
  • [4] TOWARD AN IDEAL TRAINER
    EPSTEIN, SL
    [J]. MACHINE LEARNING, 1994, 15 (03) : 251 - 277
  • [5] FAWCETT TE, 1992, MACHINE LEARNING /, P144
  • [6] MULTILAYER FEEDFORWARD NETWORKS ARE UNIVERSAL APPROXIMATORS
    HORNIK, K
    STINCHCOMBE, M
    WHITE, H
    [J]. NEURAL NETWORKS, 1989, 2 (05) : 359 - 366
  • [7] ISABELLE JF, 1993, THESIS U MONTREAL
  • [8] MAGRIEL P, 1976, BACKGAMMON
  • [9] Robertie B., 1992, INSIDE BACKGAMMON, V2, P14
  • [10] Rumelhart DE, 1986, ENCY DATABASE SYST, P45