共 31 条
[21]
SAMUEL AL, 1959, IBM J RES DEV, V3, P211, DOI 10.1147/rd.441.0206
[22]
LEARNING CONTROL OF FINITE MARKOV-CHAINS WITH AN EXPLICIT TRADE-OFF BETWEEN ESTIMATION AND CONTROL
[J].
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS,
1988, 18 (05)
:677-684
[23]
SINGH SP, 1991, MACHINE LEARNING, P348
[25]
Sutton R. S., 1988, Machine Learning, V3, P9, DOI 10.1007/BF00115009
[26]
Sutton R. S., 1990, LEARNING COMPUTATION, P497, DOI DOI 10.1111/J.1748-1716.1960.TB01900.X
[27]
SUTTON RS, 1990, 7TH P INT C MACH LEA
[28]
Sutton RS, 1984, THESIS U MASSACHUSET
[29]
TESAURO GJ, 1991, RC17223 IBM TJ WATS
[30]
THRUN SB, 1992, ADV NEUR IN, V4, P531