共 37 条
[1]
[Anonymous], 1998, P 15 INT C MACH LEAR
[2]
Baird L, 1995, MACHINE LEARNING P 1, P30
[3]
Bakker B, 2002, ADV NEUR IN, V14, P1475
[4]
Barto A.G., 1990, Learning and Computational Neuroscience
[5]
NEURONLIKE ADAPTIVE ELEMENTS THAT CAN SOLVE DIFFICULT LEARNING CONTROL-PROBLEMS
[J].
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS,
1983, 13 (05)
:834-846
[6]
Bertsekas D., 1996, NEURO DYNAMIC PROGRA, V1st
[7]
Boyan, 1994, ADV NEURAL INFORM PR, P671
[8]
Bradtke S. J., 1995, Advances in Neural Information Processing Systems 7, P393
[10]
Christopher JohnCornish Hella by Watkins., 1989, Learning from delayed rewards