共 49 条
[1]
ABERDEEN D, 2001, POLICY GRADIENT LEAR
[2]
ALEKSANDROV VM, 1968, ENG CYBERN, P11
[3]
[Anonymous], 1989, REAL ANAL PROBABILIT
[4]
BAIRD LC, 1999, ADV NEURAL INFORMATI, V11
[5]
BARTLETT PL, 1999, HEBBIAN SYNAPTIC MOD
[6]
BARTLETT PL, 2000, J COMPUTER SYSTEMS S, V62
[7]
NEURONLIKE ADAPTIVE ELEMENTS THAT CAN SOLVE DIFFICULT LEARNING CONTROL-PROBLEMS
[J].
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS,
1983, 13 (05)
:834-846
[9]
BAXTER J, 2001, IN PRESS J ARTIFICIA
[10]
Bertsekas D. P., 1996, Neuro Dynamic Programming, V1st