共 8 条
- [1] Learning from Delayed Rewards. Watkins C. . 1989
- [2] Ants can play prisoner’s dilemma. Thlol Y,Acan A. 2003 IEEE International Conference on Sys-tems,Man and Cybernetics . 2003
- [3] Nash-Qlearning for general-sumstochastic games. Hu,Wellman M P. Journal of Machine LearningResearch . 2003
- [4] Another approach to mutation and learning. Amir M,Berninghaus S K. Games and Economic Behavior . 1996
- [5] Markov games as a framework formulti-agent reinforcement learning. Littman M L. Proceedingsof the Eleventh International Conference on MachineLearning . 1994
- [6] Emergence of cooperation and evolutionary stability in finite populations. Martin Nowak,et al. Nature . 2004
- [7] Genetic algorithms and evolution-ary games. Yao X,Darwen P. Commerce,Complexity and E-volution . 2000
- [8] Evolution and the Theory of Games. Smith J M. . 1982