Learning and decision making in monkeys during a rock-paper-scissors game

被引:75
作者
Lee, D [1 ]
McGreevy, BP [1 ]
Barraclough, DJ [1 ]
机构
[1] Univ Rochester, Ctr Visual Sci, Dept Brain & Cognit Sci, Rochester, NY 14627 USA
来源
COGNITIVE BRAIN RESEARCH | 2005年 / 25卷 / 02期
基金
美国国家卫生研究院;
关键词
game theory; mixed strategy; motivation; prefrontal cortex; reward; zero-sum game;
D O I
10.1016/j.cogbrainres.2005.07.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Game theory provides a solution to the problem of finding a set of optimal decision-making strategies in a group. However, people seldom play such optimal strategies and adjust their strategies based on their experience. Accordingly, many theories postulate a set of variables related to the probabilities of choosing various strategies and describe how such variables are dynamically updated. In reinforcement learning, these value functions are updated based on the outcome of the player's choice, whereas belief learning allows the value functions of all available choices to be updated according to the choices of other players. We investigated the nature of learning process in monkeys playing a competitive game with ternary choices, using a rock-paper-scissors game. During the baseline condition in which the computer selected its targets randomly, each animal displayed biases towards some targets. When the computer exploited the pattern of animal's choice sequence but not its reward history, the animal's choice was still systematically biased by the previous choice of the computer. This bias was reduced when the computer exploited both the choice and reward histories of the animal. Compared to simple models of reinforcement learning or belief learning, these adaptive processes were better described by a model that incorporated the features of both models. These results suggest that stochastic decision-making strategies in primates during social interactions might be adjusted according to both actual and hypothetical payoffs. (c) 2005 Elsevier B.V. All rights reserved.
引用
收藏
页码:416 / 430
页数:15
相关论文
共 41 条
[1]  
[Anonymous], 1983, Statistical methods
[2]   Prefrontal cortex and decision making in a mixed-strategy game [J].
Barraclough, DJ ;
Conroy, ML ;
Lee, D .
NATURE NEUROSCIENCE, 2004, 7 (04) :404-410
[3]   Does minimax work? An experimental study [J].
Binmore, K ;
Swierzbinski, J ;
Proulx, C .
ECONOMIC JOURNAL, 2001, 111 (473) :445-464
[4]   SUBJECTIVE RANDOMIZATION IN 1-PERSON AND 2-PERSON GAMES [J].
BUDESCU, DV ;
RAPOPORT, A .
JOURNAL OF BEHAVIORAL DECISION MAKING, 1994, 7 (04) :261-278
[5]  
Burnham K. P., 2002, MODEL SELECTION MULT
[6]   Experience-weighted attraction learning in normal form games [J].
Camerer, C ;
Ho, TH .
ECONOMETRICA, 1999, 67 (04) :827-874
[7]  
CAMERER CF, 2003, BEHAV GAME THOERY EX
[8]   Individual learning in normal form games: Some laboratory results [J].
Cheung, YW ;
Friedman, D .
GAMES AND ECONOMIC BEHAVIOR, 1997, 19 (01) :46-76
[9]  
Cournot A. A., 1838, Researches into the Mathematical Principles of the Theory of Wealth
[10]   Effects of expectations for different reward magnitudes on neuronal activity in primate striatum [J].
Cromwell, HC ;
Schultz, W .
JOURNAL OF NEUROPHYSIOLOGY, 2003, 89 (05) :2823-2838