Instrumental vigour in punishment and reward

Times cited: 55
Author
Dayan, Peter [1]
Affiliation
[1] UCL, Gatsby Computat Neurosci Unit, London WC1N 3AR, England
Keywords
dopamine; reinforcement learning; safety; serotonin; two-factor theory; DYNAMIC BEHAVIORAL-CHANGES; NUCLEUS-ACCUMBENS DOPAMINE; TEMPORAL DIFFERENCE MODELS; RAPHE SEROTONIN NEURONS; BASAL GANGLIA; REINFORCEMENT; PREDICTION; DORSAL; MODULATION; ACTIVATION;
DOI
10.1111/j.1460-9568.2012.08026.x
Chinese Library Classification number
Q189 [Neuroscience];
Discipline classification code
071006 ;
Abstract
Recent notions about the vigour of responding in operant conditioning suggest that the long-run average rate of reward should control the alacrity of action in cases in which the actual cost of speed is balanced against the opportunity cost of sloth. The average reward rate is suggested as being reported by tonic activity in the dopamine system and thereby influencing all actions, including ones that do not themselves lead directly to the rewards. This idea is syntactically problematical for the case of punishment. Here, we broaden the scope of the original suggestion, providing a two-factor analysis of obviated punishment in a variety of operant circumstances. We also consider the effects of stochastically successful actions, which turn out to differ rather markedly between appetitive and aversive cases. Finally, we study how to fit these ideas into nascent treatments that extend concepts of opponency between dopamine and serotonin from valence to invigoration.
Pages: 1152-1168
Number of pages: 17