Prefrontal and striatal dopaminergic genes predict individual differences in exploration and exploitation

Cited by: 336
Authors
Frank, Michael J. [1, 2, 3]
Doll, Bradley B. [1, 2, 3]
Oas-Terpstra, Jen [4]
Moreno, Francisco [4]
Affiliations
[1] Brown Univ, Brown Inst Brain Sci, Dept Cognit & Linguist Sci, Providence, RI 02912 USA
[2] Brown Univ, Brown Inst Brain Sci, Dept Psychol, Providence, RI 02912 USA
[3] Brown Univ, Brown Inst Brain Sci, Dept Psychiat, Providence, RI 02912 USA
[4] Univ Arizona, Dept Psychiat, Tucson, AZ USA
Keywords
MIDBRAIN DOPAMINE; DECISION-MAKING; NEUROCOMPUTATIONAL ACCOUNT; SYNAPTIC PLASTICITY; NUCLEUS-ACCUMBENS; NEURONAL-ACTIVITY; POPULATION CODES; COMT GENOTYPE; REINFORCEMENT; REWARD;
DOI
10.1038/nn.2342
Chinese Library Classification (CLC) number
Q189 [Neuroscience]
Discipline classification code
071006
Abstract
The basal ganglia support learning to exploit decisions that have yielded positive outcomes in the past. In contrast, limited evidence implicates the prefrontal cortex in the process of making strategic exploratory decisions when the magnitude of potential outcomes is unknown. Here we examine neurogenetic contributions to individual differences in these distinct aspects of motivated human behavior, using a temporal decision-making task and computational analysis. We show that two genes controlling striatal dopamine function, DARPP-32 (also called PPP1R1B) and DRD2, are associated with exploitative learning to adjust response times incrementally as a function of positive and negative decision outcomes. In contrast, a gene primarily controlling prefrontal dopamine function (COMT) is associated with a particular type of 'directed exploration', in which exploratory decisions are made in proportion to Bayesian uncertainty about whether other choices might produce outcomes that are better than the status quo. Quantitative model fits reveal that genetic factors modulate independent parameters of a reinforcement learning system.
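As a rough illustration of the "directed exploration" idea described in the abstract, the sketch below computes a response-time adjustment proportional to the difference in Bayesian uncertainty between two classes of responses. It is not the authors' fitted model: the Beta-distribution belief representation, the names `beta_std` and `explore_bonus`, the `epsilon` scaling parameter, and the fast/slow split are all assumptions made for illustration.

```python
import math


def beta_std(a, b):
    """Standard deviation of a Beta(a, b) distribution (a, b > 0)."""
    n = a + b
    return math.sqrt(a * b / (n * n * (n + 1)))


def explore_bonus(fast_counts, slow_counts, epsilon=1.0):
    """Hypothetical directed-exploration term.

    fast_counts and slow_counts are (a, b) pseudo-counts describing the
    current belief about outcomes for fast and slow responses.
    The result is positive when slow responses are the more uncertain
    option (shift toward slower responses to explore them) and negative
    when fast responses are the more uncertain option.
    """
    return epsilon * (beta_std(*slow_counts) - beta_std(*fast_counts))


# Fast responses have been sampled often, slow responses hardly at all,
# so the bonus favors exploring slow responses.
bonus = explore_bonus(fast_counts=(10, 10), slow_counts=(1, 1))
print(bonus > 0)
```

Under these assumptions, `epsilon` plays the role of an individual-difference parameter: a larger value means exploration scales more steeply with relative uncertainty, while a value of zero removes uncertainty-driven exploration entirely.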
Pages: 1062 / U145
Page count: 11
Related papers
50 in total
[1]   Statistics of midbrain dopamine neuron spike trains in the awake primate [J].
Bayer, Hannah M. ;
Lau, Brian ;
Glimcher, Paul W. .
JOURNAL OF NEUROPHYSIOLOGY, 2007, 98 (03) :1428-1439
[2]   Midbrain dopamine neurons encode a quantitative reward prediction error signal [J].
Bayer, HM ;
Glimcher, PW .
NEURON, 2005, 47 (01) :129-141
[3]   High impulsivity predicts the switch to compulsive cocaine-taking [J].
Belin, David ;
Mar, Adam C. ;
Dalley, Jeffrey W. ;
Robbins, Trevor W. ;
Everitt, Barry J. .
SCIENCE, 2008, 320 (5881) :1352-1355
[4]   Dopamine and cAMP-regulated phosphoprotein 32 kDa controls both striatal long-term depression and long-term potentiation, opposing forms of synaptic plasticity [J].
Calabresi, P ;
Gubellini, P ;
Centonze, D ;
Picconi, B ;
Bernardi, G ;
Chergui, K ;
Svenningsson, P ;
Fienberg, AA ;
Greengard, P .
JOURNAL OF NEUROSCIENCE, 2000, 20 (22) :8443-8451
[5]   Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration [J].
Cohen, Jonathan D. ;
McClure, Samuel M. ;
Yu, Angela J. .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2007, 362 (1481) :933-942
[6]   Nucleus Accumbens D2/3 receptors predict trait impulsivity and cocaine reinforcement [J].
Dalley, Jeffrey W. ;
Fryer, Tim D. ;
Brichard, Laurent ;
Robinson, Emma S. J. ;
Theobald, David E. H. ;
Lääne, Kristjan ;
Pena, Yolanda ;
Murphy, Emily R. ;
Shah, Yasmene ;
Probst, Katrin ;
Abakumova, Irina ;
Aigbirhio, Franklin I. ;
Richards, Hugh K. ;
Hong, Young ;
Baron, Jean-Claude ;
Everitt, Barry J. ;
Robbins, Trevor W. .
SCIENCE, 2007, 315 (5816) :1267-1270
[7]   Time-limited modulation of appetitive Pavlovian memory by D1 and NMDA receptors in the nucleus accumbens [J].
Dalley, JW ;
Lääne, K ;
Theobald, DEH ;
Armstrong, HC ;
Corlett, PR ;
Chudasama, Y ;
Robbins, TW .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (17) :6189-6194
[8]   Cortical substrates for exploratory decisions in humans [J].
Daw, Nathaniel D. ;
O'Doherty, John P. ;
Dayan, Peter ;
Seymour, Ben ;
Dolan, Raymond J. .
NATURE, 2006, 441 (7095) :876-879
[9]   Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control [J].
Daw, ND ;
Niv, Y ;
Dayan, P .
NATURE NEUROSCIENCE, 2005, 8 (12) :1704-1711
[10]   Exploration bonuses and dual control [J].
Dayan, P ;
Sejnowski, TJ .
MACHINE LEARNING, 1996, 25 (01) :5-22