Shifting responsibly: the importance of striatal modularity to reinforcement learning in uncertain environments

被引:54
作者
Amemori, Ken-ichi [1 ,2 ]
Gibb, Leif G. [1 ,2 ]
Graybiel, Ann M. [1 ,2 ]
机构
[1] MIT, McGovern Inst Brain Res, Cambridge, MA 02139 USA
[2] MIT, Dept Brain & Cognit Sci, Cambridge, MA 02139 USA
来源
FRONTIERS IN HUMAN NEUROSCIENCE | 2011年 / 5卷
基金
美国国家卫生研究院;
关键词
basal ganglia; striatum; striosome and matrix compartments; direct and indirect pathways; acetylcholine; modular reinforcement learning; responsibility signal; mixture of experts; TONICALLY ACTIVE NEURONS; GABAERGIC SYNAPTIC-TRANSMISSION; MEDIUM SPINY NEURONS; MU-OPIOID RECEPTORS; BASAL GANGLIA; CHOLINERGIC INTERNEURONS; PROJECTION NEURONS; SUBTHALAMIC NUCLEUS; SUBSTANTIA-NIGRA; GLOBUS-PALLIDUS;
D O I
10.3389/fnhum.2011.00047
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
We propose here that the modular organization of the striatum reflects a context-sensitive modular learning architecture in which clustered striosome-matrisome domains participate in modular reinforcement learning (RL). Based on anatomical and physiological evidence, it has been suggested that the modular organization of the striatum could represent a learning architecture. There is not, however, a coherent view of how such a learning architecture could relate to the organization of striatal outputs into the direct and indirect pathways of the basal ganglia, nor a clear formulation of how such a modular architecture relates to the RL functions attributed to the striatum. Here, we hypothesize that striosome-matrisome modules not only learn to bias behavior toward specific actions, as in standard RL, but also learn to assess their own relevance to the environmental context and modulate their own learning and activity on this basis. We further hypothesize that the contextual relevance or "responsibility" of modules is determined by errors in predictions of environmental features and that such responsibility is assigned by striosomes and conveyed to matrisomes via local circuit interneurons. To examine these hypotheses and to identify the general requirements for realizing this architecture in the nervous system, we developed a simple modular RL model. We then constructed a network model of basal ganglia circuitry that includes these modules and the direct and indirect pathways. Based on simple assumptions, this model suggests that while the direct pathway may promote actions based on striatal action values, the indirect pathway may act as a gating network that facilitates or suppresses behavioral modules on the basis of striatal responsibility signals. Our modeling functionally unites the modular compartmental organization of the striatum with the direct-indirect pathway divisions of the basal ganglia, a step that we suggest will have important clinical implications.
引用
收藏
页数:20
相关论文
共 134 条
  • [31] Complementary roles of basal ganglia and cerebellum in learning and motor control
    Doya, K
    [J]. CURRENT OPINION IN NEUROBIOLOGY, 2000, 10 (06) : 732 - 739
  • [32] Multiple model-based reinforcement learning
    Doya, K
    Samejima, K
    Katagiri, K
    Kawato, M
    [J]. NEURAL COMPUTATION, 2002, 14 (06) : 1347 - 1369
  • [33] Metalearning and neuromodulation
    Doya, K
    [J]. NEURAL NETWORKS, 2002, 15 (4-6) : 495 - 506
  • [34] EBLEN F, 1995, J NEUROSCI, V15, P5999
  • [35] FLAHERTY AW, 1991, J NEUROPHYSIOL, V66, P1249
  • [36] FLAHERTY AW, 1994, J NEUROSCI, V14, P599
  • [37] FLAHERTY AW, 1993, J NEUROSCI, V13, P1120
  • [38] Exclusive and common targets of neostriatofugal projections of rat striosome neurons: a single neuron-tracing study using a viral vector
    Fujiyama, Fumino
    Sohn, Jaerin
    Nakano, Takashi
    Furuta, Takahiro
    Nakamura, Kouichi C.
    Matsuda, Wakoto
    Kaneko, Takeshi
    [J]. EUROPEAN JOURNAL OF NEUROSCIENCE, 2011, 33 (04) : 668 - 677
  • [39] Cholinergic modulation of neostriatal output:: A functional antagonism between different types of muscarinic receptors
    Galarraga, E
    Hernández-López, S
    Reyes, A
    Miranda, I
    Bermudez-Rattoni, F
    Vilchis, C
    Bargas, J
    [J]. JOURNAL OF NEUROSCIENCE, 1999, 19 (09) : 3629 - 3638
  • [40] GERFEN CR, 1987, J NEUROSCI, V7, P3915