The ascending neuromodulatory systems in learning by reinforcement: Comparing computational conjectures with experimental findings

Cited by: 50
Authors
Pennartz, CMA [1]
Affiliations
[1] CALTECH, PASADENA, CA 91125 USA
Keywords
acetylcholine; dopamine; long-term potentiation; memory; noradrenaline; supervised learning; synaptic plasticity; temporal difference learning
DOI
10.1016/0165-0173(95)00014-3
Chinese Library Classification
Q189 [Neuroscience]
Discipline classification code
071006
Abstract
A central problem in cognitive neuroscience is how animals can manage to rapidly master complex sensorimotor tasks when the only sensory feedback they use to improve their performance is a simple reinforcing stimulus. Neural network theorists have constructed algorithms for reinforcement learning that can be used to solve a variety of biological problems and do not violate basic neurophysiological principles, in contrast to the back-propagation algorithm. A key assumption in these models is the existence of a reinforcement signal, which would be diffusively broadcast throughout one or several brain areas engaged in learning. This signal is further assumed to mediate up- and downward changes in synaptic efficacy by acting as a multiplicative factor in learning rules. The biological plausibility of these algorithms has been defended by the conjecture that the neuromodulators noradrenaline, acetylcholine or dopamine may form the neurochemical substrate of reinforcement signals. In this commentary, the predictions raised by this hypothesis are compared to anatomical, electrophysiological and behavioural findings. The experimental evidence does not support, and often argues against, a general reinforcement-encoding function of these neuromodulatory systems. Nevertheless, the broader concept of evaluative signalling between brain structures implied in learning appears to be reasonable and the available algorithms may open new avenues for constructing more realistic network architectures.
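The abstract's description of a reinforcement signal acting as a multiplicative factor in learning rules can be illustrated with a minimal sketch. This is not taken from the paper: the function name `reward_modulated_update`, the Hebbian form of the rule, and all parameter values are assumptions chosen for illustration. It shows only how a single broadcast scalar could set the direction of a synaptic change.

```python
def reward_modulated_update(weights, pre, post, reinforcement, learning_rate=0.1):
    """Return new weights after one update (hypothetical rule, for illustration).

    weights[i][j] connects presynaptic unit j to postsynaptic unit i.
    The scalar reinforcement multiplies the Hebbian term post[i] * pre[j],
    so its sign decides whether a co-active synapse is strengthened or weakened.
    """
    return [
        [w + learning_rate * reinforcement * post[i] * pre[j]
         for j, w in enumerate(row)]
        for i, row in enumerate(weights)
    ]


# Two presynaptic and two postsynaptic units; only pre[0] and post[1] are active.
weights = [[0.0, 0.0], [0.0, 0.0]]
pre = [1.0, 0.0]
post = [0.0, 1.0]

# The same activity pattern under a positive and a negative reinforcement signal.
potentiated = reward_modulated_update(weights, pre, post, reinforcement=1.0)
depressed = reward_modulated_update(weights, pre, post, reinforcement=-1.0)
```

In this sketch only the synapse between the co-active pair changes, upward under positive reinforcement and downward under negative reinforcement, while every synapse receives the same broadcast scalar.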
Pages: 219-245
Page count: 27