The ascending neuromodulatory systems in learning by reinforcement: Comparing computational conjectures with experimental findings

被引:50
作者
Pennartz, CMA [1 ]
机构
[1] CALTECH, PASADENA, CA 91125 USA
关键词
acetylcholine; dopamine; long-term potentiation; memory; noradrenaline; supervised learning; synaptic plasticity; temporal difference learning;
D O I
10.1016/0165-0173(95)00014-3
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
A central problem in cognitive neuroscience is how animals can manage to rapidly master complex sensorimotor tasks when the only sensory feedback they use to improve their performance is a simple reinforcing stimulus. Neural network theorists have constructed algorithms for reinforcement learning that can be used to solve a variety of biological problems and do not violate basic neurophysiological principles, in contrast to the back-propagation algorithm. A key assumption in these models is the existence of a reinforcement signal, which would be diffusively broadcast throughout one or several brain areas engaged in learning. This signal is further assumed to mediate up- and downward changes in synaptic efficacy by acting as a multiplicative factor in learning rules. The biological plausibility of these algorithms has been defended by the conjecture that the neuromodulators noradrenaline, acetylcholine or dopamine may form the neurochemical substrate of reinforcement signals. In this commentary, the predictions raised by this hypothesis are compared to anatomical, electrophysiological and behavioural findings. The experimental evidence does not support, and often argues against, a general reinforcement-encoding function of these neuromodulatory systems. Nevertheless, the broader concept of evaluative signalling between brain structures implied in learning appears to be reasonable and the available algorithms may open new avenues for constructing more realistic network architectures.
引用
收藏
页码:219 / 245
页数:27
相关论文
共 235 条
[91]   NEURONS WITH GRADED RESPONSE HAVE COLLECTIVE COMPUTATIONAL PROPERTIES LIKE THOSE OF 2-STATE NEURONS [J].
HOPFIELD, JJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA-BIOLOGICAL SCIENCES, 1984, 81 (10) :3088-3092
[92]   NORADRENERGIC ENHANCEMENT OF LONG-TERM POTENTIATION AT MOSSY FIBER SYNAPSES IN THE HIPPOCAMPUS [J].
HOPKINS, WF ;
JOHNSTON, D .
JOURNAL OF NEUROPHYSIOLOGY, 1988, 59 (02) :667-687
[93]   DOPAMINE UPTAKE - A REVIEW OF PROGRESS IN THE LAST DECADE [J].
HORN, AS .
PROGRESS IN NEUROBIOLOGY, 1990, 34 (05) :387-400
[94]   MULTILAYER FEEDFORWARD NETWORKS ARE UNIVERSAL APPROXIMATORS [J].
HORNIK, K ;
STINCHCOMBE, M ;
WHITE, H .
NEURAL NETWORKS, 1989, 2 (05) :359-366
[95]   IMMUNOCYTOCHEMICAL LOCALIZATION OF CHOLINE-ACETYLTRANSFERASE IN RAT CEREBRAL-CORTEX - A STUDY OF CHOLINERGIC NEURONS AND SYNAPSES [J].
HOUSER, CR ;
CRAWFORD, GD ;
SALVATERRA, PM ;
VAUGHN, JE .
JOURNAL OF COMPARATIVE NEUROLOGY, 1985, 234 (01) :17-34
[96]   CHANGES IN BRAIN DOPAMINE AND ACETYLCHOLINE-RELEASE DURING AND FOLLOWING STRESS ARE INDEPENDENT OF THE PITUITARY-ADRENOCORTICAL AXIS [J].
IMPERATO, A ;
PUGLISIALLEGRA, S ;
CASOLINI, P ;
ANGELUCCI, L .
BRAIN RESEARCH, 1991, 538 (01) :111-117
[97]   CHOLINERGIC ROLE IN MONKEY DORSOLATERAL PREFRONTAL CORTEX DURING BAR-PRESS FEEDING-BEHAVIOR [J].
INOUE, M ;
OOMURA, Y ;
NISHINO, H ;
AOU, S ;
SIKDAR, SK ;
HYNES, M ;
MIZUNO, Y ;
KATABUCHI, T .
BRAIN RESEARCH, 1983, 278 (1-2) :185-194
[98]   COMPARISON OF THE EFFECTS OF COCAINE AND OTHER INHIBITORS OF DOPAMINE UPTAKE IN RAT STRIATUM, NUCLEUS-ACCUMBENS, OLFACTORY TUBERCLE, AND MEDIAL PREFRONTAL CORTEX [J].
IZENWASSER, S ;
WERLING, LL ;
COX, BM .
BRAIN RESEARCH, 1990, 520 (1-2) :303-309
[99]   SINGLE UNIT-ACTIVITY OF LOCUS-COERULEUS NEURONS IN BEHAVING ANIMALS [J].
JACOBS, BL .
PROGRESS IN NEUROBIOLOGY, 1986, 27 (02) :183-194
[100]   STRUCTURE AND FUNCTION OF THE BRAIN-SEROTONIN SYSTEM [J].
JACOBS, BL ;
AZMITIA, EC .
PHYSIOLOGICAL REVIEWS, 1992, 72 (01) :165-229