A NEURAL-NETWORK MODEL OF ADAPTIVELY TIMED REINFORCEMENT LEARNING AND HIPPOCAMPAL DYNAMICS

Cited by: 119
Authors
GROSSBERG, S [1]
MERRILL, JWL [1]
Affiliations
[1] BOSTON UNIV, DEPT COGNIT & NEURAL SYST, BOSTON, MA 02215 USA
Source
COGNITIVE BRAIN RESEARCH | 1992 / Vol. 1 / No. 1
Funding
National Science Foundation (USA);
Keywords
LEARNING; TIMING; NEURAL NETWORK; REINFORCEMENT; EMOTION; RECOGNITION; ATTENTION; MOTOR CONTROL; HIPPOCAMPUS; THALAMUS; CORTEX; CEREBELLUM; N-METHYL-D-ASPARTATE;
DOI
10.1016/0926-6410(92)90003-A
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
A neural model is described of how adaptively timed reinforcement learning occurs. The adaptive timing circuit is suggested to exist in the hippocampus, and to involve convergence of dentate granule cells on CA3 pyramidal cells, and N-methyl-D-aspartate (NMDA) receptors. This circuit forms part of a model neural system for the coordinated control of recognition learning, reinforcement learning, and motor learning, whose properties clarify how an animal can learn to acquire a delayed reward. Behavioral and neural data are summarized in support of each processing stage of the system. The relevant anatomical sites are in thalamus, neocortex, hippocampus, hypothalamus, amygdala and cerebellum. Cerebellar influences on motor learning are distinguished from hippocampal influences on adaptive timing of reinforcement learning. The model simulates how damage to the hippocampal formation disrupts adaptive timing, eliminates attentional blocking and causes symptoms of medial temporal amnesia. Properties of learned expectations, attentional focussing, memory search and orienting reactions to novel events are used to analyze the blocking and amnesia data. The model also suggests how normal acquisition of subcortical emotional conditioning can occur after cortical ablation, even though extinction of emotional conditioning is retarded by cortical ablation. The model simulates how increasing the duration of an unconditioned stimulus increases the amplitude of emotional conditioning, but does not change adaptive timing; and how an increase in the intensity of a conditioned stimulus 'speeds up the clock', but an increase in the intensity of an unconditioned stimulus does not. Computer simulations of the model fit parametric conditioning data, including a Weber law property and an inverted U property. Both primary and secondary adaptively timed conditionings are simulated, as are data concerning conditioning using multiple interstimulus intervals (ISIs), gradually or abruptly changing ISIs, partial reinforcement and multiple stimuli that lead to time-averaging of responses. Neurobiologically testable predictions are made to facilitate further tests of the model.
Pages: 3-38
Page count: 36