Model of birdsong learning based on gradient estimation by dynamic perturbation of neural conductances

被引:114
作者
Fiete, Ila R.
Fee, Michale S.
Seung, H. Sebastian
机构
[1] Univ Calif Santa Barbara, Kavli Inst Theoret Phys, Santa Barbara, CA 93106 USA
[2] Univ Calif San Diego, Ctr Theoret Biol Phys, San Diego, CA 92103 USA
[3] MIT, McGovern Inst Brain Res, Cambridge, MA 02139 USA
[4] MIT, Howard Hughes Med Inst, Cambridge, MA 02139 USA
[5] MIT, Dept Brain & Cognit Sci, Cambridge, MA 02139 USA
关键词
D O I
10.1152/jn.01311.2006
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
We propose a model of songbird learning that focuses on avian brain areas HVC and RA, involved in song production, and area LMAN, important for generating song variability. Plasticity at HVC -> RA synapses is driven by hypothetical "rules" depending on three signals: activation of HVC -> RA synapses, activation of LMAN -> RA synapses, and reinforcement from an internal critic that compares the bird's own song with a memorized template of an adult tutor's song. Fluctuating glutamatergic input to RA from LMAN generates behavioral variability for trial-and-error learning. The plasticity rules perform gradient-based reinforcement learning in a spiking neural network model of song production. Although the reinforcement signal is delayed, temporally imprecise, and binarized, the model learns in a reasonable amount of time in numerical simulations. Varying the number of neurons in HVC and RA has little effect on learning time. The model makes specific predictions for the induction of bidirectional long-term plasticity at HVC -> RA synapses.
引用
收藏
页码:2038 / 2057
页数:20
相关论文
共 75 条
[1]   Robustness in bacterial chemotaxis [J].
Alon, U ;
Surette, MG ;
Barkai, N ;
Leibler, S .
NATURE, 1999, 397 (6715) :168-171
[2]  
[Anonymous], 1993, Advances in neural information processing systems
[3]   NEURONLIKE ADAPTIVE ELEMENTS THAT CAN SOLVE DIFFICULT LEARNING CONTROL-PROBLEMS [J].
BARTO, AG ;
SUTTON, RS ;
ANDERSON, CW .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1983, 13 (05) :834-846
[4]   PATTERN-RECOGNIZING STOCHASTIC LEARNING AUTOMATA [J].
BARTO, AG ;
ANANDAN, P .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1985, 15 (03) :360-375
[5]   Infinite-horizon policy-gradient estimation [J].
Baxter, J ;
Bartlett, PL .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2001, 15 :319-350
[6]   Pure-tone birdsong by resonance filtering of harmonic overtones [J].
Beckers, GJL ;
Suthers, RA ;
ten Cate, C .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2003, 100 (12) :7372-7376
[7]   FOREBRAIN LESIONS DISRUPT DEVELOPMENT BUT NOT MAINTENANCE OF SONG IN PASSERINE BIRDS [J].
BOTTJER, SW ;
MIESNER, EA ;
ARNOLD, AP .
SCIENCE, 1984, 224 (4651) :901-903
[8]   Auditory feedback in learning and maintenance of vocal behaviour [J].
Brainard, MS ;
Doupe, AJ .
NATURE REVIEWS NEUROSCIENCE, 2000, 1 (01) :31-40
[9]   What songbirds teach us about learning [J].
Brainard, MS ;
Doupe, AJ .
NATURE, 2002, 417 (6886) :351-358
[10]   Interruption of a basal ganglia-forebrain circuit prevents plasticity of learned vocalizations [J].
Brainard, MS ;
Doupe, AJ .
NATURE, 2000, 404 (6779) :762-766