The rat approximates an ideal detector of changes in rates of reward: Implications for the law of effect

Cited by: 173
Authors
Gallistel, CR
Mark, TA
King, AP
Latham, PE
Affiliations
[1] Univ Calif Los Angeles, Dept Psychol, Los Angeles, CA USA
[2] Fairfield Univ, Dept Comp Sci, Fairfield, CT USA
[3] Univ Calif Los Angeles, Dept Neurobiol, Los Angeles, CA USA
Source
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-ANIMAL BEHAVIOR PROCESSES | 2001 / Vol. 27 / No. 04
Keywords
DOI
10.1037//0097-7403.27.4.354
Chinese Library Classification (CLC) number
B84 [Psychology];
Discipline classification code
04 ; 0402 ;
Abstract
Rats responded on 2 levers delivering brain stimulation reward on concurrent variable interval schedules. Following many successive sessions with unchanging relative rates of reward, subjects adjusted to an eventual change slowly and showed spontaneous reversions at the beginning of subsequent sessions. When changes in rates of reward occurred between and within every session, subjects adjusted to them about as rapidly as they could in principle do so, as shown by comparison to a Bayesian model of an ideal detector. This and other features of the adjustments to frequent changes imply that the behavioral effect of reinforcement depends on the subject's perception of incomes and changes in incomes rather than on the strengthening and weakening of behaviors in accord with their past effects or expected results. Models for the process by which perceived incomes determine stay durations and for the process that detects changes in rates are developed.
Pages: 354-372
Page count: 19