Solving the credit assignment problem: explicit and implicit learning of action sequences with probabilistic outcomes

被引：35

作者：

Fu, Wai-Tat ^{[1
]}

Anderson, John R. ^{[2
]}

机构：

[1] Univ Illinois, Human Factors Div & Beckman Inst, Urbana, IL 61801 USA

[2] Carnegie Mellon Univ, Dept Psychol, Pittsburgh, PA 15213 USA

来源：

PSYCHOLOGICAL RESEARCH-PSYCHOLOGISCHE FORSCHUNG | 2008年 / 72卷 / 03期

关键词：

D O I：

10.1007/s00426-007-0113-7

中图分类号：

B84 [心理学];

学科分类号：

04 ; 0402 ;

摘要：

In most problem-solving activities, feedback is received at the end of an action sequence. This creates a credit-assignment problem where the learner must associate the feedback with earlier actions, and the interdependencies of actions require the learner to remember past choices of actions. In two studies, we investigated the nature of explicit and implicit learning processes in the credit-assignment problem using a probabilistic sequential choice task with and without a secondary memory task. We found that when explicit learning was dominant, learning was faster to select the better option in their first choices than in the last choices. When implicit reinforcement learning was dominant, learning was faster to select the better option in their last choices than in their first choices. Consistent with the probability-learning and sequence-learning literature, the results show that credit assignment involves two processes: an explicit memory encoding process that requires memory rehearsals and an implicit reinforcement-learning process that propagates credits backwards to previous choices.

引用

页码：321 / 330

页数：10

共 45 条

[1] SPECIALIZING THE OPERATION OF AN EXPLICIT RULE
ALLEN, SW
BROOKS, LR
[J]. JOURNAL OF EXPERIMENTAL PSYCHOLOGY-GENERAL, 1991, 120 (01) : 3 - 19
[2] [Anonymous], 1964, CATEGORIES HUMAN LEA
[3] On the dominance of unidimensional rules in unsupervised categorization
Ashby, FG
Queller, S
Berretty, PM
[J]. PERCEPTION & PSYCHOPHYSICS, 1999, 61 (06): : 1178 - 1199
[4] Sequence learning in a dual-stimulus setting
Cleeremans, A
[J]. PSYCHOLOGICAL RESEARCH-PSYCHOLOGISCHE FORSCHUNG, 1997, 60 (1-2): : 72 - 86
[5] LEARNING THE STRUCTURE OF EVENT SEQUENCES
CLEEREMANS, A
MCCLELLAND, JL
[J]. JOURNAL OF EXPERIMENTAL PSYCHOLOGY-GENERAL, 1991, 120 (03) : 235 - 253
[6] ATTENTION AND STRUCTURE IN SEQUENCE LEARNING
COHEN, A
IVRY, RI
KEELE, SW
[J]. JOURNAL OF EXPERIMENTAL PSYCHOLOGY-LEARNING MEMORY AND COGNITION, 1990, 16 (01) : 17 - 30
[7] ATTENTIONAL AND NONATTENTIONAL FORMS OF SEQUENCE LEARNING
CURRAN, T
KEELE, SW
[J]. JOURNAL OF EXPERIMENTAL PSYCHOLOGY-LEARNING MEMORY AND COGNITION, 1993, 19 (01) : 189 - 202
[8] Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
Daw, ND
Niv, Y
Dayan, P
[J]. NATURE NEUROSCIENCE, 2005, 8 (12) : 1704 - 1711
[9] THE ROLE OF AUDITORY FEATURES IN MEMORY SPAN FOR WORDS
DREWNOWSKI, A
MURDOCK, BB
[J]. JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN LEARNING AND MEMORY, 1980, 6 (03): : 319 - 332
[10] Traps in the route to models of memory and decision
Estes, WK
[J]. PSYCHONOMIC BULLETIN & REVIEW, 2002, 9 (01) : 3 - 25

← 1 2 3 4 5 →