Social is special: A normative framework for teaching with and learning from evaluative feedback

被引:47
作者
Ho, Mark K. [1 ]
MacGlashan, James [2 ]
Littman, Michael L. [2 ]
Cushman, Fiery [3 ]
机构
[1] Brown Univ, Dept Cognit Linguist & Psychol Sci, Box 1821, Providence, RI 02912 USA
[2] Brown Univ, Dept Comp Sci, 115 Waterman St, Providence, RI 02906 USA
[3] Harvard Univ, Dept Psychol, William James Hall,33 Kirkland St, Cambridge, MA 02138 USA
基金
美国国家科学基金会;
关键词
Reward; Punishment; Theory of mind; Social learning; Evaluative feedback; Teaching; PEDAGOGICAL CUES; MATERNAL ENCOURAGEMENT; RATIONAL IMITATION; INFANTS SELECTION; CHILD COMPLIANCE; REINFORCEMENT; PUNISHMENT; EVOLUTION; BEHAVIOR; REWARDS;
D O I
10.1016/j.cognition.2017.03.006
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
Humans often attempt to influence one another's behavior using rewards and punishments. How does this work? Psychologists have often assumed that "evaluative feedback" influences behavior via standard learning mechanisms that learn from environmental contingencies. On this view, teaching with evaluative feedback involves leveraging learning systems designed to maximize an organism's positive outcomes. Yet, despite its parsimony, programs of research predicated on this assumption, such as ones in developmental psychology, animal behavior, and human-robot interaction, have had limited success. We offer an explanation by analyzing the logic of evaluative feedback and show that specialized learning mechanisms are uniquely favored in the case of evaluative feedback from a social partner. Specifically, evaluative feedback works best when it is treated as communicating information about the value of an action rather than as a form of reward to be maximized. This account suggests that human learning from evaluative feedback depends on inferences about communicative intent, goals and other mental states much like learning from other sources, such as demonstration, observation and instruction. Because these abilities are especially developed in humans, the present account also explains why evaluative feedback is far more widespread in humans than non-human animals. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:91 / 106
页数:16
相关论文
共 118 条
  • [61] Reinforcement learning: A survey
    Kaelbling, LP
    Littman, ML
    Moore, AW
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1996, 4 : 237 - 285
  • [62] Beyond rational imitation: Learning arbitrary means actions from communicative demonstrations
    Kiraly, Ildiko
    Csibra, Gergely
    Gergely, Gyorgy
    [J]. JOURNAL OF EXPERIMENTAL CHILD PSYCHOLOGY, 2013, 116 (02) : 471 - 486
  • [63] Framing reinforcement learning from human reward: Reward positivity, temporal discounting, episodicity, and performance
    Knox, W. Bradley
    Stone, Peter
    [J]. ARTIFICIAL INTELLIGENCE, 2015, 225 : 24 - 50
  • [64] KOCHANSKA G, 1995, CHILD DEV, V66, P236, DOI 10.2307/1131203
  • [65] Kochanska G, 1995, CHILD DEV, V66, P1752, DOI 10.2307/1131908
  • [66] Preschoolers mistrust ignorant and inaccurate speakers
    Koenig, MA
    Harris, PL
    [J]. CHILD DEVELOPMENT, 2005, 76 (06) : 1261 - 1277
  • [67] Theory of Mind: A Neural Prediction Problem
    Koster-Hale, Jorie
    Saxe, Rebecca
    [J]. NEURON, 2013, 79 (05) : 836 - 848
  • [68] Flexible shaping: How learning in small steps helps
    Krueger, Kai A.
    Dayan, Peter
    [J]. COGNITION, 2009, 110 (03) : 380 - 394
  • [69] What evolves in the evolution of social learning?
    Leadbeater, E.
    [J]. JOURNAL OF ZOOLOGY, 2015, 295 (01) : 4 - 11
  • [70] Neural Basis of Reinforcement Learning and Decision Making
    Lee, Daeyeol
    Seo, Hyojung
    Jung, Min Whan
    [J]. ANNUAL REVIEW OF NEUROSCIENCE, VOL 35, 2012, 35 : 287 - 308