Social is special: A normative framework for teaching with and learning from evaluative feedback

被引：47

作者：

Ho, Mark K. ^{[1
]}

MacGlashan, James ^{[2
]}

Littman, Michael L. ^{[2
]}

Cushman, Fiery ^{[3
]}

机构：

[1] Brown Univ, Dept Cognit Linguist & Psychol Sci, Box 1821, Providence, RI 02912 USA

[2] Brown Univ, Dept Comp Sci, 115 Waterman St, Providence, RI 02906 USA

[3] Harvard Univ, Dept Psychol, William James Hall,33 Kirkland St, Cambridge, MA 02138 USA

来源：

COGNITION | 2017年 / 167卷

基金：

美国国家科学基金会;

关键词：

Reward; Punishment; Theory of mind; Social learning; Evaluative feedback; Teaching; PEDAGOGICAL CUES; MATERNAL ENCOURAGEMENT; RATIONAL IMITATION; INFANTS SELECTION; CHILD COMPLIANCE; REINFORCEMENT; PUNISHMENT; EVOLUTION; BEHAVIOR; REWARDS;

D O I：

10.1016/j.cognition.2017.03.006

中图分类号：

B84 [心理学];

学科分类号：

04 ; 0402 ;

摘要：

Humans often attempt to influence one another's behavior using rewards and punishments. How does this work? Psychologists have often assumed that "evaluative feedback" influences behavior via standard learning mechanisms that learn from environmental contingencies. On this view, teaching with evaluative feedback involves leveraging learning systems designed to maximize an organism's positive outcomes. Yet, despite its parsimony, programs of research predicated on this assumption, such as ones in developmental psychology, animal behavior, and human-robot interaction, have had limited success. We offer an explanation by analyzing the logic of evaluative feedback and show that specialized learning mechanisms are uniquely favored in the case of evaluative feedback from a social partner. Specifically, evaluative feedback works best when it is treated as communicating information about the value of an action rather than as a form of reward to be maximized. This account suggests that human learning from evaluative feedback depends on inferences about communicative intent, goals and other mental states much like learning from other sources, such as demonstration, observation and instruction. Because these abilities are especially developed in humans, the present account also explains why evaluative feedback is far more widespread in humans than non-human animals. (C) 2017 Elsevier B.V. All rights reserved.

引用

页码：91 / 106

页数：16

共 118 条

[1] [Anonymous], 2006, AAAI
[2] [Anonymous], BEHAV BRAIN SCI
[3] [Anonymous], 1997, PARENTING CHILDRENS
[4] [Anonymous], 1990, ADAPTIVE CHARACTER T
[5] [Anonymous], 2012, P 11 INT C AUTONOMOU
[6] [Anonymous], 2015, CogSci
[7] [Anonymous], 1982, Visual perception
[8] Aronfreed J., 1968, ONDUCT CONSCIENCE SO
[9] Action understanding as inverse planning
Baker, Chris L.
Saxe, Rebecca
Tenenbaum, Joshua B.
[J]. COGNITION, 2009, 113 (03) : 329 - 349
[10] Infants parse dynamic action
Baldwin, DA
Baird, JA
Saylor, MM
Clark, MA
[J]. CHILD DEVELOPMENT, 2001, 72 (03) : 708 - 717

← 1 2 3 4 5 6 7 8 9 10 →