From T-Mazes to labyrinths: Learning from model-based feedback

Cited by: 78

Authors:

Denrell, J [1]

Fang, C

Levinthal, DA

Affiliations:

[1] Stanford Univ, Grad Sch Business, Stanford, CA 94305 USA

[2] NYU, Stern Sch Business, New York, NY 10012 USA

[3] Univ Penn, Wharton Sch, Dept Management & Econ, Philadelphia, PA 19104 USA

Source:

MANAGEMENT SCIENCE | 2004 / Vol. 50 / No. 10

Keywords:

organizational learning; credit assignment; organizational routines; task interdependency; reinforcement learning

DOI:

10.1287/mnsc.1040.0271

Chinese Library Classification (CLC) number:

C93 [Management science]

Discipline classification codes:

12; 1201; 1202; 120202

Abstract:

Many organizational actions need not have any immediate or direct payoff consequence but set the stage for subsequent actions that bring the organization toward some actual payoff. Learning in such settings poses the challenge of credit assignment (Minsky 1961), that is, how to assign credit for the overall outcome of a sequence of actions to each of the antecedent actions. To explore the process of learning in such contexts, we create a formal model in which the actors develop a mental model of the value of stage-setting actions as a complex problem-solving task is repeated. Partial knowledge, either of particular states in the problem space or inefficient and circuitous routines through the space, is shown to be quite valuable. Because of the interdependence of intelligent action when a sequence of actions must be identified, however, organizational knowledge is relatively fragile. As a consequence, while turnover may stimulate search and have largely benign implications in less interdependent task settings, it is very destructive of the organization's near-term performance when the learning problem requires a complementarity among the actors' knowledge.

Pages: 1366-1378

Number of pages: 13
