From T-Mazes to labyrinths: Learning from model-based feedback

Cited by: 78
Authors
Denrell, J [1]
Fang, C
Levinthal, DA
Affiliations
[1] Stanford Univ, Grad Sch Business, Stanford, CA 94305 USA
[2] NYU, Stern Sch Business, New York, NY 10012 USA
[3] Univ Penn, Wharton Sch, Dept Management & Econ, Philadelphia, PA 19104 USA
Keywords
organizational learning; credit assignment; organizational routines; task interdependency; reinforcement learning
DOI
10.1287/mnsc.1040.0271
Chinese Library Classification (CLC) number
C93 [Management Science]
Discipline classification codes
12; 1201; 1202; 120202
Abstract
Many organizational actions need not have any immediate or direct payoff consequence but set the stage for subsequent actions that bring the organization toward some actual payoff. Learning in such settings poses the challenge of credit assignment (Minsky 1961), that is, how to assign credit for the overall outcome of a sequence of actions to each of the antecedent actions. To explore the process of learning in such contexts, we create a formal model in which the actors develop a mental model of the value of stage-setting actions as a complex problem-solving task is repeated. Partial knowledge, either of particular states in the problem space or inefficient and circuitous routines through the space, is shown to be quite valuable. Because of the interdependence of intelligent action when a sequence of actions must be identified, however, organizational knowledge is relatively fragile. As a consequence, while turnover may stimulate search and have largely benign implications in less interdependent task settings, it is very destructive of the organization's near-term performance when the learning problem requires a complementarity among the actors' knowledge.
Pages: 1366-1378
Number of pages: 13