Why imitate, and if so, how? A boundedly rational approach to multi-armed bandits

被引:464
作者
Schlag, KH [1 ]
机构
[1] Univ Bonn, D-53113 Bonn, Germany
关键词
D O I
10.1006/jeth.1997.2347
中图分类号
F [经济];
学科分类号
02 ;
摘要
Individuals in a finite population repeatedly choose among actions yielding uncertain payoffs. Between choices, each individual observes the action and realized outcome of one other individual. We restrict our search to learning rules with limited memory that increase expected payoffs regardless of the distribution underlying their realizations. It is shown that the rule that outperforms all others is that which imitates the action of an observed individual (whose realized outcome is better than self) with a probability proportional to the difference in these realizations. When each individual uses this best rule, the aggregate population behavior is approximated by the replicator dynamic. (C) 1998 Academic Press.
引用
收藏
页码:130 / 156
页数:27
相关论文
共 26 条
[1]   A SIMPLE-MODEL OF HERD BEHAVIOR [J].
BANERJEE, AV .
QUARTERLY JOURNAL OF ECONOMICS, 1992, 107 (03) :797-817
[2]   MUSICAL CHAIRS - MODELING NOISY EVOLUTION [J].
BINMORE, KG ;
SAMUELSON, L ;
VAUGHAN, R .
GAMES AND ECONOMIC BEHAVIOR, 1995, 11 (01) :1-35
[3]  
BJORNERSTEDT J, 1996, P IEA C, P155
[4]  
BJORNERSTEDT J, 1996, B378 U BONN
[5]  
BORGERS T, 1993, 9319 U COLL LOND
[6]   LAWS OF LARGE NUMBERS FOR DYNAMIC-SYSTEMS WITH RANDOMLY MATCHED INDIVIDUALS [J].
BOYLAN, RT .
JOURNAL OF ECONOMIC THEORY, 1992, 57 (02) :473-504
[7]  
CABRALEES A, 1993, UNPUB STOCHASTIC REP
[8]  
EASLEY D, 1995, UNPUB CHOICE BELIEFS
[9]   WORD-OF-MOUTH COMMUNICATION AND SOCIAL-LEARNING [J].
ELLISON, G ;
FUDENBERG, D .
QUARTERLY JOURNAL OF ECONOMICS, 1995, 110 (01) :93-125
[10]   EVOLUTIONARY GAMES IN ECONOMICS [J].
FRIEDMAN, D .
ECONOMETRICA, 1991, 59 (03) :637-666