On adaptation, maximization, and reinforcement learning among cognitive strategies

被引：279

作者：

Erev, I ^{[1
]}

Barron, G

机构：

[1] Technion Israel Inst Technol, Max Werthiemer Minerva Ctr Cognit Studies, Fac Ind Engn & Management, Haifa, Israel

[2] Harvard Univ, Sch Business, Org & Markets Units, Cambridge, MA 02138 USA

来源：

PSYCHOLOGICAL REVIEW | 2005年 / 112卷 / 04期

关键词：

decisions from experience; stickiness effect; case-based reasoning; probability learning; learning in games;

D O I：

10.1037/0033-295X.112.4.912

中图分类号：

B84 [心理学];

学科分类号：

04 ; 0402 ;

摘要：

Analysis of binary choice behavior in iterated tasks with immediate feedback reveals robust deviations from maximization that can be described as indications of 3 effects: (a) a payoff variability effect, in which high payoff variability seems to move choice behavior toward random choice; (b) underweighting of rare events, in which alternatives that yield the best payoffs most of the time are attractive even when they are associated with a lower expected return; and (c) loss aversion, in which alternatives that minimize the probability of losses can be more attractive than those that maximize expected payoffs. The results are closer to probability matching than to maximization. Best approximation is provided with a model of reinforcement learning among cognitive strategies (RELACS). This model captures the 3 deviations, the learning curves, and the effect of information on uncertainty avoidance. It outperforms other models in fitting the data and in predicting behavior in other experiments.

引用

页码：912 / 931

页数：20

共 84 条

[1]

Allais M., 1979, EXPECTED UTILITY HYP, P27, DOI DOI 10.1007/978-94-015-7629-1_2

[2] Small feedback-based decisions and their limited correspondence to description-based decisions [J].

Barron, G ;

Erev, I .

JOURNAL OF BEHAVIORAL DECISION MAKING, 2003, 16 (03) :215-233

[3]

BARRON G, 2000, THESIS TECHNION ISRA

[4] On learning to become a successful loser: A comparison of alternative abstractions of learning processes in the loss domain [J].

Bereby-Meyer, Y ;

Erev, I .

JOURNAL OF MATHEMATICAL PSYCHOLOGY, 1998, 42 (2-3) :266-286

[5]

BEREBYMEYER Y, 1997, THESIS TECHNION ISRA

[6]

Berry D. A., 1985, BANDIT PROBLEMS SEQU

[7]

BLAVATSKYY PR, 2004, THESIS CHARLES U PRA

[8] Model comparisons and model selections based on generalization criterion methodology [J].

Busemeyer, JR ;

Wang, YM .

JOURNAL OF MATHEMATICAL PSYCHOLOGY, 2000, 44 (01) :171-189

[9] RESOURCE-ALLOCATION DECISION-MAKING IN AN UNCERTAIN ENVIRONMENT [J].

BUSEMEYER, JR ;

MYUNG, IJ .

ACTA PSYCHOLOGICA, 1987, 66 (01) :1-19

[10] AN ADAPTIVE APPROACH TO HUMAN DECISION-MAKING - LEARNING-THEORY, DECISION-THEORY, AND HUMAN-PERFORMANCE [J].

BUSEMEYER, JR ;

MYUNG, IJ .

JOURNAL OF EXPERIMENTAL PSYCHOLOGY-GENERAL, 1992, 121 (02) :177-194

← 1 2 3 4 5 6 7 8 9 →