Q-learning agents in a Cournot oligopoly model

被引:65
作者
Waltman, Ludo [1 ]
Kaymak, Uzay [1 ]
机构
[1] Erasmus Univ, Erasmus Sch Econ, Inst Econometr, NL-3000 DR Rotterdam, Netherlands
关键词
Collusion; Cournot oligopoly; Q-learning; Reinforcement learning;
D O I
10.1016/j.jedc.2008.01.003
中图分类号
F [经济];
学科分类号
02 ;
摘要
Q-learning is a reinforcement learning model from the field of artificial intelligence. We study the use of Q-learning for modeling the learning behavior of firms in repeated Cournot oligopoly games. Based on computer simulations, we show that Q-learning firms generally learn to collude with each other, although full collusion usually does not emerge. We also present some analytical results. These results provide insight into the underlying mechanism that causes collusive behavior to emerge. Q-learning is one of the few learning models available that can explain the emergence of collusive behavior in settings in which there is no punishment mechanism and no possibility for explicit communication between firms. (C) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:3275 / 3293
页数:19
相关论文
共 34 条
[1]   Cournot versus Walras in dynamic oligopolies with memory [J].
Alós-Ferrer, C .
INTERNATIONAL JOURNAL OF INDUSTRIAL ORGANIZATION, 2004, 22 (02) :193-217
[2]   GENETIC ALGORITHM LEARNING AND THE COBWEB MODEL [J].
ARIFOVIC, J .
JOURNAL OF ECONOMIC DYNAMICS & CONTROL, 1994, 18 (01) :3-28
[3]   Reinforcement learning rules in a repeated game [J].
Bell A.M. .
Computational Economics, 2001, 18 (01) :89-110
[4]  
BERGIN J, 2005, 1042 QUEENS EC DEP
[5]  
Brenner T, 2006, HANDB ECON, V13, P895
[6]   Rational route to randomness [J].
Brock, WA ;
Hommes, CH .
ECONOMETRICA, 1997, 65 (05) :1059-1095
[7]   Experience-weighted attraction learning in normal form games [J].
Camerer, C ;
Ho, TH .
ECONOMETRICA, 1999, 67 (04) :827-874
[8]   Keeping up with the Joneses: competition and the evolution of collusion [J].
Dixon, HD .
JOURNAL OF ECONOMIC BEHAVIOR & ORGANIZATION, 2000, 43 (02) :223-238
[9]   Endogenous fluctuations under evolutionary pressure in Cournot competition [J].
Droste, E ;
Hommes, C ;
Tuinstra, J .
GAMES AND ECONOMIC BEHAVIOR, 2002, 40 (02) :232-269
[10]  
Duffy J, 2006, HANDB ECON, V13, P949