A reinforcement learning approach to competitive ordering and pricing problem

被引:29
作者
Dogan, Ibrahim [1 ]
Guener, Ali R. [2 ]
机构
[1] Erciyes Univ, Dept Ind Engn, TR-38039 Kayseri, Turkey
[2] Wayne State Univ, Dept Ind & Syst Engn, Detroit, MI 48202 USA
关键词
agent-based simulation; reinforcement learning; pricing; supply chain; DECISIONS; INVENTORY; MANAGEMENT; MODEL;
D O I
10.1111/exsy.12054
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This study analyses simultaneous ordering and pricing decisions for retailers working in a multi-retailer competitive environment for an infinite horizon. Retailers compete for the same market where the market demand is uncertain. The customer selects the winning agent (retailer) in each term on the basis of random utility maximization, which depends primarily on retailer price and random error. The complexity of the problem is increased by competitiveness, necessity for simultaneous decisions and uncertainty in the nature of increases, and is not conducive to examination using standard analytical methods. Therefore, we model the problem using reinforcement learning (RL), which is founded on stochastic dynamic programming and agent-based simulations. We analyse the effects of competitiveness and performance of RL on three different scenarios: a monopolistic case where one retailer employing a RL agent maximizes its profit, a duopolistic case where one retailer employs RL and another utilizes adaptive pricing and ordering policies, and a duopolistic case where both retailers employ RL.
引用
收藏
页码:39 / 48
页数:10
相关论文
共 31 条
[1]  
ABBEEL P, 2007, P ADV NEUR INF PROC
[2]   Pricing and manufacturing decisions when demand is a function of prices in multiple periods [J].
Ahn, Hyun-soo ;
Guemues, Mehmet ;
Kaminsky, Philip .
OPERATIONS RESEARCH, 2007, 55 (06) :1039-1057
[3]  
[Anonymous], 1994, P 11 INT C INT C MAC
[4]  
[Anonymous], 1998, Reinforcement Learning: An Introduction
[5]  
Banerjee B, 2007, 20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P672
[6]   Learning in groups of traffic signals [J].
Bazzan, Ana L. C. ;
de Oliveira, Denise ;
da Silva, Bruno C. .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2010, 23 (04) :560-568
[7]   A reinforcement learning model for supply chain ordering management: An application to the beer game [J].
Chaharsooghi, S. Kamal ;
Heydari, Jafar ;
Zegordi, S. Hessameddin .
DECISION SUPPORT SYSTEMS, 2008, 45 (04) :949-959
[8]  
Chan L., 2004, Handbook of Quantitative Supply Chain Analysis: Modeling in the E- business Era
[9]  
Darken C., 1992, P 1992 IEEE WORKSH P
[10]  
Eliashberg J., 1993, Hand- books in operations research and management science, V5, P827