Distributed convergence to Nash equilibria with local utility measurements

被引:32
作者
Arslan, G [1 ]
Shamma, JS [1 ]
机构
[1] Univ Hawaii, Dept Elect Engn, Honolulu, HI 96822 USA
来源
2004 43RD IEEE CONFERENCE ON DECISION AND CONTROL (CDC), VOLS 1-5 | 2004年
关键词
D O I
10.1109/CDC.2004.1430262
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 [计算机科学与技术];
摘要
We consider multiplayer repeated matrix games in which several players seek to increase their individual rewards by updating their strategies based on limited information. One body of work assumes that players can measure the actions of other players, but do not have access to the utility functions of other players. In this case, well known strategy update mechanisms such as Fictitious Play (FP) and Gradient Play (GP) provide convergence to Nash equilibria in certain special classes of games. Recent work by the authors introduced "dynamic" versions of FP and GP, where players use derivative action to process and respond to the information available to them. These mechanisms, called derivative action FP and derivative Action GP, lead to behaviour converging to Nash equilibria in a significantly larger set of games than standard FP and GP provide. In this paper, we consider the case where players do not have access to opposing actions. As before, players do not have access to opposing player utility functions. Furthermore, a player's access to its own utility function is restricted to the measured utility at each round of the repeated game-structural parameters of its own utility remain unknown. Our main result is to show that derivative action FP and GP can be adapted to the utility measurement case to yield the same dynamics (in continuous-time and up to a coordinate transformation) as though players could measure other player actions. The transformation holds for both two-player games as well as in multiplayer games with a specific utility structure. The implication is that many of the stability and convergence properties obtained under derivative action FP and GP can be extended to the utility measurement case.
引用
收藏
页码:1538 / 1543
页数:6
相关论文
共 15 条
[1]
[Anonymous], 1998, Evol. Games Popul. Dyn., DOI DOI 10.1017/CBO9781139173179
[2]
Mixed equilibria and dynamical systems arising from fictitious play in perturbed games [J].
Benaïm, M ;
Hirsch, MW .
GAMES AND ECONOMIC BEHAVIOR, 1999, 29 (1-2) :36-72
[3]
ADAPTATION IN GAMES - 2 SOLUTIONS TO THE CRAWFORD PUZZLE [J].
CONLISK, J .
JOURNAL OF ECONOMIC BEHAVIOR & ORGANIZATION, 1993, 22 (01) :25-50
[4]
LEARNING-BEHAVIOR AND MIXED-STRATEGY NASH EQUILIBRIA [J].
CRAWFORD, VP .
JOURNAL OF ECONOMIC BEHAVIOR & ORGANIZATION, 1985, 6 (01) :69-78
[5]
FUDENBERG D, 1995, J ECON DYN CONTROL, V19, P1065, DOI 10.1016/0165-1889(94)00819-4
[6]
Fudenberg D., 1998, THEORY LEARNING GAME
[7]
Uncoupled dynamics do not lead to Nash equilibrium [J].
Hart, S ;
Mas-Colell, A .
AMERICAN ECONOMIC REVIEW, 2003, 93 (05) :1830-1836
[8]
Kushner H. J., 1997, STOCHASTIC APPROXIMA
[9]
Samuelson L., 1997, EVOLUTIONARY GAMES E
[10]
SHAMMA J, 2003, IN PRESS IEEE T AUTO