Handling communication restrictions and team formation in congestion games

被引:11
作者
Agogino, AK [1 ]
Tumer, K
机构
[1] Univ Calif Santa Cruz, Santa Cruz, CA 95064 USA
[2] NASA, Ames Res Ctr, Moffett Field, CA 94035 USA
关键词
reinforcement learning; MAS; teams; communication;
D O I
10.1007/s10458-006-6105-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
There are many domains in which a multi-agent system needs to maximize a "system utility" function which rates the performance of the entire system, while subject to communication restrictions among the agents. Such communication restrictions make it difficult for agents that take actions to optimize their own "private" utilities to also help optimize the system utility. In this article we show how previously introduced utilities that promote coordination among agents can be modified to be effective in domains with communication restrictions. The modified utilities provide performance improvements of up to 75 over previously used utilities in congestion games (i.e., games where the system utility depends solely on the number of agents choosing a particular action). In addition, we show that in the presence of severe communication restrictions, team formation for the purpose of information sharing among agents leads to an additional 25 improvement in system utility. Finally, we show that agents' private utilities and team sizes can be manipulated to form the best compromise between how "aligned" an agent's utility is with the system utility and how easily an agent can learn that utility.
引用
收藏
页码:97 / 115
页数:19
相关论文
共 40 条
[1]  
Agogino A, 2004, LECT NOTES COMPUT SC, V3102, P1
[2]  
AGOGINO A, 2005, P 4 INT JOINT C AUT
[3]  
Agogino Adrian, 2005, AAMAS 05 WORKSH COOR
[4]  
[Anonymous], 2001, P 5 INT C AUTONOMOUS
[5]  
[Anonymous], IEEE INT C ROB AUT M
[6]  
ARTHUR WB, 1994, AM ECON REV, V84, P406
[7]  
Balch T., 1994, AUTONOMOUS ROBOTS, V1, P1
[8]  
BLUMROSEN L, 2002, 43 ANN IEEE S FDN CO
[9]  
BOUTILIER C, 1996, P 6 C THEOR ASP RAT
[10]  
BROOKS CH, 2003, AUTON AGENT MULTI-AG, P145