Bayesian update of recursive agent models

Cited by: 20
Authors
Gmytrasiewicz, PJ [1]
Noh, SU [1]
Kellogg, T [1]
Affiliation
[1] Univ Texas, Dept Comp Sci & Engn, Arlington, TX 76019 USA
Keywords
Bayesian learning; probabilistic updating; agent models; coordination; air defense; decision theory; multi-agent; artificial intelligence;
DOI
10.1023/A:1008269427670
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
We present a framework for Bayesian updating of beliefs about models of agent(s) based on their observed behavior. We work within the formalism of the Recursive Modeling Method (RMM), which maintains and processes the models an agent may use to interact with other agent(s), the models the agent may think the other agent has of the original agent, the models the other agent may think the agent has, and so on. The beliefs about which model is the correct one are incrementally updated based on the observed behavior of the modeled agent and, as a result, the probability of the model that best predicted the observed behavior is increased. Analogously, the models on deeper levels of modeling can be updated; the models that the agent thinks another agent uses to model the original agent are revised based on how the other agent is expected to observe the original agent's behavior, and so on. We have implemented and tested our method in two domains, and the results show a marked improvement in the quality of interactions with the belief update in both domains.
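
The incremental update described in the abstract can be illustrated with a minimal sketch of a single Bayesian step over a set of candidate agent models. This is not the authors' RMM implementation: the function name `bayes_update`, the model names, and all probabilities below are hypothetical, chosen only to show how the model that best predicted an observed action gains probability.

```python
def bayes_update(prior, likelihoods):
    """Return the posterior over candidate models after one observed action.

    prior:       dict mapping model name -> P(model)
    likelihoods: dict mapping model name -> P(observed action | model)
    All names and numbers here are illustrative assumptions.
    """
    # Bayes' rule: posterior is proportional to prior times likelihood.
    unnormalized = {m: prior[m] * likelihoods[m] for m in prior}
    total = sum(unnormalized.values())
    if total == 0:
        # The action was impossible under every model; keeping the prior
        # unchanged is a design choice of this sketch, not of the paper.
        return dict(prior)
    return {m: p / total for m, p in unnormalized.items()}


# Two hypothetical models of another agent, initially equally likely.
prior = {"model_a": 0.5, "model_b": 0.5}
# model_a predicted the observed action much better than model_b.
likelihoods = {"model_a": 0.8, "model_b": 0.2}

posterior = bayes_update(prior, likelihoods)
# model_a's probability rises from 0.5 to about 0.8.
```

Repeating the call with each new observation, using the previous posterior as the next prior, gives the incremental behavior the abstract describes. Updating deeper levels of modeling would need the nested model structure of RMM, which this sketch does not attempt.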
Pages: 49-69
Number of pages: 21