Information-Driven Search Strategies in the Board Game of CLUE®

被引:16
作者
Ferrari, Silvia [1 ]
Cai, Chenghui [2 ]
机构
[1] Duke Univ, Dept Mech Engn & Mat Sci, Durham, NC 27708 USA
[2] Duke Univ, Dept Elect & Comp Engn, Durham, NC 27708 USA
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 2009年 / 39卷 / 03期
基金
美国国家科学基金会;
关键词
Bayesian networks (BNs); computer game playing; influence diagrams (IDs); label-correcting algorithms; mine hunting; path planning; search theory; sensor planning; value of information;
D O I
10.1109/TSMCB.2008.2007629
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents an information-driven sensor management problem, referred to as treasure hunt, which is relevant to mobile-sensor applications such as mine hunting, monitoring, and surveillance. The objective is to infer a hidden variable or treasure by selecting a sequence of measurements associated with multiple fixed targets distributed in the sensor workspace. The workspace is represented by a connectivity graph, where each node represents a possible sensor deployment, and the arcs represent possible sensor movements. An additive conditional entropy reduction function is presented to efficiently compute the expected benefit of a measurement sequence over time. Then, the optimal treasure hunt strategy is determined by a novel label-correcting algorithm operating on the connectivity graph. The methodology is illustrated through the board game of CLUE (R), which is shown to be a benchmark example of the treasure hunt problem. The game results show that a computer player implementing the strategies developed in this paper outperforms players implementing Bayesian networks, Q-learning, or constraint satisfaction, as well as human players.
引用
收藏
页码:607 / 625
页数:19
相关论文
共 40 条
[1]  
[Anonymous], 1961, Applied Statistical Decision Theory
[2]  
[Anonymous], MATL
[3]  
[Anonymous], 1962, Applied Dynamic Programming
[4]   ROBOT MOTION PLANNING - A DISTRIBUTED REPRESENTATION APPROACH [J].
BARRAQUAND, J ;
LATOMBE, JC .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 1991, 10 (06) :628-649
[5]   A SIMPLE AND FAST LABEL CORRECTING ALGORITHM FOR SHORTEST PATHS [J].
BERTSEKAS, DP .
NETWORKS, 1993, 23 (08) :703-709
[6]  
CAI C, 2008, P INT JOINT C NEUR N, P2347
[7]  
Cai CH, 2006, P AMER CONTR CONF, V1-12, P4350
[8]   Bayesian network modeling of acoustic sensor measurements [J].
Cai, Chenghui ;
Ferrari, Silvia ;
Qian, Ming .
2007 IEEE SENSORS, VOLS 1-3, 2007, :345-348
[9]   Information-Driven Sensor Path Planning by Approximate Cell Decomposition [J].
Cai, Chenghui ;
Ferrari, Silvia .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2009, 39 (03) :672-689
[10]   OPTIMAL SEARCH STRATEGIES IN DYNAMIC HYPOTHESIS-TESTING [J].
CASTANON, DA .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1995, 25 (07) :1130-1138