The Bayesian image retrieval system, PicHunter:: Theory, implementation, and psychophysical experiments

被引:386
作者
Cox, IJ
Miller, ML
Minka, TP
Papathomas, TV
Yianilos, PN
机构
[1] NEC Res Inst, Princeton, NJ 08540 USA
[2] MIT, Media Lab, Cambridge, MA 02139 USA
[3] Rutgers State Univ, Vis Res Lab, Piscataway, NJ 08854 USA
[4] Rutgers State Univ, Dept Biomed Engn, Piscataway, NJ 08854 USA
关键词
Bayesian search; content-based retrieval; digital libraries; image search; relevance feedback;
D O I
10.1109/83.817596
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents the theory, design principles, implementation, and performance results of PicHunter, a prototype content-based image retrieval (CBIR) system that has been developed over the past three years. In addition, this document presents the rationale, design, and results of psychophysical experiments that were conducted to address some key issues that arose during PicHunter's development, The PicHunter project makes four primary contributions to research on content-based image retrieval. First, PicHunter represents a simple instance of a general Bayesian framework we describe for using relevance feedback to direct a search. With an explicit model of what users would do, given what target image they want, PicHunter uses Bayes's rule to predict what is the target they want, given their actions. This is done via a probability distribution over possible image targets, rather than by refining a query. Second, an entropy-minimizing display algorithm is described that attempts to maximize the information obtained from a user at each iteration of the search. Third, PicHunter makes use of bidden annotation rather than a possibly inaccurate/inconsistent annotation structure that the user must learn and make queries in. Finally, PicHunter introduces two experimental paradigms to quantitatively evaluate the performance of the system, and psychophysical experiments are presented that support the theoretical claims.
引用
收藏
页码:20 / 37
页数:18
相关论文
共 83 条
[1]  
AALBERSBERG IJ, 1992, P 15 ANN INT SIGIR D
[2]  
[Anonymous], 1972, THEORY OPTIMAL EXPT
[3]  
BARROS J, 1994, P 23 AIPR WORKSH IM
[4]  
BEATTY M, 1997, IEEE INT C IM PROC
[5]   APPLYING PROBABILITY MEASURES TO ABSTRACT LANGUAGES [J].
BOOTH, TL ;
THOMPSON, RA .
IEEE TRANSACTIONS ON COMPUTERS, 1973, C 22 (05) :442-449
[6]  
Box G.E.P., 1978, STAT EXPERIMENTERS I
[7]  
BOYKOV Y, 1997, P IEEE C COMP VIS PA
[8]   RETRIEVAL OF SIMILAR PICTURES ON PICTORIAL DATABASES [J].
CHANG, CC ;
LEE, SY .
PATTERN RECOGNITION, 1991, 24 (07) :675-680
[9]  
CHUA TS, 1994, P 27 ANN HAW INT C S
[10]  
*COR CORP, 1990, COR STOCK PHOT LIBR