Attention-based dynamic visual search using inner-scene similarity: Algorithms and bounds

被引:9
作者
Avraham, T [1 ]
Lindenbaum, M [1 ]
机构
[1] Technion Israel Inst Technol, Dept Comp Sci, IL-32000 Haifa, Israel
关键词
computer vision; scene analysis; feature representation; similarity measures; performance evaluation of algorithms and systems; object recognition; visual search; attention;
D O I
10.1109/TPAMI.2006.28
中图分类号
TP18 [人工智能理论];
学科分类号
081104 [模式识别与智能系统]; 0812 [计算机科学与技术]; 0835 [软件工程]; 1405 [智能科学与技术];
摘要
A visual search is required when applying a recognition process on a scene containing multiple objects. In such cases, we would like to avoid an exhaustive sequential search. This work proposes a dynamic visual search framework based mainly on inner-scene similarity. Given a number of candidates (e. g., subimages), we hypothesize is that more visually similar candidates are more likely to have the same identity. We use this assumption for determining the order of attention. Both deterministic and stochastic approaches, relying on this hypothesis, are considered. Under the deterministic approach, we suggest a measure similar to Kolmogorov's epsilon-covering that quantifies the difficulty of a search task. We show that this measure bounds the performance of all search algorithms and suggest a simple algorithm that meets this bound. Under the stochastic approach, we model the identity of the candidates as a set of correlated random variables and derive a search procedure based on linear estimation. Several experiments are presented in which the statistical characteristics, search algorithm, and bound are evaluated and verified.
引用
收藏
页码:251 / 264
页数:14
相关论文
共 42 条
[1]
AVRAHAM T, 2005, CIS200302 TECHN COMP
[2]
Pattern rejection [J].
Baker, S ;
Nayar, SK .
1996 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1996, :544-549
[3]
SEGMENTATION AND ESTIMATION OF IMAGE REGION PROPERTIES THROUGH COOPERATIVE HIERARCHIAL COMPUTATION [J].
BURT, PJ ;
HONG, TH ;
ROSENFELD, A .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1981, 11 (12) :802-809
[4]
CALLAGHAN TC, 1988, VISUAL SEARCH, P81
[5]
Blobworld: Image segmentation using expectation-maximization and its application to image querying [J].
Carson, C ;
Belongie, S ;
Greenspan, H ;
Malik, J .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (08) :1026-1038
[6]
The Bayesian image retrieval system, PicHunter:: Theory, implementation, and psychophysical experiments [J].
Cox, IJ ;
Miller, ML ;
Minka, TP ;
Papathomas, TV ;
Yianilos, PN .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2000, 9 (01) :20-37
[7]
Active object recognition integrating attention and viewpoint control [J].
Dickinson, SJ ;
Christensen, HI ;
Tsotsos, JK ;
Olofsson, G .
COMPUTER VISION AND IMAGE UNDERSTANDING, 1997, 67 (03) :239-260
[8]
VISUAL-SEARCH AND STIMULUS SIMILARITY [J].
DUNCAN, J ;
HUMPHREYS, GW .
PSYCHOLOGICAL REVIEW, 1989, 96 (03) :433-458
[9]
A Bayesian approach to unsupervised one-shot learning of object categories [J].
Fei-Fei, L ;
Fergus, R ;
Perona, P .
NINTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS I AND II, PROCEEDINGS, 2003, :1134-1141
[10]
CLUSTERING TO MINIMIZE THE MAXIMUM INTERCLUSTER DISTANCE [J].
GONZALEZ, TF .
THEORETICAL COMPUTER SCIENCE, 1985, 38 (2-3) :293-306