Rapid biologically-inspired scene classification using features shared with visual attention

被引:390
作者
Siagian, Christian [1 ]
Itti, Laurent [1 ]
机构
[1] Univ So Calif, Dept Comp Sci, Los Angeles, CA 90089 USA
基金
美国国家科学基金会;
关键词
gist of a scene; saliency; scene recognition; computational neuroscience; image classification; image statistics; robot vision; robot localization;
D O I
10.1109/TPAMI.2007.40
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe and validate a simple context-based scene recognition algorithm for mobile robotics applications. The system can differentiate outdoor scenes from various sites on a college campus using a multiscale set of early-visual features, which capture the "gist" of the scene into a low-dimensional signature vector. Distinct from previous approaches, the algorithm presents the advantage of being biologically plausible and of having low-computational complexity, sharing its low-level features with a model for visual attention that may operate concurrently on a robot. We compare classification accuracy using scenes filmed at three outdoor sites on campus (13,965 to 34,711 frames per site). Dividing each site into nine segments, we obtain segment classification rates between 84.21 percent and 88.62 percent. Combining scenes from all sites (75,073 frames in total) yields 86.45 percent correct classification, demonstrating the generalization and scalability of the approach.
引用
收藏
页码:300 / 312
页数:13
相关论文
共 46 条
[1]  
ABE Y, 1999, P IEEE INT C ROB AUT, V20, P1299
[2]   Robot steering with spectral image information [J].
Ackerman, C ;
Itti, L .
IEEE TRANSACTIONS ON ROBOTICS, 2005, 21 (02) :247-251
[3]  
[Anonymous], 2000, THESIS CALTECH PASAD
[4]   A comparison of computational color constancy algorithms - Part I: Methodology and experiments with synthesized data [J].
Barnard, K ;
Cardei, V ;
Funt, B .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2002, 11 (09) :972-984
[5]   A comparison of computational color constancy algorithms - Part II: Experiments with image data [J].
Barnard, K ;
Martin, L ;
Coath, A ;
Funt, B .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2002, 11 (09) :985-996
[6]   DO BACKGROUND DEPTH GRADIENTS FACILITATE OBJECT IDENTIFICATION [J].
BIEDERMAN, I .
PERCEPTION, 1981, 10 (05) :573-578
[7]   DRIFT-BALANCED RANDOM STIMULI - A GENERAL BASIS FOR STUDYING NON-FOURIER MOTION PERCEPTION [J].
CHUBB, C ;
SPERLING, G .
JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 1988, 5 (11) :1986-2007
[8]   The parahippocampal place area: Recognition, navigation, or encoding? [J].
Epstein, R ;
Harris, A ;
Stanley, D ;
Kanwisher, N .
NEURON, 1999, 23 (01) :115-125
[9]  
FINLAYSON GD, 1998, P 5 EUR C COMP VIS, P475
[10]  
Fox D., 1999, P 16 NAT C ART INT J