A framework for visual-context-aware object detection in still images

被引：21

作者：

Perko, Roland ^{[1
]}

Leonardis, Ales ^{[1
]}

机构：

[1] Univ Ljubljana, Fac Comp & Informat Sci, Ljubljana 1001, Slovenia

来源：

COMPUTER VISION AND IMAGE UNDERSTANDING | 2010年 / 114卷 / 06期

关键词：

Visual context; Object detection; Context integration; ATTENTION; RECOGNITION; SCENES;

D O I：

10.1016/j.cviu.2010.03.005

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Visual context provides cues about an object's presence, position and size within the observed scene, which should be used to increase the performance of object detection techniques. However, in computer vision, object detectors typically ignore this information. We therefore present a framework for visual-context-aware object detection. Methods for extracting visual contextual information from still images are proposed, which are then used to calculate a prior for object detection. The concept is based on a sparse coding of contextual features, which are based on geometry and texture. In addition, bottom-up saliency and object co-occurrences are exploited, to define auxiliary visual context. To integrate the individual contextual cues with a local appearance-based object detector, a fully probabilistic framework is established. In contrast to other methods, our integration is based on modeling the underlying conditional probabilities between the different cues, which is done via kernel density estimation. This integration is a crucial part of the framework which is demonstrated within the detailed evaluation. Our method is evaluated using a novel demanding image data set and compared to a state-of-the-art method for context-aware object detection. An in-depth analysis is given discussing the contributions of the individual contextual cues and the limitations of visual context for object detection. (C) 2010 Elsevier Inc. All rights reserved.

引用

页码：700 / 711

页数：12

共 53 条

[1] The parahippocampal cortex mediates spatial and nonspatial associations [J].

Aminoff, E. ;

Gronau, N. ;

Bar, M. .

CEREBRAL CORTEX, 2007, 17 (07) :1493-1503

[2]

[Anonymous], 1994, Kernel smoothing

[3]

[Anonymous], P IEEE INT C COMP VI

[4] Visual objects in context [J].

Bar, M .

NATURE REVIEWS NEUROSCIENCE, 2004, 5 (08) :617-629

[5] PERCEIVING REAL-WORLD SCENES [J].

BIEDERMA.I .

SCIENCE, 1972, 177 (4043) :77-&

[6]

Biederman I., 1981, PERCEPTUAL ORG, P213, DOI [10.4324/9781315512372-8, DOI 10.4324/9781315512372-8]

[7]

Bileschi S.M., 2006, StreetScenes : towards scene understanding in still images

[8] Blobworld: Image segmentation using expectation-maximization and its application to image querying [J].

Carson, C ;

Belongie, S ;

Greenspan, H ;

Malik, J .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (08) :1026-1038

[9] Histograms of oriented gradients for human detection [J].

Dalal, N ;

Triggs, B .

2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893

[10]

DIVVALA K, 2009, P C COMP VIS PATT RE

← 1 2 3 4 5 6 →