Exploiting Hierarchical Context on a Large Database of Object Categories

Cited by: 151
Authors
Choi, Myung Jin [1]
Lim, Joseph J. [1]
Torralba, Antonio [1]
Willsky, Alan S. [1]
Affiliation
[1] MIT, Cambridge, MA 02139 USA
Source
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2010
DOI
10.1109/CVPR.2010.5540221
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
There has been growing interest in exploiting contextual information in addition to local features to detect and localize multiple object categories in an image. Context models can efficiently rule out some unlikely combinations or locations of objects and guide detectors to produce a semantically coherent interpretation of a scene. However, the performance benefit from using context models has been limited because most of these methods were tested on datasets with only a few object categories, in which most images contain only one or two object categories. In this paper, we introduce a new dataset with images that contain many instances of different object categories and propose an efficient model that captures the contextual information among more than a hundred object categories. We show that our context model can be applied to scene understanding tasks that local detectors alone cannot solve.
Pages: 129-136
Page count: 8