Object detection via a multi-region & semantic segmentation-aware CNN model

被引:527
作者
Gidaris, Spyros [1 ]
Komodakis, Nikos [1 ]
机构
[1] Univ Paris Est, Ecole Ponts ParisTech, Paris, France
来源
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2015年
关键词
D O I
10.1109/ICCV.2015.135
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose an object detection system that relies on a multi-region deep convolutional neural network (CNN) that also encodes semantic segmentation-aware features. The resulting CNN-based representation aims at capturing a diverse set of discriminative appearance factors and exhibits localization sensitivity that is essential for accurate object localization. We exploit the above properties of our recognition module by integrating it on an iterative localization mechanism that alternates between scoring a box proposal and refining its location with a deep CNN regression model. Thanks to the efficient use of our modules, we detect objects with very high localization accuracy. On the detection challenges of PASCAL VOC2007 and PASCAL VOC2012 we achieve mAP of 78.2% and 73.9% correspondingly, surpassing any other published work by a significant margin.
引用
收藏
页码:1134 / 1142
页数:9
相关论文
共 31 条
[1]  
[Anonymous], COMP VIS PATT REC CV
[2]  
[Anonymous], PATTERN ANAL MACHINE
[3]  
[Anonymous], 2014, COMP VIS PATT REC CV
[4]  
[Anonymous], 2014, ARXIV14117714
[5]  
[Anonymous], COMP VIS 2009 IEEE 1
[6]  
[Anonymous], 2006, SCIENCE
[7]  
[Anonymous], 2005, 2005 IEEE COMP SOC C, DOI DOI 10.1109/CVPR.2005.177
[8]  
[Anonymous], 2014, CORR
[9]  
[Anonymous], 2014, COMPUTER VISION ECCV
[10]  
[Anonymous], COMPUTER VISION ECCV