LocNet: Improving Localization Accuracy for Object Detection

被引:102
作者
Gidaris, Spyros [1 ]
Komodakis, Nikos [1 ]
机构
[1] Univ Paris Est, Ecole Ponts ParisTech, Paris, France
来源
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2016年
关键词
D O I
10.1109/CVPR.2016.92
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel object localization methodology with the purpose of boosting the localization accuracy of state-of-the-art object detection systems. Our model, given a search region, aims at returning the bounding box of an object of interest inside this region. To accomplish its goal, it relies on assigning conditional probabilities to each row and column of this region, where these probabilities provide useful information regarding the location of the boundaries of the object inside the search region and allow the accurate inference of the object bounding box under a simple probabilistic framework. For implementing our localization model, we make use of a convolutional neural network architecture that is properly adapted for this task, called LocNet. We show experimentally that LocNet achieves a very significant improvement on the mAP for high IoU thresholds on PASCAL VOC2007 test set and that it can be very easily coupled with recent state-of-the-art object detection systems, helping them to boost their performance. Finally, we demonstrate that our detection approach can achieve high detection accuracy even when it is given as input a set of sliding windows, thus proving that it is independent of box proposal methods.
引用
收藏
页码:789 / 798
页数:10
相关论文
共 40 条
  • [1] [Anonymous], PATTERN ANAL MACHINE
  • [2] [Anonymous], 2014, COMP VIS PATT REC CV
  • [3] [Anonymous], P BRIT MACH VIS C BM
  • [4] [Anonymous], P NEURAL INFORM PROC
  • [5] [Anonymous], COMP VIS PATT REC CV
  • [6] [Anonymous], 2015, P IEEE INT C COMP VI
  • [7] [Anonymous], PATTERN ANAL MACHINE
  • [8] [Anonymous], 2013, ADV NEURAL INFORM PR
  • [9] [Anonymous], 2015, ADV NEURAL INFORM PR
  • [10] [Anonymous], 2015, P IEEE C COMP VIS PA