Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

被引:45256
作者
Ren, Shaoqing [1 ]
He, Kaiming [2 ]
Girshick, Ross [3 ]
Sun, Jian [2 ]
机构
[1] Univ Sci & Technol China, Hefei 230026, Anhui, Peoples R China
[2] Microsoft Res, Visual Comp Grp, Beijing 100080, Peoples R China
[3] Facebook AI Res, Seattle, WA 98109 USA
关键词
Object detection; region proposal; convolutional neural network;
D O I
10.1109/TPAMI.2016.2577031
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
State-of-the-art object detection networks depend on region proposal algorithms to hypothesize object locations. Advances like SPPnet [1] and Fast R-CNN [2] have reduced the running time of these detection networks, exposing region proposal computation as a bottleneck. In this work, we introduce a Region Proposal Network (RPN) that shares full-image convolutional features with the detection network, thus enabling nearly cost-free region proposals. An RPN is a fully convolutional network that simultaneously predicts object bounds and objectness scores at each position. The RPN is trained end-to-end to generate high-quality region proposals, which are used by Fast R-CNN for detection. We further merge RPN and Fast R-CNN into a single network by sharing their convolutional features-using the recently popular terminology of neural networks with 'attention' mechanisms, the RPN component tells the unified network where to look. For the very deep VGG-16 model [3], our detection system has a frame rate of 5 fps (including all steps) on a GPU, while achieving state-of-the-art object detection accuracy on PASCAL VOC 2007, 2012, and MS COCO datasets with only 300 proposals per image. In ILSVRC and COCO 2015 competitions, Faster R-CNN and RPN are the foundations of the 1st-place winning entries in several tracks. Code has been made publicly available.
引用
收藏
页码:1137 / 1149
页数:13
相关论文
共 40 条
  • [1] Measuring the Objectness of Image Windows
    Alexe, Bogdan
    Deselaers, Thomas
    Ferrari, Vittorio
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (11) : 2189 - 2202
  • [2] [Anonymous], 2015, ARXIV151107571
  • [3] [Anonymous], 2014, P BRIT MACH VIS C NO
  • [4] [Anonymous], ARXIV150505836
  • [5] [Anonymous], 2014, 2 INT C LEARN REPR I
  • [6] [Anonymous], 2015, P 3 INT C LEARN REPR
  • [7] [Anonymous], ARXIV150406066
  • [8] [Anonymous], PROC CVPR IEEE
  • [9] [Anonymous], 2015, PROC CVPR IEEE
  • [10] [Anonymous], IEEE T PATTERN ANAL