Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

被引:45256
作者
Ren, Shaoqing [1 ]
He, Kaiming [2 ]
Girshick, Ross [3 ]
Sun, Jian [2 ]
机构
[1] Univ Sci & Technol China, Hefei 230026, Anhui, Peoples R China
[2] Microsoft Res, Visual Comp Grp, Beijing 100080, Peoples R China
[3] Facebook AI Res, Seattle, WA 98109 USA
关键词
Object detection; region proposal; convolutional neural network;
D O I
10.1109/TPAMI.2016.2577031
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
State-of-the-art object detection networks depend on region proposal algorithms to hypothesize object locations. Advances like SPPnet [1] and Fast R-CNN [2] have reduced the running time of these detection networks, exposing region proposal computation as a bottleneck. In this work, we introduce a Region Proposal Network (RPN) that shares full-image convolutional features with the detection network, thus enabling nearly cost-free region proposals. An RPN is a fully convolutional network that simultaneously predicts object bounds and objectness scores at each position. The RPN is trained end-to-end to generate high-quality region proposals, which are used by Fast R-CNN for detection. We further merge RPN and Fast R-CNN into a single network by sharing their convolutional features-using the recently popular terminology of neural networks with 'attention' mechanisms, the RPN component tells the unified network where to look. For the very deep VGG-16 model [3], our detection system has a frame rate of 5 fps (including all steps) on a GPU, while achieving state-of-the-art object detection accuracy on PASCAL VOC 2007, 2012, and MS COCO datasets with only 300 proposals per image. In ILSVRC and COCO 2015 competitions, Faster R-CNN and RPN are the foundations of the 1st-place winning entries in several tracks. Code has been made publicly available.
引用
收藏
页码:1137 / 1149
页数:13
相关论文
共 40 条
  • [11] [Anonymous], 2015, ARXIV151102300
  • [12] [Anonymous], ARXIV151104003
  • [13] [Anonymous], 2013, Advances in Neural Information Processing Systems
  • [14] [Anonymous], P BRIT MACH VIS C BM
  • [15] [Anonymous], ARXIV151107131
  • [16] [Anonymous], 2015, ARXIV14121441V1
  • [17] [Anonymous], ARXIV151204412
  • [18] Multiscale Combinatorial Grouping
    Arbelaez, Pablo
    Pont-Tuset, Jordi
    Barron, Jonathan T.
    Marques, Ferran
    Malik, Jitendra
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 328 - 335
  • [19] CPMC: Automatic Object Segmentation Using Constrained Parametric Min-Cuts
    Carreira, Joao
    Sminchisescu, Cristian
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (07) : 1312 - 1328
  • [20] Chorowski J, 2015, ADV NEUR IN, V28