Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks

被引:916
作者
Bell, Sean [1 ]
Zitnick, C. Lawrence [2 ,3 ]
Bala, Kavita [1 ]
Girshick, Ross [3 ]
机构
[1] Cornell Univ, Ithaca, NY 14853 USA
[2] Microsoft Res, Redmond, WA USA
[3] Facebook AI Res, Menlo Pk, CA USA
来源
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2016年
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
10.1109/CVPR.2016.314
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is well known that contextual and multi-scale representations are important for accurate visual recognition. In this paper we present the Inside-Outside Net (ION), an object detector that exploits information both inside and outside the region of interest. Contextual information outside the region of interest is integrated using spatial recurrent neural networks. Inside, we use skip pooling to extract information at multiple scales and levels of abstraction. Through extensive experiments we evaluate the design space and provide readers with an overview of what tricks of the trade are important. ION improves state-of-the-art on PASCAL VOC 2012 object detection from 73.9% to 77.9% mAP. On the new and more challenging MS COCO dataset, we improve state-of-the-art from 19.7% to 33.1% mAP. In the 2015 MS COCO Detection Challenge, our ION model won "Best Student Entry" and finished 3rd place overall. As intuition suggests, our detection results provide strong evidence that context and multi-scale representations improve small object detection.
引用
收藏
页码:2874 / 2883
页数:10
相关论文
共 43 条
  • [1] Agrawal P, 2014, LECT NOTES COMPUT SC, V8695, P329, DOI 10.1007/978-3-319-10584-0_22
  • [2] [Anonymous], 2010, AISTATS
  • [3] [Anonymous], 2015, PROC CVPR IEEE, DOI 10.1109/CVPR.2015.7298642
  • [4] [Anonymous], 2008, Advances in neural information processing systems, DOI DOI 10.1007/978-1-4471-4072-6_12
  • [5] [Anonymous], 1994, IEE P VIS IM SIGN PR
  • [6] [Anonymous], 2015, ARXIV150205082
  • [7] [Anonymous], 2014, ECCV
  • [8] [Anonymous], 2015, ARXIV150300848
  • [9] Behnke S., 2003, THESIS
  • [10] Byeon W., 2015, CVPR