BING: Binarized Normed Gradients for Objectness Estimation at 300fps

被引:836
作者
Cheng, Ming-Ming [1 ]
Zhang, Ziming [2 ]
Lin, Wen-Yan
Torr, Philip [1 ]
机构
[1] Univ Oxford, Oxford OX1 2JD, England
[2] Boston Univ, Boston, MA 02215 USA
来源
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2014年
基金
英国工程与自然科学研究理事会;
关键词
VISUAL-ATTENTION; SEARCH;
D O I
10.1109/CVPR.2014.414
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Training a generic objectness measure to produce a small set of candidate object windows, has been shown to speed up the classical sliding window object detection paradigm. We observe that generic objects with well-defined closed boundary can be discriminated by looking at the norm of gradients, with a suitable resizing of their corresponding image windows in to a small fixed size. Based on this observation and computational reasons, we propose to resize the window to 8 x 8 and use the norm of the gradients as a simple 64D feature to describe it, for explicitly training a generic objectness measure. We further show how the binarized version of this feature, namely binarized normed gradients (BING), can be used for efficient objectness estimation, which requires only a few atomic operations (e.g. ADD, BITWISE SHIFT, etc.). Experiments on the challenging PASCAL VOC 2007 dataset show that our method efficiently (300fps on a single laptop CPU) generates a small set of category-independent, high quality object windows, yielding 96.2% object detection rate (DR) with 1,000 proposals. Increasing the numbers of proposals and color spaces for computing BING features, our performance can be further improved to 99.5% DR.
引用
收藏
页码:3286 / 3293
页数:8
相关论文
共 60 条
  • [31] Efficient Salient Region Detection with Soft Image Abstraction
    Cheng, Ming-Ming
    Warrell, Jonathan
    Lin, Wen-Yan
    Zheng, Shuai
    Vineet, Vibhav
    Crook, Nigel
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 1529 - 1536
  • [32] SalientShape: group saliency in image collections
    Cheng, Ming-Ming
    Mitra, Niloy J.
    Huang, Xiaolei
    Hu, Shi-Min
    [J]. VISUAL COMPUTER, 2014, 30 (04) : 443 - 453
  • [33] RepFinder: Finding Approximately Repeated Scene Elements for Image Editing
    Cheng, Ming-Ming
    Zhang, Fang-Lue
    Mitra, Niloy J.
    Huang, Xiaolei
    Hu, Shi-Min
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2010, 29 (04):
  • [34] Semantic Colorization with Internet Images
    Chia, Alex Yong-Sang
    Zhuo, Shaojie
    Gupta, Raj Kumar
    Tai, Yu-Wing
    Cho, Siu-Yeung
    Tan, Ping
    Lin, Stephen
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2011, 30 (06):
  • [35] Histograms of oriented gradients for human detection
    Dalal, N
    Triggs, B
    [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 886 - 893
  • [36] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
  • [37] Endres I, 2010, LECT NOTES COMPUT SC, V6315, P575, DOI 10.1007/978-3-642-15555-0_42
  • [38] Everingham M., The PASCAL Visual Object Classes challenge 2010 (VOC2010) Development Kit
  • [39] Fan RE, 2008, J MACH LEARN RES, V9, P1871
  • [40] Object Detection with Discriminatively Trained Part-Based Models
    Felzenszwalb, Pedro F.
    Girshick, Ross B.
    McAllester, David
    Ramanan, Deva
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (09) : 1627 - 1645