BING: Binarized Normed Gradients for Objectness Estimation at 300fps

被引:836
作者
Cheng, Ming-Ming [1 ]
Zhang, Ziming [2 ]
Lin, Wen-Yan
Torr, Philip [1 ]
机构
[1] Univ Oxford, Oxford OX1 2JD, England
[2] Boston Univ, Boston, MA 02215 USA
来源
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2014年
基金
英国工程与自然科学研究理事会;
关键词
VISUAL-ATTENTION; SEARCH;
D O I
10.1109/CVPR.2014.414
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Training a generic objectness measure to produce a small set of candidate object windows, has been shown to speed up the classical sliding window object detection paradigm. We observe that generic objects with well-defined closed boundary can be discriminated by looking at the norm of gradients, with a suitable resizing of their corresponding image windows in to a small fixed size. Based on this observation and computational reasons, we propose to resize the window to 8 x 8 and use the norm of the gradients as a simple 64D feature to describe it, for explicitly training a generic objectness measure. We further show how the binarized version of this feature, namely binarized normed gradients (BING), can be used for efficient objectness estimation, which requires only a few atomic operations (e.g. ADD, BITWISE SHIFT, etc.). Experiments on the challenging PASCAL VOC 2007 dataset show that our method efficiently (300fps on a single laptop CPU) generates a small set of category-independent, high quality object windows, yielding 96.2% object detection rate (DR) with 1,000 proposals. Increasing the numbers of proposals and color spaces for computing BING features, our performance can be further improved to 99.5% DR.
引用
收藏
页码:3286 / 3293
页数:8
相关论文
共 60 条
  • [11] [Anonymous], 2009, ICCV
  • [12] [Anonymous], 2013, IJCV
  • [13] [Anonymous], IEEE TPAMI IN PRESS
  • [14] [Anonymous], CVPR
  • [15] [Anonymous], IEEE FG
  • [16] [Anonymous], 2007, PROC IEEE C COMPUT V, DOI 10.1109/CVPR.2007.383267
  • [17] [Anonymous], 2012, ECCV
  • [18] [Anonymous], TECHNICAL REPORT
  • [19] [Anonymous], TECHNICAL REPORT
  • [20] [Anonymous], 2014, CVPR