BING: Binarized Normed Gradients for Objectness Estimation at 300fps

被引:836
作者
Cheng, Ming-Ming [1 ]
Zhang, Ziming [2 ]
Lin, Wen-Yan
Torr, Philip [1 ]
机构
[1] Univ Oxford, Oxford OX1 2JD, England
[2] Boston Univ, Boston, MA 02215 USA
来源
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2014年
基金
英国工程与自然科学研究理事会;
关键词
VISUAL-ATTENTION; SEARCH;
D O I
10.1109/CVPR.2014.414
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Training a generic objectness measure to produce a small set of candidate object windows, has been shown to speed up the classical sliding window object detection paradigm. We observe that generic objects with well-defined closed boundary can be discriminated by looking at the norm of gradients, with a suitable resizing of their corresponding image windows in to a small fixed size. Based on this observation and computational reasons, we propose to resize the window to 8 x 8 and use the norm of the gradients as a simple 64D feature to describe it, for explicitly training a generic objectness measure. We further show how the binarized version of this feature, namely binarized normed gradients (BING), can be used for efficient objectness estimation, which requires only a few atomic operations (e.g. ADD, BITWISE SHIFT, etc.). Experiments on the challenging PASCAL VOC 2007 dataset show that our method efficiently (300fps on a single laptop CPU) generates a small set of category-independent, high quality object windows, yielding 96.2% object detection rate (DR) with 1,000 proposals. Increasing the numbers of proposals and color spaces for computing BING features, our performance can be further improved to 99.5% DR.
引用
收藏
页码:3286 / 3293
页数:8
相关论文
共 60 条
  • [1] Alexe B, 2010, PROC CVPR IEEE, P73, DOI 10.1109/CVPR.2010.5540226
  • [2] [Anonymous], 2003, ACM MULTIMEDIA
  • [3] [Anonymous], 1995, ANN REV NEUROSCIENCE
  • [4] [Anonymous], 2013, IEEE TVCG
  • [5] [Anonymous], 2012, IEEE TPAMI
  • [6] [Anonymous], IEEE TPAMI
  • [7] [Anonymous], 2013, CVPR
  • [8] [Anonymous], ACM TOG
  • [9] [Anonymous], VIS COMPUT
  • [10] [Anonymous], 2012, IEEE TIP