BING: Binarized Normed Gradients for Objectness Estimation at 300fps

被引：836

作者：

Cheng, Ming-Ming ^{[1
]}

Zhang, Ziming ^{[2
]}

Lin, Wen-Yan

Torr, Philip ^{[1
]}

机构：

[1] Univ Oxford, Oxford OX1 2JD, England

[2] Boston Univ, Boston, MA 02215 USA

来源：

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2014年

基金：

英国工程与自然科学研究理事会;

关键词：

VISUAL-ATTENTION; SEARCH;

D O I：

10.1109/CVPR.2014.414

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Training a generic objectness measure to produce a small set of candidate object windows, has been shown to speed up the classical sliding window object detection paradigm. We observe that generic objects with well-defined closed boundary can be discriminated by looking at the norm of gradients, with a suitable resizing of their corresponding image windows in to a small fixed size. Based on this observation and computational reasons, we propose to resize the window to 8 x 8 and use the norm of the gradients as a simple 64D feature to describe it, for explicitly training a generic objectness measure. We further show how the binarized version of this feature, namely binarized normed gradients (BING), can be used for efficient objectness estimation, which requires only a few atomic operations (e.g. ADD, BITWISE SHIFT, etc.). Experiments on the challenging PASCAL VOC 2007 dataset show that our method efficiently (300fps on a single laptop CPU) generates a small set of category-independent, high quality object windows, yielding 96.2% object detection rate (DR) with 1,000 proposals. Increasing the numbers of proposals and color spaces for computing BING features, our performance can be further improved to 99.5% DR.

引用

页码：3286 / 3293

页数：8

共 60 条

[31] Efficient Salient Region Detection with Soft Image Abstraction
Cheng, Ming-Ming
Warrell, Jonathan
Lin, Wen-Yan
Zheng, Shuai
Vineet, Vibhav
Crook, Nigel
[J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 1529 - 1536
[32] SalientShape: group saliency in image collections
Cheng, Ming-Ming
Mitra, Niloy J.
Huang, Xiaolei
Hu, Shi-Min
[J]. VISUAL COMPUTER, 2014, 30 (04) : 443 - 453
[33] RepFinder: Finding Approximately Repeated Scene Elements for Image Editing
Cheng, Ming-Ming
Zhang, Fang-Lue
Mitra, Niloy J.
Huang, Xiaolei
Hu, Shi-Min
[J]. ACM TRANSACTIONS ON GRAPHICS, 2010, 29 (04):
[34] Semantic Colorization with Internet Images
Chia, Alex Yong-Sang
Zhuo, Shaojie
Gupta, Raj Kumar
Tai, Yu-Wing
Cho, Siu-Yeung
Tan, Ping
Lin, Stephen
[J]. ACM TRANSACTIONS ON GRAPHICS, 2011, 30 (06):
[35] Histograms of oriented gradients for human detection
Dalal, N
Triggs, B
[J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 886 - 893
[36] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[37] Endres I, 2010, LECT NOTES COMPUT SC, V6315, P575, DOI 10.1007/978-3-642-15555-0_42
[38] Everingham M., The PASCAL Visual Object Classes challenge 2010 (VOC2010) Development Kit
[39] Fan RE, 2008, J MACH LEARN RES, V9, P1871
[40] Object Detection with Discriminatively Trained Part-Based Models
Felzenszwalb, Pedro F.
Girshick, Ross B.
McAllester, David
Ramanan, Deva
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (09) : 1627 - 1645

← 1 2 3 4 5 6 →