HyperNet: Towards Accurate Region Proposal Generation and Joint Object Detection

Cited by: 681
Authors
Kong, Tao [1, 2]
Yao, Anbang [2 ]
Chen, Yurong [2 ]
Sun, Fuchun [1 ]
Affiliations
[1] Tsinghua Univ, Dept Comp Sci & Technol, Tsinghua Natl Lab Informat Sci & Technol TNList, State Key Lab Intelligent Technol & Syst, Beijing, Peoples R China
[2] Intel Labs China, Beijing, Peoples R China
Source
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2016
Funding
National Natural Science Foundation of China
DOI
10.1109/CVPR.2016.98
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Almost all of the current top-performing object detection networks employ region proposals to guide the search for object instances. State-of-the-art region proposal methods usually need several thousand proposals to get high recall, thus hurting the detection efficiency. Although the latest Region Proposal Network method gets promising detection accuracy with several hundred proposals, it still struggles in small-size object detection and precise localization (e.g., large IoU thresholds), mainly due to the coarseness of its feature maps. In this paper, we present a deep hierarchical network, namely HyperNet, for handling region proposal generation and object detection jointly. Our HyperNet is primarily based on an elaborately designed Hyper Feature which aggregates hierarchical feature maps first and then compresses them into a uniform space. The Hyper Features well incorporate deep but highly semantic, intermediate but really complementary, and shallow but naturally high-resolution features of the image, thus enabling us to construct HyperNet by sharing them both in generating proposals and detecting objects via an end-to-end joint training strategy. For the deep VGG16 model, our method achieves completely leading recall and state-of-the-art object detection accuracy on PASCAL VOC 2007 and 2012 using only 100 proposals per image. It runs with a speed of 5 fps (including all steps) on a GPU, thus having the potential for real-time processing.
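The aggregate-then-compress idea behind the Hyper Feature can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract does not specify how the maps are resampled or compressed, so the 2x2 max pooling, nearest-neighbour upsampling, 1x1 convolution with random weights, and all function names, shapes, and channel counts below are assumptions chosen for the example.

```python
import numpy as np

def max_pool2x(x):
    """Downsample a (C, H, W) map by 2 using 2x2 max pooling (H and W must be even)."""
    c, h, w = x.shape
    return x.reshape(c, h // 2, 2, w // 2, 2).max(axis=(2, 4))

def upsample2x(x):
    """Upsample a (C, H, W) map by 2 using nearest-neighbour repetition."""
    return x.repeat(2, axis=1).repeat(2, axis=2)

def hyper_feature(shallow, middle, deep, weights):
    """Bring three feature maps to the middle resolution, stack them, compress channels.

    `weights` has shape (C_out, C_shallow + C_middle + C_deep) and plays the role
    of a 1x1 convolution (an assumed stand-in for the compression step).
    """
    stacked = np.concatenate(
        [max_pool2x(shallow), middle, upsample2x(deep)], axis=0
    )
    # A 1x1 convolution is a per-pixel linear map over the channel axis.
    return np.einsum("oc,chw->ohw", weights, stacked)

# Hypothetical shapes: shallow maps are high-resolution, deep maps are coarse.
rng = np.random.default_rng(0)
shallow = rng.standard_normal((8, 16, 16))
middle = rng.standard_normal((16, 8, 8))
deep = rng.standard_normal((32, 4, 4))
weights = rng.standard_normal((12, 8 + 16 + 32))

hyper = hyper_feature(shallow, middle, deep, weights)
print(hyper.shape)  # (12, 8, 8)
```

In this sketch the single compressed map `hyper` is what both the proposal stage and the detection stage would read from, which mirrors the sharing described in the abstract.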
Pages: 845-853
Number of pages: 9