Simultaneous Detection and Segmentation

被引:754
作者
Hariharan, Bharath [1 ]
Arbelaez, Pablo [1 ]
Girshick, Ross [1 ]
Malik, Jitendra [1 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
来源
COMPUTER VISION - ECCV 2014, PT VII | 2014年 / 8695卷
关键词
detection; segmentation; convolutional networks;
D O I
10.1007/978-3-319-10584-0_20
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We aim to detect all instances of a category in an image and, for each instance, mark the pixels that belong to it. We call this task Simultaneous Detection and Segmentation (SDS). Unlike classical bounding box detection, SDS requires a segmentation and not just a box. Unlike classical semantic segmentation, we require individual object instances. We build on recent work that uses convolutional neural networks to classify category-independent region proposals (R-CNN [16]), introducing a novel architecture tailored for SDS. We then use category-specific, top-down figure-ground predictions to refine our bottom-up proposals. We show a 7 point boost (16% relative) over our baselines on SDS, a 5 point boost (10% relative) over state-of-the-art on semantic segmentation, and state-of-the-art performance in object detection. Finally, we provide diagnostic tools that unpack performance and provide directions for future work.
引用
收藏
页码:297 / 312
页数:16
相关论文
共 32 条
[1]  
[Anonymous], 2012, CVPR
[2]  
[Anonymous], IJCV
[3]  
[Anonymous], 2 INT C LEARN REPR
[4]  
[Anonymous], 2010, ECCV
[5]  
[Anonymous], 1989, NEURAL COMPUTATION
[6]  
[Anonymous], IJCV
[7]  
[Anonymous], 2013, 31 INT C MACH LEARN
[8]  
[Anonymous], TPAMI
[9]  
[Anonymous], 2014, CVPR
[10]  
[Anonymous], CVPR