Salient object detection via multi-scale attention CNN

Cited by: 76
Authors
Ji, Yuzhu [1]
Zhang, Haijun [1]
Wu, Q. M. Jonathan [2]
Affiliations
[1] Harbin Institute of Technology, Shenzhen Graduate School, Department of Computer Science, Shenzhen, People's Republic of China
[2] University of Windsor, Department of Electrical and Computer Engineering, Windsor, ON, Canada
Keywords
Saliency detection; Encoder-decoder model; Multi-scale attention CNN
DOI
10.1016/j.neucom.2018.09.061
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification code
140502 [Artificial intelligence]
Abstract
Fully convolutional network (FCN) based semantic segmentation models have inspired most recent work on salient object detection. However, without a mechanism for summarizing context information, the prediction accuracy of the final saliency map can degrade. Moreover, the downsampling operations in FCN-based models discard information, so the final saliency map loses details such as the edges of the salient object. In this paper, we propose a novel deep convolutional neural network (CNN) that introduces a spatial and channel-wise attention layer into a multi-scale encoder-decoder framework. The attention CNN layer aligns the context information between the feature maps at different scales and the final prediction of the saliency map. In addition, a structure with side outputs at multiple scales is designed to produce more accurate, edge-preserving saliency maps by integrating the saliency maps from the different scales. Experimental results demonstrate the effectiveness of the proposed model on several benchmark datasets. Additional experiments also support the feasibility of applying the trained saliency model to other object-driven vision tasks as an efficient preprocessing step. © 2018 Elsevier B.V. All rights reserved.
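The two ideas named in the abstract, channel-wise plus spatial attention and the fusion of side outputs at several scales, can be illustrated with a minimal NumPy sketch. This is not the authors' network: the gates below have no learned weights, and the function names, the mean-pooling gates, the nearest-neighbour upsampling and the plain averaging are all assumptions made for illustration only.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def channel_attention(feat):
    """Reweight each channel of a (C, H, W) map by a gate computed from
    its global average (assumption: no learned parameters)."""
    gate = sigmoid(feat.mean(axis=(1, 2)))  # shape (C,)
    return feat * gate[:, None, None]


def spatial_attention(feat):
    """Reweight each location of a (C, H, W) map by a gate computed from
    the mean over channels (assumption: no learned parameters)."""
    gate = sigmoid(feat.mean(axis=0))  # shape (H, W)
    return feat * gate[None, :, :]


def upsample_nearest(sal, factor):
    """Nearest-neighbour upsampling of a 2-D map by an integer factor."""
    return np.repeat(np.repeat(sal, factor, axis=0), factor, axis=1)


def fuse_side_outputs(side_maps, out_size):
    """Bring square side outputs to a common size and average them
    (hypothetical fusion rule; the paper's rule may differ)."""
    resized = [upsample_nearest(m, out_size // m.shape[0]) for m in side_maps]
    return np.mean(resized, axis=0)


# Toy feature maps at three scales standing in for decoder features.
rng = np.random.default_rng(0)
feats = [rng.standard_normal((4, s, s)) for s in (8, 16, 32)]

# One saliency map per scale: attend, collapse channels, squash to (0, 1).
side_maps = [
    sigmoid(spatial_attention(channel_attention(f)).mean(axis=0))
    for f in feats
]
saliency = fuse_side_outputs(side_maps, 32)
```

In a trained model the gates would be produced by learned layers and the fusion weights would typically be learned as well; the sketch only shows how the tensor shapes flow from per-scale features to one full-resolution map.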
Pages: 130-140
Page count: 11