Visual Saliency Detection Based on Multiscale Deep CNN Features

被引：314

作者：

Li, Guanbin ^{[1
,2
]}

Yu, Yizhou ^{[2
]}

机构：

[1] Sun Yat Sen Univ, Guangzhou 510006, Guangdong, Peoples R China

[2] Univ Hong Kong, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2016年 / 25卷 / 11期

关键词：

Convolutional neural networks; saliency detection; deep contrast feature; OBJECT DETECTION; MODEL;

D O I：

10.1109/TIP.2016.2602079

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Visual saliency is a fundamental problem in both cognitive and computational sciences, including computer vision. In this paper, we discover that a high-quality visual saliency model can be learned from multiscale features extracted using deep convolutional neural networks (CNNs), which have had many successes in visual recognition tasks. For learning such saliency models, we introduce a neural network architecture, which has fully connected layers on top of CNNs responsible for feature extraction at three different scales. The penultimate layer of our neural network has been confirmed to be a discriminative high-level feature vector for saliency detection, which we call deep contrast feature. To generate a more robust feature, we integrate handcrafted low-level features with our deep contrast feature. To promote further research and evaluation of visual saliency models, we also construct a new large database of 4447 challenging images and their pixelwise saliency annotations. Experimental results demonstrate that our proposed method is capable of achieving the state-of-the-art performance on all public benchmarks, improving the F-measure by 6.12% and 10%, respectively, on the DUT-OMRON data set and our new data set (HKU-IS), and lowering the mean absolute error by 9% and 35.3%, respectively, on these two data sets.

引用

页码：5012 / 5024

页数：13

共 56 条

[1]

Achanta R, 2009, PROC CVPR IEEE, P1597, DOI 10.1109/CVPRW.2009.5206596

[2]

[Anonymous], DEEP CONTRAST LEARNI

[3]

[Anonymous], PROC CVPR IEEE

[4]

[Anonymous], 2007, COMPUTER VISION PATT, DOI DOI 10.1109/CVPR.2007.383017

[5]

[Anonymous], 2007, PROC IEEE C COMPUT V, DOI 10.1109/CVPR.2007.383267

[6]

[Anonymous], 2014, P 31 INT C INT C MAC

[7] Contour Detection and Hierarchical Image Segmentation [J].

Arbelaez, Pablo ;

Maire, Michael ;

Fowlkes, Charless ;

Malik, Jitendra .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (05) :898-916

[8] Seam carving for content-aware image resizing [J].

Avidan, Shai ;

Shamir, Ariel .

ACM TRANSACTIONS ON GRAPHICS, 2007, 26 (03)

[9] iCoseg: Interactive Co-segmentation with Intelligent Scribble Guidance [J].

Batra, Dhruv ;

Kowdle, Adarsh ;

Parikh, Devi ;

Luo, Jiebo ;

Chen, Tsuhan .

2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, :3169-3176

[10] State-of-the-Art in Visual Attention Modeling [J].

Borji, Ali ;

Itti, Laurent .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (01) :185-207

← 1 2 3 4 5 6 →