An Object-Oriented Visual Saliency Detection Framework Based on Sparse Coding Representations

Cited by: 92

Authors:

Han, Junwei [1]

He, Sheng [1]

Qian, Xiaoliang [1]

Wang, Dongyang [1]

Guo, Lei [1]

Liu, Tianming [2]

Affiliations:

[1] Northwestern Polytech Univ, Sch Automat, Xian 710072, Peoples R China

[2] Univ Georgia, Dept Comp Sci, Athens, GA 30602 USA

Source:

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2013 / Vol. 23 / No. 12

Funding:

US National Science Foundation;

Keywords:

Gaussian mixture models; independent component analysis; saliency; sparse coding; visual attention; REGION DETECTION; ATTENTION; COLOR; SEGMENTATION; MODEL; EXTRACTION;

DOI:

10.1109/TCSVT.2013.2242594

Chinese Library Classification:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

Saliency detection aims at quantitatively predicting attended locations in an image. It may mimic the selection mechanism of the human vision system, which processes a small subset of a massive amount of visual input while the redundant information is ignored. Motivated by the biological evidence that the receptive fields of simple cells in V1 of the vision system are similar to sparse codes learned from natural images, this paper proposes a novel framework for saliency detection by using image sparse coding representations as features. Unlike many previous approaches dedicated to examining the local or global contrast of each individual location, this paper develops a probabilistic computational algorithm by integrating objectness likelihood with appearance rarity. In the proposed framework, image sparse coding representations are yielded through learning on a large amount of eye-fixation patches from an eye-tracking dataset. The objectness likelihood is measured by three generic cues called compactness, continuity, and center bias. The appearance rarity is inferred by using a Gaussian mixture model. The proposed work can serve as a basis for many techniques such as image/video segmentation, retrieval, retargeting, and compression. Extensive evaluations on benchmark databases and comparisons with a number of up-to-date algorithms demonstrate its effectiveness.


Pages: 2009-2021

Page count: 13
