An Object-Oriented Visual Saliency Detection Framework Based on Sparse Coding Representations

被引:92
作者
Han, Junwei [1 ]
He, Sheng [1 ]
Qian, Xiaoliang [1 ]
Wang, Dongyang [1 ]
Guo, Lei [1 ]
Liu, Tianming [2 ]
机构
[1] Northwestern Polytech Univ, Sch Automat, Xian 710072, Peoples R China
[2] Univ Georgia, Dept Comp Sci, Athens, GA 30602 USA
基金
美国国家科学基金会;
关键词
Gaussian mixture models; independent component analysis; saliency; sparse coding; visual attention; REGION DETECTION; ATTENTION; COLOR; SEGMENTATION; MODEL; EXTRACTION;
D O I
10.1109/TCSVT.2013.2242594
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Saliency detection aims at quantitatively predicting attended locations in an image. It may mimic the selection mechanism of the human vision system, which processes a small subset of a massive amount of visual input while the redundant information is ignored. Motivated by the biological evidence that the receptive fields of simple cells in V1 of the vision system are similar to sparse codes learned from natural images, this paper proposes a novel framework for saliency detection by using image sparse coding representations as features. Unlike many previous approaches dedicated to examining the local or global contrast of each individual location, this paper develops a probabilistic computational algorithm by integrating objectness likelihood with appearance rarity. In the proposed framework, image sparse coding representations are yielded through learning on a large amount of eye-fixation patches from an eye-tracking dataset. The objectness likelihood is measured by three generic cues called compactness, continuity, and center bias. The appearance rarity is inferred by using a Gaussian mixture model. The proposed paper can serve as a basis for many techniques such as image/video segmentation, retrieval, retargeting, and compression. Extensive evaluations on benchmark databases and comparisons with a number of up-to-date algorithms demonstrate its effectiveness.
引用
收藏
页码:2009 / 2021
页数:13
相关论文
共 49 条
[1]  
Achanta R., 2010, P IEEE INT C IM PROC, P33
[2]  
Achanta R, 2008, LECT NOTES COMPUT SC, V5008, P66
[3]  
Achanta R, 2009, PROC CVPR IEEE, P1597, DOI 10.1109/CVPRW.2009.5206596
[4]  
[Anonymous], ACM MULT C MULT C
[5]  
[Anonymous], 2003, P 11 ACM INT C MULTI, DOI DOI 10.1145/957013.957094
[6]  
[Anonymous], 2006, Advances in Neural Information Processing Systems
[7]   Dynamic bit allocation for multiple video object coding [J].
Chen, Zhenzhong ;
Han, Junwei ;
Ngan, King Ngi .
IEEE TRANSACTIONS ON MULTIMEDIA, 2006, 8 (06) :1117-1124
[8]   Unsupervised segmentation of color-texture regions in images and video [J].
Deng, YN ;
Manjunath, BS .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (08) :800-810
[9]   Low bit-rate coding of image sequences using adaptive regions of interest [J].
Doulamis, N ;
Doulamis, A ;
Kalogeras, D ;
Kollias, S .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1998, 8 (08) :928-934
[10]   Attention-driven salient edge(s) and region(s) extraction with application to CBIR [J].
Feng, Songhe ;
Xu, De ;
Yang, Xu .
SIGNAL PROCESSING, 2010, 90 (01) :1-15