Saliency Detection in the Compressed Domain for Adaptive Image Retargeting

被引：271

作者：

Fang, Yuming ^{[1
]}

Chen, Zhenzhong ^{[2
]}

Lin, Weisi ^{[1
]}

Lin, Chia-Wen ^{[3
]}

机构：

[1] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore

[2] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore

[3] Natl Tsing Hua Univ, Dept Elect Engn, Hsinchu 30013, Taiwan

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2012年 / 21卷 / 09期

关键词：

Compressed domain; image retargeting; joint photographic experts group (JPEG); saliency detection; texture homogeneity; COLOR; MODEL; TEXTURE; SEGMENTATION; ATTENTION;

D O I：

10.1109/TIP.2012.2199126

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Saliency detection plays important roles in many image processing applications, such as regions of interest extraction and image resizing. Existing saliency detection models are built in the uncompressed domain. Since most images over Internet are typically stored in the compressed domain such as joint photographic experts group (JPEG), we propose a novel saliency detection model in the compressed domain in this paper. The intensity, color, and texture features of the image are extracted from discrete cosine transform (DCT) coefficients in the JPEG bit-stream. Saliency value of each DCT block is obtained based on the Hausdorff distance calculation and feature map fusion. Based on the proposed saliency detection model, we further design an adaptive image retargeting algorithm in the compressed domain. The proposed image retargeting algorithm utilizes multioperator operation comprised of the block-based seam carving and the image scaling to resize images. A new definition of texture homogeneity is given to determine the amount of removal block-based seams. Thanks to the directly derived accurate saliency information from the compressed domain, the proposed image retargeting algorithm effectively preserves the visually important regions for images, efficiently removes the less crucial regions, and therefore significantly outperforms the relevant state-of-the-art algorithms, as demonstrated with the in-depth analysis in the extensive experiments.

引用

页码：3888 / 3901

页数：14

共 45 条

[21] Estimating just-noticeable distortion for video [J].

Jia, Yuting ;

Lin, Weisi ;

Kassim, Ashraf A. .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2006, 16 (07) :820-829

[22] Nonhomogeneous scaling optimization for realtime image resizing [J].

Jin, Yong ;

Liu, Ligang ;

Wu, Qingbiao .

VISUAL COMPUTER, 2010, 26 (6-8) :769-778

[23] A coherent computational approach to model bottom-up visual attention [J].

Le Meur, O ;

Le Callet, P ;

Barba, D ;

Thoreau, D .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (05) :802-817

[24] Do video coding impairments disturb the visual attention deployment? [J].

Le Meur, O. ;

Ninassi, A. ;

Le Callet, P. ;

Barba, D. .

SIGNAL PROCESSING-IMAGE COMMUNICATION, 2010, 25 (08) :597-609

[25] A new texture generation method based on pseudo-DCT coefficients [J].

Li, HL ;

Liu, GZ ;

Zhang, ZW .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2006, 15 (05) :1300-1312

[26]

Li Y., 2007, P IEEE INT C IM PROC, P21

[27] Modeling visual attention's modulatory aftereffects on visual sensitivity and quality evaluation [J].

Lu, ZK ;

Lin, WS ;

Yang, XK ;

Ong, EP ;

Yao, SS .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2005, 14 (11) :1928-1942

[28]

Ma Y, 2003, P ACM INT C MULT, P228

[29] Color image segmentation by analysis of subset connectedness and color homogeneity properties [J].

Macaire, L ;

Vandenbroucke, N ;

Postaire, JG .

COMPUTER VISION AND IMAGE UNDERSTANDING, 2006, 102 (01) :105-116

[30] THRESHOLD SELECTION METHOD FROM GRAY-LEVEL HISTOGRAMS [J].

OTSU, N .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1979, 9 (01) :62-66

← 1 2 3 4 5 →