An End-to-End Compression Framework Based on Convolutional Neural Networks

被引:146
作者
Jiang, Feng [1 ]
Tao, Wen [1 ]
Liu, Shaohui [1 ]
Ren, Jie [1 ]
Guo, Xun [2 ]
Zhao, Debin [1 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Heilongjiang, Peoples R China
[2] Microsoft Res Asia, Beijing 100080, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep learning; compression framework; compact representation; convolutional neural networks (CNNs); ARTIFACT REDUCTION; DEBLOCKING; FILTER; DCT;
D O I
10.1109/TCSVT.2017.2734838
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Deep learning, e.g., convolutional neural networks (CNNs), has achieved great success in image processing and computer vision especially in high-level vision applications, such as recognition and understanding. However, it is rarely used to solve low-level vision problems such as image compression studied in this paper. Here, we move forward a step and propose a novel compression framework based on CNNs. To achieve high-quality image compression at low bit rates, two CNNs are seamlessly integrated into an end-to-end compression framework. The first CNN, named compact convolutional neural network (ComCNN), learns an optimal compact representation from an input image, which preserves the structural information and is then encoded using an image codes (e.g., JPEG, JPEG2000, or BPG). The second CNN, named reconstruction convolutional neural network (RecCNN), is used to reconstruct the decoded image with high quality in the decoding end. To make two CNNs effectively collaborate, we develop a unified end-to-end learning algorithm to simultaneously learn ComCNN and RecCNN, which facilitates the accurate reconstruction of the decoded image using RecCNN. Such a design also makes the proposed compression framework compatible with existing image coding standards. Experimental results validate that the proposed compression framework greatly outperforms several compression frameworks that use existing image coding standards with the state-of-the-art deblocking or denoising post-processing methods.
引用
收藏
页码:3007 / 3018
页数:12
相关论文
共 42 条
  • [11] [Anonymous], TRAINABLE NONLINEAR
  • [12] Balle J., 2016, End-to-end optimized image compression
  • [13] Reducing Artifacts in JPEG Decompression Via a Learned Dictionary
    Chang, Huibin
    Ng, Michael K.
    Zeng, Tieyong
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2014, 62 (03) : 718 - 728
  • [14] Image denoising by sparse 3-D transform-domain collaborative filtering
    Dabov, Kostadin
    Foi, Alessandro
    Katkovnik, Vladimir
    Egiazarian, Karen
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2007, 16 (08) : 2080 - 2095
  • [15] Accelerating the Super-Resolution Convolutional Neural Network
    Dong, Chao
    Loy, Chen Change
    Tang, Xiaoou
    [J]. COMPUTER VISION - ECCV 2016, PT II, 2016, 9906 : 391 - 407
  • [16] Compression Artifacts Reduction by a Deep Convolutional Network
    Dong, Chao
    Deng, Yubin
    Loy, Chen Change
    Tang, Xiaoou
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 576 - 584
  • [17] Image Super-Resolution Using Deep Convolutional Networks
    Dong, Chao
    Loy, Chen Change
    He, Kaiming
    Tang, Xiaoou
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (02) : 295 - 307
  • [18] Duchi J, 2011, J MACH LEARN RES, V12, P2121
  • [19] Pointwise shape-adaptive DCT for high-quality denoising and deblocking of grayscale and color images
    Foi, Alessandro
    Katkovnik, Vladimir
    Egiazarian, Karen
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2007, 16 (05) : 1395 - 1411
  • [20] A generic post-deblocking filter for block based image compression algorithms
    Francisco, Nelson C.
    Rodrigues, Nuno M. M.
    da Silva, Eduardo A. B.
    de Faria, Sergio M. M.
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2012, 27 (09) : 985 - 997