Deep Neural Networks for No-Reference and Full-Reference Image Quality Assessment

被引:842
作者
Bosse, Sebastian [1 ]
Maniry, Dominique [1 ]
Mueller, Klaus-Robert [2 ,3 ,4 ]
Wiegand, Thomas [5 ,6 ]
Samek, Wojciech [1 ]
机构
[1] Fraunhofer Heinrich Hertz Inst, Dept Video Coding & Analyt, D-10587 Berlin, Germany
[2] Berlin Inst Technol, Machine Learning Lab, D-10587 Berlin, Germany
[3] Korea Univ, Dept Brain & Cognit Engn, Seoul 136713, South Korea
[4] Max Planck Inst Informat, D-66123 Saarbrucken, Germany
[5] Fraunhofer Heinrich Hertz Inst, D-10587 Berlin, Germany
[6] Berlin Inst Technol, Media Technol Lab, D-10587 Berlin, Germany
基金
新加坡国家研究基金会;
关键词
Full-reference image quality assessment; no-reference image quality assessment; neural networks; quality pooling; deep learning; feature extraction; regression; PERCEPTUAL IMAGE; SIMILARITY; INDEX;
D O I
10.1109/TIP.2017.2760518
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a deep neural network-based approach to image quality assessment (IQA). The network is trained end-to-end and comprises ten convolutional layers and five pooling layers for feature extraction, and two fully connected layers for regression, which makes it significantly deeper than related IQA models. Unique features of the proposed architecture are that: 1) with slight adaptations it can be used in a no-reference (NR) as well as in a full-reference (FR) IQA setting and 2) it allows for joint learning of local quality and local weights, i.e., relative importance of local quality to the global quality estimate, in an unified framework. Our approach is purely data-driven and does not rely on hand-crafted features or other types of prior domain knowledge about the human visual system or image statistics. We evaluate the proposed approach on the LIVE, CISQ, and TID2013 databases as well as the LIVE In the wild image quality challenge database and show superior performance to state-of-the-art NR and FR IQA methods. Finally, cross-database evaluation shows a high ability to generalize between different databases, indicating a high robustness of the learned features.
引用
收藏
页码:206 / 219
页数:14
相关论文
共 52 条
[1]  
[Anonymous], 2016, ARXIV160706140
[2]  
[Anonymous], 2009, Advances of Modern Radioelectronics
[3]  
[Anonymous], 2012, SPACING DIAERESIS EF, DOI DOI 10.1007/978-3-642-35289-8
[4]  
[Anonymous], 2016, 2016 PICTURE CODING, DOI DOI 10.1109/PCS.2016.7906376
[5]  
[Anonymous], 2015, Live in the wild image quality challenge database
[6]   On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation [J].
Bach, Sebastian ;
Binder, Alexander ;
Montavon, Gregoire ;
Klauschen, Frederick ;
Mueller, Klaus-Robert ;
Samek, Wojciech .
PLOS ONE, 2015, 10 (07)
[7]  
Bosse S., 2016, P IEEE INT C SYST MA
[8]  
Bosse S, 2017, IEEE IMAGE PROC, P315, DOI 10.1109/ICIP.2017.8296294
[9]   Assessing Perceived Image Quality Using Steady-State Visual Evoked Potentials and Spatio-Spectral Decomposition [J].
Bosse, Sebastian ;
Acqualagna, Laura ;
Samek, Wojciech ;
Porbadnigk, Anne K. ;
Curio, Gabriel ;
Blankertz, Benjamin ;
Mueller, Klaus-Robert ;
Wiegand, Thomas .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (08) :1694-1706
[10]  
Bosse S, 2016, IEEE IMAGE PROC, P3773, DOI 10.1109/ICIP.2016.7533065