SVD-Based Quality Metric for Image and Video Using Machine Learning

被引:126
作者
Narwaria, Manish [1 ]
Lin, Weisi [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 2012年 / 42卷 / 02期
关键词
Image structure; singular value decomposition (SVD); support vector regression (SVR); visual quality assessment; SINGULAR-VALUE DECOMPOSITION; INFORMATION; SELECTION; FEATURES; INDEX;
D O I
10.1109/TSMCB.2011.2163391
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We study the use of machine learning for visual quality evaluation with comprehensive singular value decomposition (SVD)-based visual features. In this paper, the two-stage process and the relevant work in the existing visual quality metrics are first introduced followed by an in-depth analysis of SVD for visual quality assessment. Singular values and vectors form the selected features for visual quality assessment. Machine learning is then used for the feature pooling process and demonstrated to be effective. This is to address the limitations of the existing pooling techniques, like simple summation, averaging, Minkowski summation, etc., which tend to be ad hoc. We advocate machine learning for feature pooling because it is more systematic and data driven. The experiments show that the proposed method outperforms the eight existing relevant schemes. Extensive analysis and cross validation are performed with ten publicly available databases (eight for images with a total of 4042 test images and two for video with a total of 228 videos). We use all publicly accessible software and databases in this study, as well as making our own software public, to facilitate comparison in future research.
引用
收藏
页码:347 / 364
页数:18
相关论文
共 79 条
[61]   An SVD-based grayscale image quality measure for local and global assessment [J].
Shnayderman, A ;
Gusev, A ;
Eskicioglu, AM .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2006, 15 (02) :422-429
[62]   STOCHASTIC PERTURBATION-THEORY [J].
STEWART, GW .
SIAM REVIEW, 1990, 32 (04) :579-610
[63]   Codevelopmental learning between human and humanoid robot using a dynamic neural-network model [J].
Tani, Jun ;
Nishimoto, Ryu ;
Namikawa, Jun ;
Ito, Masato .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (01) :43-59
[64]   Reduced-Reference IQA in Contourlet Domain [J].
Tao, Dacheng ;
Li, Xuelong ;
Lu, Wen ;
Gao, Xinbo .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2009, 39 (06) :1623-1627
[65]  
Targhi AT, 2003, P SOC PHOTO-OPT INS, V5150, P972, DOI 10.1117/12.503073
[66]   Do singular values contain adequate information for face recognition? [J].
Tian, Y ;
Tan, TN ;
Wang, YH ;
Fang, YC .
PATTERN RECOGNITION, 2003, 36 (03) :649-655
[67]  
Tourancheau S, 2008, IEEE IMAGE PROC, P365, DOI 10.1109/ICIP.2008.4711767
[68]  
Wandell B. A., 1995, Foundations of vision
[69]   Image quality assessment: From error visibility to structural similarity [J].
Wang, Z ;
Bovik, AC ;
Sheikh, HR ;
Simoncelli, EP .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2004, 13 (04) :600-612
[70]  
Wang Z, 2002, INT CONF ACOUST SPEE, P3313