SVD-Based Quality Metric for Image and Video Using Machine Learning

被引:126
作者
Narwaria, Manish [1 ]
Lin, Weisi [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 2012年 / 42卷 / 02期
关键词
Image structure; singular value decomposition (SVD); support vector regression (SVR); visual quality assessment; SINGULAR-VALUE DECOMPOSITION; INFORMATION; SELECTION; FEATURES; INDEX;
D O I
10.1109/TSMCB.2011.2163391
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We study the use of machine learning for visual quality evaluation with comprehensive singular value decomposition (SVD)-based visual features. In this paper, the two-stage process and the relevant work in the existing visual quality metrics are first introduced followed by an in-depth analysis of SVD for visual quality assessment. Singular values and vectors form the selected features for visual quality assessment. Machine learning is then used for the feature pooling process and demonstrated to be effective. This is to address the limitations of the existing pooling techniques, like simple summation, averaging, Minkowski summation, etc., which tend to be ad hoc. We advocate machine learning for feature pooling because it is more systematic and data driven. The experiments show that the proposed method outperforms the eight existing relevant schemes. Extensive analysis and cross validation are performed with ten publicly available databases (eight for images with a total of 4042 test images and two for video with a total of 228 videos). We use all publicly accessible software and databases in this study, as well as making our own software public, to facilitate comparison in future research.
引用
收藏
页码:347 / 364
页数:18
相关论文
共 79 条
[31]   CAN VISUAL FIXATION PATTERNS IMPROVE IMAGE FIDELITY ASSESSMENT? [J].
Larson, Eric C. ;
Vu, Cuong ;
Chandler, Damon M. .
2008 15TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-5, 2008, :2572-2575
[32]  
Le Callet P., Subjective quality assessment IRCCyN/IVC database
[33]   A convolutional neural network approach for objective video quality assessment [J].
Le Callet, Patrick ;
Viard-Gaudin, Christian ;
Barba, Dominique .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2006, 17 (05) :1316-1327
[34]   DISTORTION CRITERIA OF THE HUMAN VIEWER [J].
LIMB, JO .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1979, 9 (12) :778-793
[35]   Perceptual visual quality metrics: A survey [J].
Lin, Weisi ;
Kuo, C-C Jay .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2011, 22 (04) :297-312
[36]   Visual distortion gauge based on discrimination of noticeable contrast changes [J].
Lin, WS ;
Dong, L ;
Xue, P .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2005, 15 (07) :900-909
[37]   First-order perturbation analysis of singular vectors in singular value decomposition [J].
Liu, Jun ;
Liu, Xiangqian ;
Ma, Xiaoli .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2008, 56 (07) :3044-3049
[38]  
Lubin Jeffrey, 1993, P163
[39]  
Ma Q, 2008, INT C PATT RECOG, P2783
[40]   Image quality measurement besides distortion type classifying [J].
Mahmoudi-Aznaveh, Ahmad ;
Mansouri, Azadeh ;
Torkamani-Azar, Farah ;
Eslami, Mohammad .
OPTICAL REVIEW, 2009, 16 (01) :30-34