No reference and reduced reference video quality metrics for end to end QoS monitoring

被引:13
作者
Le Callet, P [1 ]
Viard-Gaudin, C [1 ]
Péchard, S [1 ]
Caillault, E [1 ]
机构
[1] Univ Nantes, IRCCyN, Nantes, France
关键词
convolutional neural network; video quality assessment; MPEG; 2; temporal pooling;
D O I
10.1093/ietcom/e89-b.2.289
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper describes an objective measurement method designed to assess the perceived quality for digital videos. The proposed approach can be used either in the context of a reduced reference quality assessment or in the more challenging situation where no reference is available. In that way, it can be deployed in a QoS monitoring strategy ill order to control the end-user perceived quality. The originality of the approach relies on the very limited computation resources which are involved, such a system could be integrated quite easily in a real time application. It uses a convolutional neural network (CNN) that allows a continuous time scoring of the video. Experiments conducted on different MPEG-2 videos, with bit rates ranging from 2 to 6 Mbits/s, show the effectiveness of the proposed approach. More specifically, a linear correlation criterion, between objective and subjective scoring, ranging from 0.90 up to 0.95 has been obtained on a set of typical TV videos in the case of a reduced reference assessment. Without any reference to the original video, the correlation criteria remains quite satisfying since it still lies between 0.85 and 0.90, which is quite high with respect to the difficulty of the task, and equivalent and more in some cases than the traditional PSNR, which is a full reference measurement.
引用
收藏
页码:289 / 296
页数:8
相关论文
共 17 条
[1]  
Bishop C. M, 1995, NEURAL NETWORKS PATT
[2]  
CAVIEDES J, 2003, P SPIE VISUAL COMMUN
[3]  
Farias M., 2004, THESIS U CALIFORNIA
[4]   Objective quality assessment of MPEG-2 video streams by using CBP neural networks [J].
Gastaldo, P ;
Rovetta, S ;
Zunino, R .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (04) :939-947
[5]   A DISTORTION MEASURE FOR BLOCKING ARTIFACTS IN IMAGES BASED ON HUMAN VISUAL SENSITIVITY [J].
KARUNASEKERA, SA ;
KINGSBURY, NG .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 1995, 4 (06) :713-724
[6]   CARDINAL DIRECTIONS OF COLOR SPACE [J].
KRAUSKOPF, J ;
WILLIAMS, DR ;
HEELEY, DW .
VISION RESEARCH, 1982, 22 (09) :1123-1131
[7]  
LECALLET P, 2005, 1 INT WORKSH VID PRO
[8]   Gradient-based learning applied to document recognition [J].
Lecun, Y ;
Bottou, L ;
Bengio, Y ;
Haffner, P .
PROCEEDINGS OF THE IEEE, 1998, 86 (11) :2278-2324
[9]  
MELCHER D, 1995, TLA1595104
[10]   PHONEME RECOGNITION USING TIME-DELAY NEURAL NETWORKS [J].
WAIBEL, A ;
HANAZAWA, T ;
HINTON, G ;
SHIKANO, K ;
LANG, KJ .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1989, 37 (03) :328-339