Modelling Spatio-Temporal Saliency to Predict Gaze Direction for Short Videos

Cited by: 155
Authors
Marat, Sophie [1 ]
Phuoc, Tien Ho [1 ]
Granjon, Lionel [1 ]
Guyader, Nathalie [1 ]
Pellerin, Denis [1 ]
Guerin-Dugue, Anne [1 ]
Affiliations
[1] Dept Images Signal, GIPSA Lab, F-38402 Grenoble, France
Keywords
Saliency; Spatio-temporal model; Gaze prediction; Video viewing; Visual attention; Integration; Allocation; Selection
DOI
10.1007/s11263-009-0215-3
CLC Number
TP18 [Theory of Artificial Intelligence]
Discipline Codes
081104; 0812; 0835; 1405
Abstract
This paper presents a spatio-temporal saliency model that predicts eye movements during free viewing of videos. The model is inspired by the biology of the first steps of the human visual system. It extracts two signals from the video stream corresponding to the two main outputs of the retina: parvocellular and magnocellular. Both signals are then split into elementary feature maps by cortical-like filters. These feature maps are used to form two saliency maps, a static one and a dynamic one, which are then fused into a spatio-temporal saliency map. The model is evaluated by comparing the salient areas of each frame, as predicted by the spatio-temporal saliency map, to the eye positions of different subjects recorded during a free-viewing experiment on a large video database (17,000 frames). In parallel, the static and dynamic pathways are analyzed to understand which features are more or less salient, and for which types of video the model is a good or poor predictor of eye movements.
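The pipeline described in the abstract (a static map from spatial structure, a dynamic map from motion, then fusion) can be sketched in a few lines of NumPy. This is a minimal illustration under simplifying assumptions, not the paper's implementation: gradient magnitude stands in for the cortical-like (Gabor) filter bank of the parvocellular pathway, absolute frame difference stands in for the magnocellular motion signal, and the fusion rule is one plausible choice rather than the authors' exact formula.

```python
import numpy as np

def static_saliency(frame):
    """Static pathway (sketch): spatial contrast via gradient magnitude,
    a crude stand-in for a bank of cortical-like oriented filters."""
    gy, gx = np.gradient(frame.astype(float))
    s = np.hypot(gx, gy)
    return s / (s.max() + 1e-8)  # normalize to [0, 1)

def dynamic_saliency(prev, curr):
    """Dynamic pathway (sketch): absolute inter-frame difference as a
    simple proxy for the magnocellular motion signal."""
    d = np.abs(curr.astype(float) - prev.astype(float))
    return d / (d.max() + 1e-8)

def fuse(ms, md, alpha=0.5):
    """Fuse static and dynamic maps. A weighted sum plus a reinforcement
    term (ms * md, boosting regions salient in both pathways) is a common
    scheme; the paper's actual fusion may differ."""
    fused = alpha * ms + (1.0 - alpha) * md + ms * md
    return fused / (fused.max() + 1e-8)

# Toy demo: a bright square appears between two otherwise blank frames.
prev = np.zeros((16, 16))
curr = np.zeros((16, 16))
curr[4:12, 4:12] = 1.0
saliency_map = fuse(static_saliency(curr), dynamic_saliency(prev, curr))
```

In a real evaluation, the resulting per-frame map would be compared against recorded eye positions, e.g. by scoring the map values at fixated locations against the map's overall distribution.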
Pages: 231-243
Number of pages: 13