A Time Delay Neural Network model for simulating eye gaze data

被引:8
作者
Zhang, Yun [1 ,2 ]
Zhao, Xiaoyu [2 ]
Fu, Hong [2 ]
Liang, Zhen [2 ]
Chi, Zheru [2 ]
Zhao, Xinbo [1 ]
Feng, Dagan [2 ,3 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Shaanxi, Peoples R China
[2] Hong Kong Polytech Univ, Ctr Multimedia Signal Proc, Dept Elect & Informat Engn, Kowloon, Hong Kong, Peoples R China
[3] Univ Sydney, Sch Informat Technol, Sydney, NSW 2006, Australia
关键词
eye movement modelling; visual attention model; time delayed neural networks; oculomotor control; coincident probability; coincident significance; VISUAL-ATTENTION; OCULOMOTOR CONTROL; REGIONS; OVERT;
D O I
10.1080/0952813X.2010.506298
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human eye movement modelling is a new, challenging and promising research topic in computer vision. Human eye movement modelling aims at simulating the scan path in which a human being views an image, a scene or a video. The successful modelling of human eye movements potentially benefits a wide range of applications such as image retrieval, image annotation, medical image diagnosis and human visual perception. This article presents a model based on a Time Delay Neural Network (TDNN) to simulate eye gaze data. First, 120 Hz eye gaze data are acquired by a non-intrusive table-mounted eye tracker. Our proposed model is to simulate the image reading process of a single subject. Seven features are then extracted based on the knowledge of the human oculomotor system and the image contents to train a TDNN. Finally, the trained TDNN combined with a saccade control mechanism is used to simulate the scan path of a human being viewing an image. The proposed model can generate 600 points of raw eye gaze data in a 5-second eye viewing window. Both subjective and objective methods are used to evaluate the model by comparing its behaviour and characteristics with the real eye gaze data collected from an eye tracker. Qualitative assessment shows that the subject can hardly tell the differences between the scan path from the model and that from a human being. By evaluating the coincident probability Cp and coincident significance Cs, quantitative assessment shows that the results from the TDNN model are reasonable and similar to human scan paths.
引用
收藏
页码:111 / 126
页数:16
相关论文
共 30 条
[1]  
[Anonymous], 1999, Network: Computation in Neural Systems
[2]  
BRUCE BND, 2003, P IEEE INT C IM PROC, V1, P429
[3]   High-level aspects of oculomotor control during viewing of natural-task images [J].
Canosa, RL ;
Pelz, JB ;
Mennie, NR ;
Peak, J .
HUMAN VISION AND ELECTRONIC IMAGING VIII, 2003, 5007 :240-251
[4]   Unsupervised segmentation of color-texture regions in images and video [J].
Deng, YN ;
Manjunath, BS .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (08) :800-810
[5]  
Duchowski A. T., 2003, EYE TRACKING METHODO
[6]   A breadth-first survey of eye-tracking applications [J].
Duchowski, AT .
BEHAVIOR RESEARCH METHODS INSTRUMENTS & COMPUTERS, 2002, 34 (04) :455-470
[7]   Visual attention: Control, representation, and time course [J].
Egeth, HE ;
Yantis, S .
ANNUAL REVIEW OF PSYCHOLOGY, 1997, 48 :269-297
[8]   Attention-driven image interpretation with application to image retrieval [J].
Fu, Hong ;
Chi, Zheru ;
Feng, Dagan .
PATTERN RECOGNITION, 2006, 39 (09) :1604-1621
[9]   A model of saliency-based visual attention for rapid scene analysis [J].
Itti, L ;
Koch, C ;
Niebur, E .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (11) :1254-1259
[10]   A saliency-based search mechanism for overt and covert shifts of visual attention [J].
Itti, L ;
Koch, C .
VISION RESEARCH, 2000, 40 (10-12) :1489-1506