A model for image categorisation based on a biological visual mechanism

被引:2
作者
He Dongjian [1 ]
Shao Junming [1 ]
Gen Nan [1 ]
Yang Qinli [2 ]
机构
[1] NW A&F Univ, Coll Informat Engn, Yangling 712100, Shaanxi, Peoples R China
[2] NW A&F Univ, Coll Resources & Environm, Yangling 712100, Shaanxi, Peoples R China
关键词
image categorisation; region of interest; visual attention; visual cortex;
D O I
暂无
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
For integrating a visual attention mechanism and object recognition in the visual cortex we propose a novel biologically-motivated computational model for image categorisation. We first extract the focus of attention using an image-driven, bottom-up attention model and then adjust it according to the principles of whole effect and centre preference. After that, we obtain the region of interest, depending on the characteristics of object spatial proximity and object similarity. Based on this we compute a set of position- and scale-invariant C2 features and finally pool them into the standard classifier to achieve image categorisation. We test our model on an image database used in SIMPLIcity. The results suggest that our model can not only classify images effectively under various complex "clutters" but also that it needs only a few training samples.
引用
收藏
页码:781 / 787
页数:7
相关论文
共 15 条
  • [11] Components of reflexive visual orienting to moving objects
    Ro, T
    Rafal, RD
    [J]. PERCEPTION & PSYCHOPHYSICS, 1999, 61 (05): : 826 - 836
  • [12] A feedforward architecture accounts for rapid categorization
    Serre, Thomas
    Oliva, Aude
    Poggio, Tomaso
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (15) : 6424 - 6429
  • [13] Robust object recognition with cortex-like mechanisms
    Serre, Thomas
    Wolf, Lior
    Bileschi, Stanley
    Riesenhuber, Maximilian
    Poggio, Tomaso
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (03) : 411 - 426
  • [14] Robust multipose face detection in images
    Xiao, R
    Li, MJ
    Zhang, HJ
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2004, 14 (01) : 31 - 41
  • [15] A geometric snake model for segmentation of medical imagery
    Yezzi, A
    Kichenassamy, S
    Kumar, A
    Olver, P
    Tannenbaum, A
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 1997, 16 (02) : 199 - 209