Semantic modeling of natural scenes for content-based image retrieval

被引:261
作者
Vogel, Julia [1 ]
Schiele, Bernt
机构
[1] Univ British Columbia, Dept Comp Sci, Vancouver, BC V6T 1W5, Canada
[2] Tech Univ Darmstadt, Dept Comp Sci, Darmstadt, Germany
关键词
semantic scene understanding; content-based image retrieval; scene clasification; human scene preception; perceptually based techniques; computer vision;
D O I
10.1007/s11263-006-8614-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a novel image representation that renders it possible to access natural scenes by local semantic description. Our work is motivated by the continuing effort in content-based image retrieval to extract and to model the semantic content of images. The basic idea of the semantic modeling is to classify local image regions into semantic concept classes such as water, rocks, or foliage. Images are represented through the frequency of occurrence of these local concepts. Through extensive experiments, we demonstrate that the image representation is well suited for modeling the semantic content of heterogenous scene categories, and thus for categorization and retrieval. The image representation also allows us to rank natural scenes according to their semantic similarity relative to certain scene categories. Based on human ranking data, we learn a perceptually plausible distance measure that leads to a hi-h Correlation between the human and the automatically obtained typicality ranking. This result is especially valuable for content-based image retrieval where the goal is to present retrieval results in descending semantic similarity from the query.
引用
收藏
页码:133 / 157
页数:25
相关论文
共 43 条
  • [1] [Anonymous], 2000, The handbook of psychological testing
  • [2] Matching words and pictures
    Barnard, K
    Duygulu, P
    Forsyth, D
    de Freitas, N
    Blei, DM
    Jordan, MI
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (06) : 1107 - 1135
  • [3] BARNARD K, 2002, EUR C COMP VIS ECCV
  • [4] Bortz J., 1999, STAT SOZIALWISSENSCH
  • [5] Learning multi-label scene classification
    Boutell, MR
    Luo, JB
    Shen, XP
    Brown, CM
    [J]. PATTERN RECOGNITION, 2004, 37 (09) : 1757 - 1771
  • [6] LIBSVM: A Library for Support Vector Machines
    Chang, Chih-Chung
    Lin, Chih-Jen
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
  • [7] Mean shift: A robust approach toward feature space analysis
    Comaniciu, D
    Meer, P
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (05) : 603 - 619
  • [8] DUYGULU P, 2002, EUR C COMP VIS ECCV
  • [9] Eakins J., 1999, Content-Based Image Retrieval
  • [10] FENG SL, 2004, C IM VID RETR CIVR 0