Geographic Image Retrieval Using Local Invariant Features

Cited by: 290
Authors
Yang, Yi [1]
Newsam, Shawn [1]
Affiliations
[1] Univ Calif, Elect Engn & Comp Sci Program, Merced, CA 95343 USA
Source
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2013, Vol. 51, No. 2
Funding
National Science Foundation (USA)
Keywords
Bag of visual words; content-based image retrieval; high-resolution overhead image analysis; land cover; land use; local invariant features; remote sensing; TEXTURE FEATURES; URBAN-AREA; SCALE; SIFT; REPRESENTATION; POINTS;
DOI
10.1109/TGRS.2012.2205158
Chinese Library Classification
P3 [Geophysics]; P59 [Geochemistry]
Discipline classification codes
0708; 070902
Abstract
This paper investigates local invariant features for geographic (overhead) image retrieval. Local features are particularly well suited for the newer generations of aerial and satellite imagery whose increased spatial resolution, often just tens of centimeters per pixel, allows a greater range of objects and spatial patterns to be recognized than ever before. Local invariant features have been successfully applied to a broad range of computer vision problems and, as such, are receiving increased attention from the remote sensing community, particularly for challenging tasks such as detection and classification. We perform an extensive evaluation of local invariant features for image retrieval of land-use/land-cover (LULC) classes in high-resolution aerial imagery. We report on the effects of a number of design parameters on a bag-of-visual-words (BOVW) representation, including saliency- versus grid-based local feature extraction, the size of the visual codebook, the clustering algorithm used to create the codebook, and the dissimilarity measure used to compare the BOVW representations. We also perform comparisons with standard features such as color and texture. The performance is quantitatively evaluated using a first-of-its-kind LULC ground truth data set which will be made publicly available to other researchers. In addition to reporting on the effects of the core design parameters, we also describe interesting findings such as the performance-efficiency tradeoffs that are possible through the appropriate pairings of different-sized codebooks and dissimilarity measures. While the focus is on image retrieval, we expect our insights to be informative for other applications such as detection and classification.
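The BOVW pipeline named in the abstract (cluster local descriptors into a codebook, quantize each image's descriptors into a histogram, rank images by a dissimilarity measure) can be illustrated with a minimal Python sketch. This is not the authors' implementation. It assumes plain k-means as the clustering algorithm and uses L1 distance and histogram intersection as two example dissimilarity measures. The abstract does not say which algorithms or measures were evaluated. All function names are hypothetical, and the toy 2-D vectors stand in for real local descriptors such as SIFT.

```python
import math
import random


def nearest(p, centroids):
    """Index of the centroid (visual word) closest to descriptor p."""
    return min(range(len(centroids)), key=lambda i: math.dist(p, centroids[i]))


def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's k-means (an assumed choice); returns k centroids."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            clusters[nearest(p, centroids)].append(p)
        for i, members in enumerate(clusters):
            if members:  # keep the old centroid if a cluster is empty
                centroids[i] = [sum(c) / len(members) for c in zip(*members)]
    return centroids


def bovw_histogram(descriptors, centroids):
    """Count descriptors per visual word, then L1-normalize the counts."""
    hist = [0.0] * len(centroids)
    for d in descriptors:
        hist[nearest(d, centroids)] += 1.0
    total = sum(hist)
    return [h / total for h in hist] if total else hist


def l1_distance(a, b):
    """Sum of absolute bin differences."""
    return sum(abs(x - y) for x, y in zip(a, b))


def intersection_dissimilarity(a, b):
    """1 minus histogram intersection, for L1-normalized histograms."""
    return 1.0 - sum(min(x, y) for x, y in zip(a, b))


def rank(query_hist, database, dissimilarity):
    """Image names ordered from most to least similar to the query."""
    return sorted(database, key=lambda name: dissimilarity(query_hist, database[name]))


if __name__ == "__main__":
    # Toy descriptors from two hypothetical images.
    image_a = [[0, 0], [0, 1], [1, 0]]
    image_b = [[10, 10], [10, 11], [11, 10]]
    codebook = kmeans(image_a + image_b, k=2)
    database = {
        "image_a": bovw_histogram(image_a, codebook),
        "image_b": bovw_histogram(image_b, codebook),
    }
    query = bovw_histogram([[0.5, 0.5], [1, 1]], codebook)
    print(rank(query, database, l1_distance))
```

The codebook size `k` and the `dissimilarity` argument are the two knobs that correspond to the design parameters the abstract lists; swapping `l1_distance` for `intersection_dissimilarity` changes only the ranking function, not the histograms.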
Pages: 818-832
Page count: 15