Auto-encoder-based shared mid-level visual dictionary learning for scene classification using very high resolution remote sensing images

被引:69
作者
Cheng, Gong [1 ]
Zhou, Peicheng [1 ]
Han, Junwei [1 ]
Guo, Lei [1 ]
Han, Jungong [2 ]
机构
[1] Northwestern Polytech Univ, Sch Automat, Xian 710072, Peoples R China
[2] Civolut Technol, Eindhoven, Netherlands
基金
美国国家科学基金会; 中国博士后科学基金;
关键词
EFFECTIVE FEATURE-EXTRACTION; OBJECT DETECTION; EFFICIENT;
D O I
10.1049/iet-cvi.2014.0270
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Effective representation and classification of scenes using very high resolution (VHR) remote sensing images cover a wide range of applications. Although robust low-level image features have been proven to be effective for scene classification, they are not semantically meaningful and thus have difficulty to deal with challenging visual recognition tasks. In this study, the authors propose a new and effective auto-encoder-based method to learn a shared mid-level visual dictionary. This dictionary serves as a shared and universal basis to discover mid-level visual elements. On the one hand, the mid-level visual dictionary learnt using machine learning technique is more discriminative and contains rich semantic information, compared with the traditional low-level visual words. On the other hand, the mid-level visual dictionary is more robust to occlusions and image clutters. In the authors' scene-classification scheme, they use discriminative mid-level visual elements, rather than individual pixels or low-level image features, to represent images. This new image representation is able to capture much of the high-level meaning and contents of the image, facilitating challenging remote sensing image scene-classification tasks. Comprehensive evaluations on a challenging VHR remote sensing images data set and comparisons with state-of-the-art approaches demonstrate the effectiveness and superiority of their study.
引用
收藏
页码:639 / 647
页数:9
相关论文
共 28 条
  • [1] [Anonymous], 2010, CS294A LECT NOTES SP
  • [2] Modeling and detection of geospatial objects using texture motifs
    Bhagavathy, Sitaram
    Manjunath, B. S.
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2006, 44 (12): : 3706 - 3715
  • [3] Multi-class geospatial object detection and geographic image classification based on collection of part detectors
    Cheng, Gong
    Han, Junwei
    Zhou, Peicheng
    Guo, Lei
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2014, 98 : 119 - 132
  • [4] Object detection in remote sensing imagery using a discriminatively trained mixture model
    Cheng, Gong
    Han, Junwei
    Guo, Lei
    Qian, Xiaoliang
    Zhou, Peicheng
    Yao, Xiwen
    Hu, Xintao
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2013, 85 : 32 - 43
  • [5] Automatic landslide detection from remote-sensing imagery using a scene classification method based on BoVW and pLSA
    Cheng, Gong
    Guo, Lei
    Zhao, Tianyun
    Han, Junwei
    Li, Huihui
    Fang, Jun
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2013, 34 (01) : 45 - 59
  • [6] Unsupervised Feature Learning for Aerial Scene Classification
    Cheriyadat, Anil M.
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2014, 52 (01): : 439 - 451
  • [7] Histograms of oriented gradients for human detection
    Dalal, N
    Triggs, B
    [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 886 - 893
  • [8] Mining Mid-level Features for Image Classification
    Fernando, Basura
    Fromont, Elisa
    Tuytelaars, Tinne
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2014, 108 (03) : 186 - 203
  • [9] Efficient, simultaneous detection of multi-class geospatial targets based on visual saliency modeling and discriminative learning of sparse coding
    Han, Junwei
    Zhou, Peicheng
    Zhang, Dingwen
    Cheng, Gong
    Guo, Lei
    Liu, Zhenbao
    Bu, Shuhui
    Wu, Jun
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2014, 89 : 37 - 48
  • [10] Reducing the dimensionality of data with neural networks
    Hinton, G. E.
    Salakhutdinov, R. R.
    [J]. SCIENCE, 2006, 313 (5786) : 504 - 507