Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images

Cited by: 1463
Authors
Cheng, Gong [1]
Zhou, Peicheng [1]
Han, Junwei [1]
Affiliation
[1] Northwestern Polytech Univ, Sch Automat, Xian 710072, Peoples R China
Source
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2016 / Vol. 54 / No. 12
Funding
U.S. National Science Foundation;
Keywords
Convolutional neural networks (CNNs); machine learning; object detection; remote sensing images; rotation-invariant CNN (RICNN); BINARY HYPOTHESIS MODEL; SPARSE REPRESENTATION; SCENE CLASSIFICATION; TARGET DETECTION; VEHICLE DETECTION; VISUAL SALIENCY; SHIP DETECTION; FEATURES;
DOI
10.1109/TGRS.2016.2601622
Chinese Library Classification (CLC) number
P3 [Geophysics]; P59 [Geochemistry];
Discipline classification code
0708; 070902;
Abstract
Object detection in very high resolution (VHR) optical remote sensing images is a fundamental problem in remote sensing image analysis. Owing to advances in powerful feature representations, machine-learning-based object detection is receiving increasing attention. Although numerous feature representations exist, most of them are handcrafted or shallow-learning-based features. As the object detection task becomes more challenging, their descriptive capability becomes limited or even impoverished. More recently, deep learning algorithms, especially convolutional neural networks (CNNs), have shown much stronger feature representation power in computer vision. Despite the progress made on natural scene images, directly using CNN features for object detection in optical remote sensing images is problematic because they do not effectively handle object rotation variations. To address this problem, this paper proposes a novel and effective approach to learning a rotation-invariant CNN (RICNN) model for advancing the performance of object detection, which is achieved by introducing and learning a new rotation-invariant layer on the basis of existing CNN architectures. Unlike the training of traditional CNN models, which optimizes only the multinomial logistic regression objective, our RICNN model is trained by optimizing a new objective function that imposes a regularization constraint, which explicitly enforces the feature representations of the training samples before and after rotation to be mapped close to each other, hence achieving rotation invariance. To facilitate training, we first train the rotation-invariant layer and then domain-specifically fine-tune the whole RICNN network to further boost performance. Comprehensive evaluations on a publicly available ten-class object detection data set demonstrate the effectiveness of the proposed method.
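The objective described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact form of the regularization constraint, so the code below assumes one plausible choice: the squared distance between a sample's feature vector and the mean feature vector of its rotated copies, added to the usual softmax cross-entropy loss. The function names, the weight `lam`, and the data layout are all hypothetical and are not taken from the paper.

```python
import math


def softmax_cross_entropy(logits, label):
    """Multinomial logistic regression loss for a single sample."""
    m = max(logits)  # subtract the max for numerical stability
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - logits[label]


def rotation_penalty(feature, rotated_features):
    """Assumed regularizer: squared distance between the feature of the
    original sample and the mean feature of its rotated copies."""
    n = len(rotated_features)
    mean = [sum(col) / n for col in zip(*rotated_features)]
    return sum((a - b) ** 2 for a, b in zip(feature, mean))


def rotation_regularized_objective(samples, lam=0.5):
    """Average of classification loss plus lam * rotation penalty.

    samples: list of (logits, label, feature, rotated_features) tuples,
    where rotated_features holds the feature vectors computed from
    rotated versions of the same training sample.
    lam is a hypothetical trade-off weight.
    """
    total = 0.0
    for logits, label, feature, rotated in samples:
        total += softmax_cross_entropy(logits, label)
        total += lam * rotation_penalty(feature, rotated)
    return total / len(samples)
```

Under this assumed form, the penalty vanishes when the rotated copies map to the same feature as the original sample, so minimizing the objective pushes the learned layer toward rotation-invariant features while still fitting the class labels.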
Pages: 7405-7415
Number of pages: 11