Learning a robust CNN-based rotation insensitive model for ship detection in VHR remote sensing images

被引:28
作者
Dong, Zhong [1 ]
Lin, Baojun [1 ]
机构
[1] Chinese Acad Sci, Acad Optoelect, 9th Deng Zhuang South Rd, Beijing, Peoples R China
关键词
Convolutional neural networks - Remote sensing - Object recognition - Deep neural networks - Rotation - Image segmentation;
D O I
10.1080/01431161.2019.1706781
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Deep convolutional neural networks (CNN) have been widely applied in various fields, especially in the field of object detection. Deep CNN-based models showed great advantages over many traditional methods, even so, there are still many specific problems in the application of certain scenarios. In very high resolution (VHR) remote-sensing image datasets, the uncertainty of the object direction angle causes big trouble to the learning of the detector. Although the pooling operation can slightly alleviate the deviation caused by small angle, the feature learning of the objects with larger angle rotation still relies mainly on the sufficiency of sample data or effective data augmentation, which means the insufficiency of the training instances may cause serious performance degradation of the detector. In this paper, we propose a multi-angle box-based rotation insensitive object detection structure (MRI-CNN), which is an extended exploration for typical region-based CNN methods. On the one hand, we defined a set of directionally rotated bounding boxes before learning, and restricted the classification scene in a small angular range by rotated RoI (Region of Interest) pooling. On the other hand, we proposed a more effective screening method of bounding boxes, enabling the detector to adapt to diverse ground truth annotation methods and learn more accurate object localization. We trained our detector with different datasets containing different amount of training data, and the test results showed that the method proposed in this paper performs better than some mainstream detection methods when limited training data are provided in VHR remote-sensing datasets.
引用
收藏
页码:3614 / 3626
页数:13
相关论文
共 32 条
  • [1] The Handwritten Chinese Character Recognition Uses Convolutional Neural Networks with the GoogLeNet
    Bi, Ning
    Chen, Jiahao
    Tan, Jun
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2019, 33 (11)
  • [2] Bodla N., 2017, ARXIV170404503CSCV
  • [3] Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images
    Cheng, Gong
    Zhou, Peicheng
    Han, Junwei
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (12): : 7405 - 7415
  • [4] Multi-scale object detection in remote sensing imagery with convolutional neural networks
    Deng, Zhipeng
    Sun, Hao
    Zhou, Shilin
    Zhao, Juanping
    Lei, Lin
    Zou, Huanxin
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 145 : 3 - 22
  • [5] ImageNet Classification with Deep Convolutional Neural Networks
    Krizhevsky, Alex
    Sutskever, Ilya
    Hinton, Geoffrey E.
    [J]. COMMUNICATIONS OF THE ACM, 2017, 60 (06) : 84 - 90
  • [6] LeCun Y., 2015, Nature, V521, P436, DOI [DOI 10.1038/NATURE14539, 10.1038/nature14539]
  • [7] Multiscale Rotated Bounding Box-Based Deep Learning Method for Detecting Ship Targets in Remote Sensing Images
    Li, Shuxin
    Zhang, Zhilong
    Li, Biao
    Li, Chuwei
    [J]. SENSORS, 2018, 18 (08)
  • [8] Robust, discriminative and comprehensive dictionary learning for face recognition
    Lin, Guojun
    Yang, Meng
    Yang, Jian
    Shen, Linlin
    Xie, Weicheng
    [J]. PATTERN RECOGNITION, 2018, 81 : 341 - 356
  • [9] Liu L., 2017, ARXIV171109405CSCV
  • [10] SSD: Single Shot MultiBox Detector
    Liu, Wei
    Anguelov, Dragomir
    Erhan, Dumitru
    Szegedy, Christian
    Reed, Scott
    Fu, Cheng-Yang
    Berg, Alexander C.
    [J]. COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 : 21 - 37