Learning a robust CNN-based rotation insensitive model for ship detection in VHR remote sensing images

被引：28

作者：

Dong, Zhong ^{[1
]}

Lin, Baojun ^{[1
]}

机构：

[1] Chinese Acad Sci, Acad Optoelect, 9th Deng Zhuang South Rd, Beijing, Peoples R China

来源：

INTERNATIONAL JOURNAL OF REMOTE SENSING | 2020年 / 41卷 / 09期

关键词：

Convolutional neural networks - Remote sensing - Object recognition - Deep neural networks - Rotation - Image segmentation;

D O I：

10.1080/01431161.2019.1706781

中图分类号：

TP7 [遥感技术];

学科分类号：

081102 ; 0816 ; 081602 ; 083002 ; 1404 ;

摘要：

Deep convolutional neural networks (CNN) have been widely applied in various fields, especially in the field of object detection. Deep CNN-based models showed great advantages over many traditional methods, even so, there are still many specific problems in the application of certain scenarios. In very high resolution (VHR) remote-sensing image datasets, the uncertainty of the object direction angle causes big trouble to the learning of the detector. Although the pooling operation can slightly alleviate the deviation caused by small angle, the feature learning of the objects with larger angle rotation still relies mainly on the sufficiency of sample data or effective data augmentation, which means the insufficiency of the training instances may cause serious performance degradation of the detector. In this paper, we propose a multi-angle box-based rotation insensitive object detection structure (MRI-CNN), which is an extended exploration for typical region-based CNN methods. On the one hand, we defined a set of directionally rotated bounding boxes before learning, and restricted the classification scene in a small angular range by rotated RoI (Region of Interest) pooling. On the other hand, we proposed a more effective screening method of bounding boxes, enabling the detector to adapt to diverse ground truth annotation methods and learn more accurate object localization. We trained our detector with different datasets containing different amount of training data, and the test results showed that the method proposed in this paper performs better than some mainstream detection methods when limited training data are provided in VHR remote-sensing datasets.

引用

页码：3614 / 3626

页数：13

共 32 条

[1] The Handwritten Chinese Character Recognition Uses Convolutional Neural Networks with the GoogLeNet
Bi, Ning
Chen, Jiahao
Tan, Jun
[J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2019, 33 (11)
[2] Bodla N., 2017, ARXIV170404503CSCV
[3] Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images
Cheng, Gong
Zhou, Peicheng
Han, Junwei
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (12): : 7405 - 7415
[4] Multi-scale object detection in remote sensing imagery with convolutional neural networks
Deng, Zhipeng
Sun, Hao
Zhou, Shilin
Zhao, Juanping
Lei, Lin
Zou, Huanxin
[J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 145 : 3 - 22
[5] ImageNet Classification with Deep Convolutional Neural Networks
Krizhevsky, Alex
Sutskever, Ilya
Hinton, Geoffrey E.
[J]. COMMUNICATIONS OF THE ACM, 2017, 60 (06) : 84 - 90
[6] LeCun Y., 2015, Nature, V521, P436, DOI [DOI 10.1038/NATURE14539, 10.1038/nature14539]
[7] Multiscale Rotated Bounding Box-Based Deep Learning Method for Detecting Ship Targets in Remote Sensing Images
Li, Shuxin
Zhang, Zhilong
Li, Biao
Li, Chuwei
[J]. SENSORS, 2018, 18 (08)
[8] Robust, discriminative and comprehensive dictionary learning for face recognition
Lin, Guojun
Yang, Meng
Yang, Jian
Shen, Linlin
Xie, Weicheng
[J]. PATTERN RECOGNITION, 2018, 81 : 341 - 356
[9] Liu L., 2017, ARXIV171109405CSCV
[10] SSD: Single Shot MultiBox Detector
Liu, Wei
Anguelov, Dragomir
Erhan, Dumitru
Szegedy, Christian
Reed, Scott
Fu, Cheng-Yang
Berg, Alexander C.
[J]. COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 : 21 - 37

← 1 2 3 4 →