Improved YOLO-V3 with DenseNet for Multi-Scale Remote Sensing Target Detection

被引：107

作者：

Xu, Danqing ^{[1
]}

Wu, Yiquan ^{[1
]}

机构：

[1] Nanjing Univ Aeronaut & Astronaut, Coll Elect & Informat Engn, Nanjing 211106, Peoples R China

来源：

SENSORS | 2020年 / 20卷 / 15期

关键词：

remote sensing image; target detection; multi-scale; YOLO-V3; convolutional neural network; DenseNet; SEGMENTATION;

D O I：

10.3390/s20154276

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

Remote sensing targets have different dimensions, and they have the characteristics of dense distribution and a complex background. This makes remote sensing target detection difficult. With the aim at detecting remote sensing targets at different scales, a new You Only Look Once (YOLO)-V3-based model was proposed. YOLO-V3 is a new version of YOLO. Aiming at the defect of poor performance of YOLO-V3 in detecting remote sensing targets, we adopted DenseNet (Densely Connected Network) to enhance feature extraction capability. Moreover, the detection scales were increased to four based on the original YOLO-V3. The experiment on RSOD (Remote Sensing Object Detection) dataset and UCS-AOD (Dataset of Object Detection in Aerial Images) dataset showed that our approach performed better than Faster-RCNN, SSD (Single Shot Multibox Detector), YOLO-V3, and YOLO-V3 tiny in terms of accuracy. Compared with original YOLO-V3, the mAP (mean Average Precision) of our approach increased from 77.10% to 88.73% in the RSOD dataset. In particular, the mAP of detecting targets like aircrafts, which are mainly made up of small targets increased by 12.12%. In addition, the detection speed was not significantly reduced. Generally speaking, our approach achieved higher accuracy and gave considerations to real-time performance simultaneously for remote sensing target detection.

引用

页码：1 / 24

页数：23

共 55 条

[1] Adarsh P, 2020, INT CONF ADVAN COMPU, P687, DOI [10.1109/icaccs48705.2020.9074315, 10.1109/ICACCS48705.2020.9074315]
[2] Benchmark Revision for HOG-SVM Pedestrian Detector Through Reinvigorated Training and Evaluation Methodologies
Bilal, Muhammad
Hanif, Muhammad Shehzad
[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (03) : 1277 - 1287
[3] Bochkovskiy A., 2020, YOLOv4: Optimal Speed and Accuracy of Object Detection
[4] Rotation-reversal invariant HOG cascade for facial expression recognition
Chen, Jinhui
Takiguchi, Tetsuya
Ariki, Yasuo
[J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2017, 11 (08) : 1485 - 1492
[5] MDSSD: multi-scale deconvolutional single shot detector for small objects
Cui, Lisha
Ma, Rui
Lv, Pei
Jiang, Xiaoheng
Gao, Zhimin
Zhou, Bing
Xu, Mingliang
[J]. SCIENCE CHINA-INFORMATION SCIENCES, 2020, 63 (02)
[6] Calculation of the optimal segmentation scale in object-based multiresolution segmentation based on the scene complexity of high-resolution remote sensing images
Feng, Tianjing
Ma, Hairong
Cheng, Xinwen
Zhang, Hongping
[J]. JOURNAL OF APPLIED REMOTE SENSING, 2018, 12 (02):
[7] Fast R-CNN
Girshick, Ross
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1440 - 1448
[8] Rich feature hierarchies for accurate object detection and semantic segmentation
Girshick, Ross
Donahue, Jeff
Darrell, Trevor
Malik, Jitendra
[J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 580 - 587
[9] Guo C., 2019, P IEEECVF C COMPUTER
[10] Haque Md Foysal, 2018, [Journal of Korean Institute of Information Technology, 한국정보기술학회논문지], V16, P93, DOI 10.14801/jkiit.2018.16.10.93

← 1 2 3 4 5 6 →