Detecting Small Objects in Urban Settings Using SlimNet Model

被引：15

作者：

Yang, Zheng ^{[1
]}

Liu, Yaolin ^{[1
]}

Liu, Lirong ^{[2
]}

Tang, Xinming ^{[2
]}

Xie, Junfeng ^{[2
]}

Gao, Xiaoming ^{[2
]}

机构：

[1] Wuhan Univ, Sch Resource & Environm Sci, Wuhan 4300799, Hubei, Peoples R China

[2] MNR, Land Satellite Remote Sensing Applicat Ctr, Beijing 100048, Peoples R China

来源：

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2019年 / 57卷 / 11期

关键词：

Feature extraction; Licenses; Urban areas; Deep learning; Three-dimensional displays; Object detection; Roads; Convolution neural network (CNN); mobile mapping; small object detection; urban element detection (UED); LICENSE-PLATE DETECTION; EXTRACTION;

D O I：

10.1109/TGRS.2019.2921111

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

The automatic extraction of small objects such as roadside milestones, small traffic signs, and other urban furniture remains a technical challenge. This study focuses on methods of deep learning to detect small urban elements in mobile mapping system (MMS) images. Based on images obtained by an MMS in urban areas, we create an urban element detection (UED) data set containing several kinds of small objects found in a city. A simple feature extraction convolution neural network (CNN) called SlimNet is proposed and combined with an optimized faster R-CNN framework. The resulting deep learning method can automatically extract small objects commonly found in cities, including manhole covers, milestones, and license plates. Experiments on the UED data set show that SlimNet has the highest accuracy compared with other popular networks, including VGG, MobileNet, ResNet, and YOLOv3. The SlimNet model can achieve a mean average precision (AP) that is up to 12.3 higher than that of the lowest ResNet-152 network and can accelerate both training and detection owing to its relative simplicity. Moreover, $k$ -means clustering is used to choose the dimensions of the anchor box for detection. We ran $k$ -means clustering for different numbers of clusters, and the results show that at least four clusters are needed for detection using a small data set such as the UED. We also propose a method to use templates of different scales for anchors to further improve small object detection; this approach improved the AP by 34 in our experiments.

引用

页码：8445 / 8457

页数：13

共 49 条

[1] Vertical-Edge-Based Car-License-Plate Detection Method
Al-Ghaili, Abbas M.
Mashohor, Syamsiah
Ramli, Abdul Rahman
Ismail, Alyani
[J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2013, 62 (01) : 26 - 38
[2] [Anonymous], CALCULATING RECEPTIV
[3] [Anonymous], P 3 INT C LEARNING R
[4] [Anonymous], 2017, ARXIV171110398
[5] [Anonymous], 2015, ARXIV PREPRINT ARXIV
[6] [Anonymous], 2015, ARXIV, DOI DOI 10.48550/ARXIV.1504.08083
[7] [Anonymous], 2017, ARXIV170404861
[8] [Anonymous], 2017, P 31 AAAI C ART INT
[9] [Anonymous], PROC CVPR IEEE
[10] [Anonymous], ADV NEURAL INFORM PR, DOI DOI 10.1109/TPAMI.2016.2577031

← 1 2 3 4 5 →