Accurate Object Localization in Remote Sensing Images Based on Convolutional Neural Networks

被引:643
作者
Long, Yang [1 ]
Gong, Yiping [1 ]
Xiao, Zhifeng [1 ]
Liu, Qing [2 ]
机构
[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan 430079, Peoples R China
[2] Shenzhen Prafly Technol Co Ltd, Shenzhen 518048, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2017年 / 55卷 / 05期
关键词
Convolutional neural network (CNN); object localization; remote sensing images; unsupervised score-based bounding box regression (USB-BBR); SCENE CLASSIFICATION; TARGET DETECTION;
D O I
10.1109/TGRS.2016.2645610
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
In this paper, we focus on tackling the problem of automatic accurate localization of detected objects in high-resolution remote sensing images. The two major problems for object localization in remote sensing images caused by the complex context information such images contain are achieving generalizability of the features used to describe objects and achieving accurate object locations. To address these challenges, we propose a new object localization framework, which can be divided into three processes: region proposal, classification, and accurate object localization process. First, a region proposal method is used to generate candidate regions with the aim of detecting all objects of interest within these images. Then, generic image features from a local image corresponding to each region proposal are extracted by a combination model of 2-D reduction convolutional neural networks (CNNs). Finally, to improve the location accuracy, we propose an unsupervised score-based bounding box regression (USB-BBR) algorithm, combined with a nonmaximum suppression algorithm to optimize the bounding boxes of regions that detected as objects. Experiments show that the dimension-reduction model performs better than the retrained and fine-tuned models and the detection precision of the combined CNN model is much higher than that of any single model. Also our proposed USB-BBR algorithm can more accurately locate objects within an image. Compared with traditional features extraction methods, such as elliptic Fourier transform-based histogram of oriented gradients and local binary pattern histogram Fourier, our proposed localization framework shows robustness when dealing with different complex backgrounds.
引用
收藏
页码:2486 / 2498
页数:13
相关论文
共 50 条
  • [1] Agrawal P, 2014, LECT NOTES COMPUT SC, V8695, P329, DOI 10.1007/978-3-319-10584-0_22
  • [2] [Anonymous], P IAPR INT C MACH VI
  • [3] [Anonymous], 2014, ARXIV14045997V2
  • [4] [Anonymous], CAFFE CONVOLUT UNPUB
  • [5] [Anonymous], 2013, INT C MACHINE LEARNI
  • [6] [Anonymous], GPU ASYNCHRONO UNPUB
  • [7] [Anonymous], GOOD ARE DETEC UNPUB
  • [8] [Anonymous], UNDERSTANDING UNPUB
  • [9] [Anonymous], 2013, IMPROVING GEAR SELEC
  • [10] [Anonymous], DEEP RESIDUAL UNPUB