SatCNN: satellite image dataset classification using agile convolutional neural networks

Cited by: 94

Authors:

Zhong, Yanfei [1]

Fei, Feng [1]

Liu, Yanfei [1]

Zhao, Bei [2]

Jiao, Hongzan [3]

Zhang, Liangpei [1]

Affiliations:

[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan, Peoples R China

[2] Chinese Univ Hong Kong, Dept Geog & Resource Management, Hong Kong, Hong Kong, Peoples R China

[3] Wuhan Univ, Sch Urban Design, Wuhan, Peoples R China

Source:

REMOTE SENSING LETTERS | 2017 / Volume 8 / Issue 02

Funding:

National Natural Science Foundation of China;

Keywords:

SCENE CLASSIFICATION; FEATURES;

DOI:

10.1080/2150704X.2016.1235299

Chinese Library Classification (CLC) number:

TP7 [Remote sensing technology];

Discipline classification codes:

081102 ; 0816 ; 081602 ; 083002 ; 1404 ;

Abstract:

With the launch of various remote-sensing satellites, more and more high-spatial resolution remote-sensing (HSR-RS) images are becoming available. Scene classification of such a huge volume of HSR-RS images is a big challenge for the efficiency of feature learning and model training. The deep convolutional neural network (CNN), a typical deep learning model, is an efficient end-to-end deep hierarchical feature learning model that can capture the intrinsic features of input HSR-RS images. However, most published CNN architectures are borrowed from natural scene classification with thousands of training samples, and they are not designed for HSR-RS images. In this paper, we propose an agile CNN architecture, named SatCNN, for HSR-RS image scene classification. Based on recent improvements to modern CNN architectures, we use more efficient convolutional layers with smaller kernels to build an effective CNN architecture. Experiments on the SAT data sets confirmed that SatCNN can quickly and effectively learn robust features to handle the intra-class diversity even with small convolutional kernels, and that the deeper convolutional layers allow spontaneous modelling of the relative spatial relationships. With the help of fast graphics processing unit acceleration, SatCNN can be trained within about 40 min, achieving overall accuracies of 99.65% and 99.54%, which are state-of-the-art results for the SAT data sets.
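The abstract's point about smaller kernels can be illustrated with a short calculation. The sketch below is not SatCNN's actual configuration: the kernel sizes (3 and 5) and the channel width of 64 are assumptions chosen only for illustration, and the helper names are hypothetical. It compares one larger convolutional layer with two stacked smaller ones that cover the same receptive field.

```python
def conv_params(kernel, in_ch, out_ch):
    """Weights plus biases of one 2-D convolutional layer with a square kernel."""
    return kernel * kernel * in_ch * out_ch + out_ch


def stacked_receptive_field(kernels):
    """Receptive field of stride-1 convolutions applied one after another."""
    field = 1
    for k in kernels:
        field += k - 1
    return field


channels = 64  # hypothetical channel width, not taken from the paper

# One 5x5 layer versus two stacked 3x3 layers (illustrative sizes only).
one_large = conv_params(5, channels, channels)
two_small = 2 * conv_params(3, channels, channels)

print(stacked_receptive_field([5]), one_large)
print(stacked_receptive_field([3, 3]), two_small)
```

Under these assumptions both options see a 5 x 5 input window, but the stacked small-kernel version needs fewer parameters and applies one more non-linearity, which is the usual argument for deeper layers with smaller kernels.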

Pages: 136-145

Page count: 10
