Remote Sensing Image Scene Classification: Benchmark and State of the Art

被引:1955
作者
Cheng, Gong [1 ]
Han, Junwei [1 ]
Lu, Xiaoqiang [2 ]
机构
[1] Northwestern Polytech Univ, Sch Automat, Xian 710072, Shaanxi, Peoples R China
[2] Chinese Acad Sci, Xian Inst Opt & Precis Mech, Ctr OPT IMagery Anal & Learning OPTIMAL, State Key Lab Transient Opt & Photon, Xian 710119, Shaanxi, Peoples R China
基金
美国国家科学基金会;
关键词
Benchmark data set; deep learning; handcrafted features; remote sensing image; scene classification; unsupervised feature learning; GEOSPATIAL OBJECT DETECTION; TARGET DETECTION; FEATURE-SELECTION; SATELLITE IMAGES; VISUAL SALIENCY; DEEP; FEATURES; TEXTURE; REPRESENTATION; MULTISCALE;
D O I
10.1109/JPROC.2017.2675998
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Remote sensing image scene classification plays an important role in a wide range of applications and hence has been receiving remarkable attention. During the past years, significant efforts have been made to develop various data sets or present a variety of approaches for scene classification from remote sensing images. However, a systematic review of the literature concerning data sets and methods for scene classification is still lacking. In addition, almost all existing data sets have a number of limitations, including the small scale of scene classes and the image numbers, the lack of image variations and diversity, and the saturation of accuracy. These limitations severely limit the development of new approaches especially deep learning-based methods. This paper first provides a comprehensive review of the recent progress. Then, we propose a large-scale data set, termed "NWPU-RESISC45," which is a publicly available benchmark for REmote Sensing Image Scene Classification (RESISC), created by Northwestern Polytechnical University (NWPU). This data set contains 31 500 images, covering 45 scene classes with 700 images in each class. The proposed NWPU-RESISC45 1) is large-scale on the scene classes and the total image number; 2) holds big variations in translation, spatial resolution, viewpoint, object pose, illumination, background, and occlusion; and 3) has high within-class diversity and between-class similarity. The creation of this data set will enable the community to develop and evaluate various data-driven algorithms. Finally, several representative methods are evaluated using the proposed data set, and the results are reported as a useful baseline for future research.
引用
收藏
页码:1865 / 1883
页数:19
相关论文
共 176 条
[51]   Human Settlements: A Global Challenge for EO Data Processing and Interpretation [J].
Gamba, Paolo .
PROCEEDINGS OF THE IEEE, 2013, 101 (03) :570-581
[52]   Multimodal Classification of Remote Sensing Images: A Review and Future Directions [J].
Gomez-Chova, Luis ;
Tuia, Devis ;
Moser, Gabriele ;
Camps-Valls, Gustau .
PROCEEDINGS OF THE IEEE, 2015, 103 (09) :1560-1584
[53]  
Gong Cheng, 2016, 2016 4th International Workshop on Earth Observation and Remote Sensing Applications (EORSA), P433, DOI 10.1109/EORSA.2016.7552845
[54]   Classifying Compound Structures in Satellite Images: A Compressed Representation for Fast Queries [J].
Gueguen, Lionel .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2015, 53 (04) :1803-1818
[55]   Two-Stage Learning to Predict Human Eye Fixations via SDAEs [J].
Han, Junwei ;
Zhang, Dingwen ;
Wen, Shifeng ;
Guo, Lei ;
Liu, Tianming ;
Li, Xuelong .
IEEE TRANSACTIONS ON CYBERNETICS, 2016, 46 (02) :487-498
[56]   Background Prior-Based Salient Object Detection via Deep Reconstruction Residual [J].
Han, Junwei ;
Zhang, Dingwen ;
Hu, Xintao ;
Guo, Lei ;
Ren, Jinchang ;
Wu, Feng .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2015, 25 (08) :1309-1321
[57]   Object Detection in Optical Remote Sensing Images Based on Weakly Supervised Learning and High-Level Feature Learning [J].
Han, Junwei ;
Zhang, Dingwen ;
Cheng, Gong ;
Guo, Lei ;
Ren, Jinchang .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2015, 53 (06) :3325-3337
[58]   Efficient, simultaneous detection of multi-class geospatial targets based on visual saliency modeling and discriminative learning of sparse coding [J].
Han, Junwei ;
Zhou, Peicheng ;
Zhang, Dingwen ;
Cheng, Gong ;
Guo, Lei ;
Liu, Zhenbao ;
Bu, Shuhui ;
Wu, Jun .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2014, 89 :37-48
[59]   TEXTURAL FEATURES FOR IMAGE CLASSIFICATION [J].
HARALICK, RM ;
SHANMUGAM, K ;
DINSTEIN, I .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1973, SMC3 (06) :610-621
[60]   A comparison of three image-object methods for the multiscale analysis of landscape structure [J].
Hay, GJ ;
Blaschke, T ;
Marceau, DJ ;
Bouchard, A .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2003, 57 (5-6) :327-345