Scene Classification via a Gradient Boosting Random Convolutional Network Framework

被引:347
作者
Zhang, Fan [1 ]
Du, Bo [2 ]
Zhang, Liangpei [1 ]
机构
[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan 430079, Peoples R China
[2] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2016年 / 54卷 / 03期
基金
中国国家自然科学基金;
关键词
Convolutional networks (CNets); gradient boosting machine (GBM); scene classification; MODEL;
D O I
10.1109/TGRS.2015.2488681
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Due to the recent advances in satellite sensors, a large amount of high-resolution remote sensing images is now being obtained each day. How to automatically recognize and analyze scenes from these satellite images effectively and efficiently has become a big challenge in the remote sensing field. Recently, a lot of work in scene classification has been proposed, focusing on deep neural networks, which learn hierarchical internal feature representations from image data sets and produce state-of-the-art performance. However, most methods, including the traditional shallow methods and deep neural networks, only concentrate on training a single model. Meanwhile, neural network ensembles have proved to be a powerful and practical tool for a number of different predictive tasks. Can we find a way to combine different deep neural networks effectively and efficiently for scene classification? In this paper, we propose a gradient boosting random convolutional network (GBRCN) framework for scene classification, which can effectively combine many deep neural networks. As far as we know, this is the first time that a deep ensemble framework has been proposed for scene classification. Moreover, in the experiments, the proposed method was applied to two challenging high-resolution data sets: 1) the UC Merced data set containing 21 different aerial scene categories with a submeter resolution and 2) a Sydney data set containing eight land-use categories with a 1.0-m spatial resolution. The proposed GBRCN framework outperformed the state-of-the-art methods with the UC Merced data set, including the traditional single convolutional network approach. For the Sydney data set, the proposed method again obtained the best accuracy, demonstrating that the proposed framework can provide more accurate classification results than the state-of-the-art methods.
引用
收藏
页码:1793 / 1802
页数:10
相关论文
共 33 条
  • [1] [Anonymous], P ADV NEUR INF
  • [2] [Anonymous], 1989, P ADV NEUR INF PROC
  • [3] Bissacco A., 2007, CVPR
  • [4] Latent Dirichlet allocation
    Blei, DM
    Ng, AY
    Jordan, MI
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) : 993 - 1022
  • [5] Unsupervised Feature Learning for Aerial Scene Classification
    Cheriyadat, Anil M.
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2014, 52 (01): : 439 - 451
  • [6] A Discriminative Metric Learning Based Anomaly Detection Method
    Du, Bo
    Zhang, Liangpei
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2014, 52 (11): : 6844 - 6857
  • [7] Greedy function approximation: A gradient boosting machine
    Friedman, JH
    [J]. ANNALS OF STATISTICS, 2001, 29 (05) : 1189 - 1232
  • [8] Girshick R, 2013, P IEEE COMP SOC C CO
  • [9] NEURAL NETWORK ENSEMBLES
    HANSEN, LK
    SALAMON, P
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1990, 12 (10) : 993 - 1001
  • [10] He KM, 2015, PROC CVPR IEEE, P5353, DOI 10.1109/CVPR.2015.7299173