Saliency-Guided Unsupervised Feature Learning for Scene Classification

被引：481

作者：

Zhang, Fan ^{[1
]}

Du, Bo ^{[2
]}

Zhang, Liangpei ^{[1
]}

机构：

[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan 430079, Peoples R China

[2] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China

来源：

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2015年 / 53卷 / 04期

基金：

中国国家自然科学基金;

关键词：

Autoencoder; saliency detection; scene classification; unsupervised feature learning; LATENT DIRICHLET ALLOCATION; STRATEGIES;

D O I：

10.1109/TGRS.2014.2357078

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

Due to the rapid technological development of various different satellite sensors, a huge volume of high-resolution image data sets can now be acquired. How to efficiently represent and recognize the scenes from such high-resolution image data has become a critical task. In this paper, we propose an unsupervised feature learning framework for scene classification. By using the saliency detection algorithm, we extract a representative set of patches from the salient regions in the image data set. These unlabeled data patches are exploited by an unsupervised feature learning method to learn a set of feature extractors which are robust and efficient and do not need elaborately designed descriptors such as the scale-invariant-feature-transform-based algorithm. We show that the statistics generated from the learned feature extractors can characterize a complex scene very well and can produce excellent classification accuracy. In order to reduce overfitting in the feature learning step, we further employ a recently developed regularization method called "dropout," which has proved to be very effective in image classification. In the experiments, the proposed method was applied to two challenging high-resolution data sets: the UC Merced data set containing 21 different aerial scene categories with a submeter resolution and the Sydney data set containing seven land-use categories with a 60-cm spatial resolution. The proposed method obtained results that were equal to or even better than the previous best results with the UC Merced data set, and it also obtained the highest accuracy with the Sydney data set, demonstrating that the proposed unsupervised-feature-learning-based scene classification method provides more accurate classification results than the other latent-Dirichlet-allocation-based methods and the sparse coding method.

引用

页码：2175 / 2184

页数：10

共 38 条

[1]

Bengio Yoshua, 2006, Advances in Neural Information Processing Systems 19, V19, P153

[2] Latent Dirichlet allocation [J].

Blei, DM ;

Ng, AY ;

Jordan, MI .

JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022

[3] Scene classification using a hybrid generative/discriminative approach [J].

Bosch, Anna ;

Zisserman, Andrew ;

Munoz, Xavier .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (04) :712-727

[4]

BOUREAU YL, 2010, PROC CVPR IEEE, P2559, DOI DOI 10.1109/CVPR.2010.5539963

[5] LIBSVM: A Library for Support Vector Machines [J].

Chang, Chih-Chung ;

Lin, Chih-Jen .

ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)

[6] Unsupervised Feature Learning for Aerial Scene Classification [J].

Cheriyadat, Anil M. .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2014, 52 (01) :439-451

[7]

Ciresan D. C., 2012, ARXIV1202745

[8]

Coates A., 2011, JMLR WORKSHOP C P, P215

[9]

Csurka G, 2004, WORKSH STAT LEARN CO, V1, P1, DOI DOI 10.1234/12345678

[10] Target detection based on a dynamic subspace [J].

Du, Bo ;

Zhang, Liangpei .

PATTERN RECOGNITION, 2014, 47 (01) :344-358

← 1 2 3 4 →