Unsupervised Feature Learning Via Spectral Clustering of Multidimensional Patches for Remotely Sensed Scene Classification

被引:155
作者
Hu, Fan [1 ,2 ]
Xia, Gui-Song [1 ]
Wang, Zifeng [1 ]
Huang, Xin [1 ]
Zhang, Liangpei [1 ]
Sun, Hong [1 ,2 ]
机构
[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan 430072, Peoples R China
[2] Wuhan Univ, Elect Informat Sch, Wuhan 430072, Peoples R China
基金
中国国家自然科学基金;
关键词
Bag-of-visual-words (BOW) model; linear manifold analysis; scene classification; spectral clustering; unsupervised feature learning (UFL); MANIFOLD; SCALE; REPRESENTATIONS; NETWORK;
D O I
10.1109/JSTARS.2015.2444405
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Scene classification plays an important role in the interpretation of remotely sensed high-resolution imagery. However, the performance of scene classification strongly relies on the discriminative power of feature representation, which is generally hand-engineered and requires a huge amount of domain-expert knowledge as well as time-consuming hand tuning. Recently, unsupervised feature learning (UFL) provides an alternative way to automatically learn discriminative feature representation from images. However, the performances achieved by conventional UFL methods are not comparable to the state-of-the-art, mainly due to the neglect of locally substantial image structures. This paper presents an improved UFL algorithm based on spectral clustering, named UFL-SC, which cannot only adaptively learn good local feature representations but also discover intrinsic structures of local image patches. In contrast to the standard UFL pipeline, UFL-SC first maps the original image patches into a low-dimensional and intrinsic feature space by linear manifold analysis techniques, and then learns a dictionary (e.g., using K-means clustering) on the patch manifold for feature encoding. To generate a feature representation for each local patch, an explicit parameterized feature encoding method, i.e., triangle encoding, is applied with the learned dictionary on the same patch manifold. The holistic feature representation of image scenes is finally obtained by building a bag-of-visual-words (BOW) model of the encoded local features. Experiments demonstrate that the proposed UFL-SC algorithm can extract efficient local features for image scenes and show comparable performance to the state-of-the-art approach on open scene classification benchmark.
引用
收藏
页码:2015 / 2030
页数:16
相关论文
共 56 条
[1]   Learning Bayesian classifiers for scene classification with a visual grammar [J].
Aksoy, S ;
Koperski, K ;
Tusk, C ;
Marchisio, G ;
Tilton, JC .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2005, 43 (03) :581-589
[2]  
[Anonymous], 2010, P ADV NEUR INF PROC
[3]  
[Anonymous], 2003, P NEUR INF PROC SYST
[4]  
[Anonymous], ARXIV150201097
[5]  
[Anonymous], 2007, P 24 INT C MACH LEAR
[6]  
[Anonymous], IEEE T PATTERN ANAL
[7]  
[Anonymous], 2011, P 28 INT C MACHINE L
[8]  
[Anonymous], P IEEE INT C COMP VI
[9]   Improved manifold coordinate representations of large-scale hyperspectral scenes [J].
Bachmann, Charles M. ;
Ainsworth, Thomas L. ;
Fusina, Robert A. .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2006, 44 (10) :2786-2803
[10]   Laplacian eigenmaps for dimensionality reduction and data representation [J].
Belkin, M ;
Niyogi, P .
NEURAL COMPUTATION, 2003, 15 (06) :1373-1396