Batch-Mode Active-Learning Methods for the Interactive Classification of Remote Sensing Images

被引:252
作者
Demir, Begum [1 ]
Persello, Claudio [2 ]
Bruzzone, Lorenzo [2 ]
机构
[1] Kocaeli Univ, Dept Elect & Telecommun Engn, TR-41380 Kocaeli, Turkey
[2] Univ Trento, Dept Informat Engn & Comp Sci, I-38123 Trento, Italy
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2011年 / 49卷 / 03期
关键词
Active learning (AL); hyperspectral images; image classification; query functions; remote sensing (RS); support vector machines (SVMs); very high spatial resolution images; SEMISUPERVISED CLASSIFICATION;
D O I
10.1109/TGRS.2010.2072929
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
This paper investigates different batch-mode active-learning (AL) techniques for the classification of remote sensing (RS) images with support vector machines. This is done by generalizing to multiclass problem techniques defined for binary classifiers. The investigated techniques exploit different query functions, which are based on the evaluation of two criteria: uncertainty and diversity. The uncertainty criterion is associated to the confidence of the supervised algorithm in correctly classifying the considered sample, while the diversity criterion aims at selecting a set of unlabeled samples that are as more diverse (distant one another) as possible, thus reducing the redundancy among the selected samples. The combination of the two criteria results in the selection of the potentially most informative set of samples at each iteration of the AL process. Moreover, we propose a novel query function that is based on a kernel-clustering technique for assessing the diversity of samples and a new strategy for selecting the most informative representative sample from each cluster. The investigated and proposed techniques are theoretically and experimentally compared with state-of-the-art methods adopted for RS applications. This is accomplished by considering very high resolution multispectral and hyperspectral images. By this comparison, we observed that the proposed method resulted in better accuracy with respect to other investigated and state-of-the art methods on both the considered data sets. Furthermore, we derived some guidelines on the design of AL systems for the classification of different types of RS images.
引用
收藏
页码:1014 / 1031
页数:18
相关论文
共 41 条
[11]  
Dagan Ido, 1995, Proceedings of the 12th International Conference on Machine Learning, P150
[12]  
DALPONTE M, 2009, P IEEE IGARSS CAP TO, P1008
[13]   Thematic map comparison: Evaluating the statistical significance of differences in classification accuracy [J].
Foody, GM .
PHOTOGRAMMETRIC ENGINEERING AND REMOTE SENSING, 2004, 70 (05) :627-633
[14]   Selective sampling using the query by committee algorithm [J].
Freund, Y ;
Seung, HS ;
Shamir, E ;
Tishby, N .
MACHINE LEARNING, 1997, 28 (2-3) :133-168
[15]   Statistical active learning in multilayer perceptrons [J].
Fukumizu, K .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2000, 11 (01) :17-26
[16]   Investigation of the random forest framework for classification of hyperspectral data [J].
Ham, J ;
Chen, YC ;
Crawford, MM ;
Ghosh, J .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2005, 43 (03) :492-501
[17]   Batch Mode Active Learning with Applications to Text Categorization and Image Retrieval [J].
Hoi, Steven C. H. ;
Jin, Rong ;
Lyu, Michael R. .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (09) :1233-1248
[18]  
Hoi Steven C. H., 2006, Proceedings of the 23rd international conference on Machine learning, ICML '06, P417
[19]  
Jain A. K., 1988, Algorithms for Clustering Data
[20]   Confidence-based active learning [J].
Li, Mingkun ;
Sethi, Ishwar K. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (08) :1251-1261