A Multiscale Latent Dirichlet Allocation Model for Object-Oriented Clustering of VHR Panchromatic Satellite Images

被引:34
作者
Tang, Hong [1 ,2 ]
Shen, Li [1 ,2 ]
Qi, Yinfeng [1 ,2 ]
Chen, Yunhao [1 ,2 ]
Shu, Yang [1 ,2 ]
Li, Jing [1 ,2 ]
Clausi, David A. [3 ]
机构
[1] Beijing Normal Univ, State Key Lab Earth Surface Proc & Resource Ecol, Beijing 100875, Peoples R China
[2] Beijing Normal Univ, Key Lab Environm Change & Nat Disaster, Beijing 100875, Peoples R China
[3] Univ Waterloo, Dept Syst Design Engn, Waterloo, ON N2L 3G1, Canada
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2013年 / 51卷 / 03期
关键词
Latent Dirichlet allocation (LDA); object-oriented clustering; probabilistic topic models; scale space theory; MARKOV RANDOM-FIELD; SEGMENTATION; CLASSIFICATION;
D O I
10.1109/TGRS.2012.2205579
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
A novel model is presented to address the problem of semantic clustering of geo-objects in very high resolution panchromatic satellite images. The proposed model combines a probabilistic topic model with a multiscale image representation into an automatic framework by embedding both document and scale selections. The probabilistic topic model is used to characterize the statistical distributions of both intraclass appearance and inter-class coherence of geo-objects within documents, i.e., squared sub-images. Because the bag-of-words assumption involved in the probabilistic topic models does not consider the spatial coherence between topic labels, the multiscale image representation is designed to provide a self-adaptive spatial regularization for various geo-object categories. By introducing scale and document selections, the automatic framework integrates the probabilistic topic model and the multiscale image representation to ensure that words on a site should be allocated the same topic label no matter what documents they reside in. Consequently, unlike the traditional method of applying topic models for analyzing satellite images, the process of explicitly generating a set of documents before modeling and then combining multiple labels for a word on a given site is unnecessary. Gibbs sampling is adopted for parameter estimation and image clustering. Extensive experimental evaluations are designed to first analyze the effect of parameters in the proposed model and then compare the results of our model with those of some state-of-the-art methods for three different types of images. The results indicate that the proposed algorithm consistently outperforms these exiting state-of-the-art methods in all of the experiments.
引用
收藏
页码:1680 / 1692
页数:13
相关论文
共 41 条
[1]   Automatic detection of geospatial objects using multiple hierarchical segmentations [J].
Akcay, H. Goekhan ;
Aksoy, Selim .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2008, 46 (07) :2097-2111
[2]  
[Anonymous], 2008, PARAMETER ESTIMATION
[3]  
[Anonymous], 2007, NIPS
[4]  
[Anonymous], 2003, P 26 ANN INT ACM SIG
[5]   Object based image analysis for remote sensing [J].
Blaschke, T. .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2010, 65 (01) :2-16
[6]   Probabilistic Topic Models [J].
Blei, David ;
Carin, Lawrence ;
Dunson, David .
IEEE SIGNAL PROCESSING MAGAZINE, 2010, 27 (06) :55-65
[7]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[8]  
Castilla G., 2008, Object-Based Image Analysis - Spatial Concepts for Knowledge-Driven Remote Sensing Applications
[9]   A COEFFICIENT OF AGREEMENT FOR NOMINAL SCALES [J].
COHEN, J .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1960, 20 (01) :37-46
[10]  
DEERWESTER S, 1990, J AM SOC INFORM SCI, V41, P391, DOI 10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO