A 2-D wavelet decomposition-based bag-of-visual-words model for land-use scene classification

被引：75

作者：

Zhao, Lijun ^{[1
,2
]}

Tang, Ping ^{[1
]}

Huo, Lianzhi ^{[1
]}

机构：

[1] Chinese Acad Sci, Inst Remote Sensing & Digital Earth, Beijing 100101, Peoples R China

[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China

来源：

INTERNATIONAL JOURNAL OF REMOTE SENSING | 2014年 / 35卷 / 06期

关键词：

FEATURE-EXTRACTION; COVER CLASSIFICATION; TEXTURE; FEATURES; IMAGERY; KERNELS; SCALE;

D O I：

10.1080/01431161.2014.890762

中图分类号：

TP7 [遥感技术];

学科分类号：

081102 ; 0816 ; 081602 ; 083002 ; 1404 ;

摘要：

Previous works about spatial information incorporation into a traditional bag-of-visual-words (BOVW) model mainly consider the spatial arrangement of an image, ignoring the rich textural information in land-use remote-sensing images. Hence, this article presents a 2-D wavelet decomposition (WD)-based BOVW model for land-use scene classification, since the 2-D wavelet decomposition method does well not only in textural feature extraction, but also in the multi-resolution representation of an image, which is favourable for the use of both spatial arrangement and textural information in land-use images. The proposed method exploits the textural structures of an image with colour information transformed into greyscale. Moreover, it works first by decomposing the greyscale image into different sub-images using 2-D discrete wavelet transform (DWT) and then by extracting local features of the greyscale image and all the decomposed images with dense regions in which a given image is evenly sampled by a regular grid with a specified grid space. After that, the method generates the corresponding visual vocabularies and computes histograms of visual word occurrences of local features found in each former image. Specifically, the soft-assignment or multi-assignment (MA) technique is employed, accounting for the impact of clustering on visual vocabulary creation that two similar image patches may be clustered into different clusters when increasing the size of visual vocabulary. The proposed method is evaluated on a ground truth image dataset of 21 land-use classes manually extracted from high-resolution remote-sensing images. Experimental results demonstrate that the proposed method significantly outperforms previous methods, such as the traditional BOVW model, the spatial pyramid representation-based BOVW method, the multi-resolution representation-based BOVW method, and so on, and even exceeds the best result obtained from the creator of the land-use dataset. Therefore, the proposed approach is very suitable for land-use scene classification tasks.

引用

页码：2296 / 2310

页数：15

共 44 条

[1] [Anonymous], 2007, P 6 ACM INT C IMAGE
[2] Texture classification using wavelet transform
Arivazhagan, S
Ganesan, L
[J]. PATTERN RECOGNITION LETTERS, 2003, 24 (9-10) : 1513 - 1521
[3] Barla A, 2003, 2003 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL 3, PROCEEDINGS, P513
[4] Bag of spatio-visual words for context inference in scene classification
Bolovinou, A.
Pratikakis, I.
Perantonis, S.
[J]. PATTERN RECOGNITION, 2013, 46 (03) : 1039 - 1053
[5] Scene classification using a hybrid generative/discriminative approach
Bosch, Anna
Zisserman, Andrew
Munoz, Xavier
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (04) : 712 - 727
[6] Which is the best way to organize/classify images by content?
Bosch, Anna
Munoz, Xavier
Marti, Robert
[J]. IMAGE AND VISION COMPUTING, 2007, 25 (06) : 778 - 791
[7] LIBSVM: A Library for Support Vector Machines
Chang, Chih-Chung
Lin, Chih-Jen
[J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[8] Automatic landslide detection from remote-sensing imagery using a scene classification method based on BoVW and pLSA
Cheng, Gong
Guo, Lei
Zhao, Tianyun
Han, Junwei
Li, Huihui
Fang, Jun
[J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2013, 34 (01) : 45 - 59
[9] Latent semantic kernels
Cristianini, N
Shawe-Taylor, J
Lodhi, H
[J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2002, 18 (2-3) : 127 - 152
[10] Csurka G., 2004, WORKSH STAT LEARN CO, V1, P1, DOI DOI 10.1234/12345678

← 1 2 3 4 5 →