Discriminative compact pyramids for object and scene recognition

Cited by: 44

Authors:

Elfiky, Noha M. [1]

Shahbaz Khan, Fahad

van de Weijer, Joost

Gonzalez, Jordi

Affiliations:

[1] Campus Univ Autonoma Barcelona, Dept Comp Sci, Bellaterra 08193, Barcelona, Spain

Source:

PATTERN RECOGNITION | 2012 / Vol. 45 / Issue 04

Keywords:

Object and scene recognition; Bag of features; Pyramid representation; AIB; DITC; PERFORMANCE EVALUATION; CLASSIFICATION; TEXTURE;

DOI:

10.1016/j.patcog.2011.09.020

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Spatial pyramids have been successfully applied to incorporate spatial information into bag-of-words based image representations. However, a major drawback is that they lead to high-dimensional image representations. In this paper, we present a novel framework for obtaining compact pyramid representations. First, we investigate the use of the divisive information-theoretic feature clustering (DITC) algorithm in creating a compact pyramid representation. In many cases this method allows us to reduce the size of a high-dimensional pyramid representation by up to an order of magnitude with little or no loss in accuracy. Furthermore, a comparison with clustering based on the agglomerative information bottleneck (AIB) shows that our method obtains superior results at significantly lower computational cost. Moreover, we investigate the optimal combination of multiple features in the context of our compact pyramid representation. Finally, experiments show that the method can obtain state-of-the-art results on several challenging data sets. (C) 2011 Elsevier Ltd. All rights reserved.
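To make the idea of compacting a visual vocabulary more concrete, the following is a minimal sketch in the spirit of DITC, not the authors' implementation. It assumes each visual word w comes with a prior p(w) and a class posterior p(class | w), and it alternates between recomputing the prior-weighted class distribution of each cluster and reassigning every word to the cluster with the smallest KL divergence. All names (`ditc_like`, `compress`, `priors`, `posteriors`) and the initialisation scheme are hypothetical choices made for illustration.

```python
import math

def kl(p, q, eps=1e-12):
    """KL divergence KL(p || q) between two discrete distributions (lists)."""
    return sum(pi * math.log(pi / max(qi, eps)) for pi, qi in zip(p, q) if pi > 0)

def cluster_posterior(members, priors, posteriors, n_classes):
    """Prior-weighted mean of p(class | word) over the words in one cluster."""
    mass = sum(priors[w] for w in members)
    return [sum(priors[w] * posteriors[w][c] for w in members) / mass
            for c in range(n_classes)]

def ditc_like(priors, posteriors, k, n_iter=20):
    """Group words into k clusters, k-means style, with KL divergence as distance.

    priors[w]     -- p(word w)                     (assumed input)
    posteriors[w] -- p(class | word w) as a list   (assumed input)
    Returns a list that maps each word index to a cluster index.
    """
    n_words, n_classes = len(priors), len(posteriors[0])
    assign = [w % k for w in range(n_words)]  # simple deterministic initialisation
    for _ in range(n_iter):
        centers = []
        for j in range(k):
            members = [w for w in range(n_words) if assign[w] == j]
            # An empty cluster falls back to the uniform distribution.
            centers.append(cluster_posterior(members, priors, posteriors, n_classes)
                           if members else [1.0 / n_classes] * n_classes)
        new_assign = [min(range(k), key=lambda j: kl(posteriors[w], centers[j]))
                      for w in range(n_words)]
        if new_assign == assign:  # converged: no word changed cluster
            break
        assign = new_assign
    return assign

def compress(histogram, assign, k):
    """Sum the histogram bins of all words that share a cluster."""
    out = [0.0] * k
    for w, count in enumerate(histogram):
        out[assign[w]] += count
    return out

# Toy example: 4 visual words, 2 classes, compressed to 2 clusters.
priors = [0.25, 0.25, 0.25, 0.25]
posteriors = [[0.9, 0.1], [0.85, 0.15], [0.1, 0.9], [0.2, 0.8]]
assign = ditc_like(priors, posteriors, k=2)
compact = compress([3, 1, 2, 4], assign, k=2)
```

In this toy run the two words that favour the first class end up in one cluster and the two that favour the second class in the other, so a 4-bin histogram shrinks to 2 bins. In a pyramid setting the same merging would be applied to the concatenated per-cell histograms, which is where the dimensionality savings described in the abstract would come from.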


Pages: 1627-1636

Page count: 10
