Compact Representation of High-Dimensional Feature Vectors for Large-Scale Image Recognition and Retrieval

Cited by: 1776

Authors:

Zhang, Yu [1,2]

Wu, Jianxin [3]

Cai, Jianfei [1]

Affiliations:

[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore

[2] Agcy Sci Technol & Res, Bioinformat Inst, Singapore 138671, Singapore

[3] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210023, Jiangsu, Peoples R China

Source:

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2016 / Vol. 25 / No. 05

Funding:

National Natural Science Foundation of China;

Keywords:

Feature selection; large scale; image representation; FEATURE-SELECTION; PRODUCT QUANTIZATION; CLASSIFICATION; RELEVANCE;

DOI:

10.1109/TIP.2016.2549360

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In large-scale visual recognition and image retrieval tasks, feature vectors such as the Fisher vector (FV) and the vector of locally aggregated descriptors (VLAD) have achieved state-of-the-art results. However, the combination of large numbers of examples and high-dimensional vectors necessitates dimensionality reduction to bring storage and CPU costs into a reasonable range. In spite of the popularity of various feature compression methods, this paper shows that feature (dimension) selection is a better choice for high-dimensional FV/VLAD than feature (dimension) compression methods, e.g., product quantization. We show that strong correlation among the feature dimensions in the FV and the VLAD may not exist, which renders feature selection a natural choice. We also show that many dimensions in FV/VLAD are noise; discarding them through feature selection is better than compressing them together with the useful dimensions using feature compression methods. To choose features, we propose an efficient importance sorting algorithm that covers both the supervised and the unsupervised case, for visual recognition and image retrieval, respectively. Combined with 1-bit quantization, feature selection achieves both higher accuracy and lower computational cost than feature compression methods, such as product quantization, on the FV and the VLAD image representations.
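
The pipeline described in the abstract (rank dimensions by importance, keep the top ones, then store one bit per kept dimension) can be illustrated with a minimal sketch. The abstract does not state the importance score or the quantization threshold, so the per-dimension variance score, the zero threshold, the function name `select_and_binarize`, and the toy data below are all assumptions made for illustration, not the authors' method.

```python
import numpy as np

def select_and_binarize(X, keep, threshold=0.0):
    """Keep the `keep` highest-scoring dimensions of X, then 1-bit quantize them."""
    # Assumed importance score: per-dimension variance (a stand-in, not from the abstract).
    scores = X.var(axis=0)
    # Importance sorting: dimension indices ordered from highest to lowest score.
    order = np.argsort(scores)[::-1]
    selected = np.sort(order[:keep])
    # 1-bit quantization: one bit per kept dimension (threshold is an assumption).
    bits = X[:, selected] >= threshold
    # Pack 8 bits into each byte for compact storage.
    return selected, np.packbits(bits, axis=1)

rng = np.random.default_rng(0)
# Toy stand-in for FV/VLAD vectors: 100 samples, 64 dimensions of varying scale.
scales = np.linspace(0.1, 2.0, 64)
X = rng.standard_normal((100, 64)) * scales
selected, codes = select_and_binarize(X, keep=16)
```

In this toy setting each vector shrinks from 64 floating-point values to 16 bits (2 bytes). In the supervised case, a label-dependent score would take the place of the variance.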

Pages: 2407-2419

Page count: 13
