Improving Bag-of-Features for Large Scale Image Search

被引:284
作者
Jegou, Herve [1 ]
Douze, Matthijs [1 ]
Schmid, Cordelia [1 ]
机构
[1] INRIA Grenoble Rhone Alpes, F-38334 Montbonnot St Martin, Saint Ismier, France
关键词
Image retrieval; Nearest neighbor search; Object recognition; Image search;
D O I
10.1007/s11263-009-0285-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article improves recent methods for large scale image search. We first analyze the bag-of-features approach in the framework of approximate nearest neighbor search. This leads us to derive a more precise representation based on Hamming embedding (HE) and weak geometric consistency constraints (WGC). HE provides binary signatures that refine the matching based on visual words. WGC filters matching descriptors that are not consistent in terms of angle and scale. HE and WGC are integrated within an inverted file and are efficiently exploited for all images in the dataset. We then introduce a graph-structured quantizer which significantly speeds up the assignment of the descriptors to visual words. A comparison with the state of the art shows the interest of our approach when high accuracy is needed. Experiments performed on three reference datasets and a dataset of one million of images show a significant improvement due to the binary signature and the weak geometric consistency constraints, as well as their efficiency. Estimation of the full geometric transformation, i.e., a re-ranking step on a short-list of images, is shown to be complementary to our weak geometric consistency constraints. Our approach is shown to outperform the state-of-the-art on the three datasets.
引用
收藏
页码:316 / 336
页数:21
相关论文
共 23 条
[11]  
Jegou H., 2008, INRIA HOLIDAYS DATAS
[12]   Feature detection with automatic scale selection [J].
Lindeberg, T .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 1998, 30 (02) :79-116
[13]   Distinctive image features from scale-invariant keypoints [J].
Lowe, DG .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 60 (02) :91-110
[14]  
Matas J., 2002, Electronic Proceedings of the 13th British Machine Vision Conference, P384
[15]   Scale & affine invariant interest point detectors [J].
Mikolajczyk, K ;
Schmid, C .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 60 (01) :63-86
[16]  
MUJA M, 2009, INT C COMP VIS APPL
[17]   Modeling the shape of the scene: A holistic representation of the spatial envelope [J].
Oliva, A ;
Torralba, A .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2001, 42 (03) :145-175
[18]  
OMERCEVIC D, 2007, INT C COMP VIS
[19]  
PHILBIN J, 2008, C COMP VIS PATT REC
[20]  
SCHINDLER G, 2007, C COMP VIS PATT REC