Learning Vocabularies over a Fine Quantization

被引:50
作者
Mikulik, Andrej [1 ]
Perdoch, Michal [1 ]
Chum, Ondrej [1 ]
Matas, Jiri [1 ]
机构
[1] Czech Tech Univ, CMP, Dept Cybernet, Fac Elect Engn, CR-16635 Prague, Czech Republic
关键词
Image retrieval; Vocabulary; Feature track;
D O I
10.1007/s11263-012-0600-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A novel similarity measure for bag-of-words type large scale image retrieval is presented. The similarity function is learned in an unsupervised manner, requires no extra space over the standard bag-of-words method and is more discriminative than both L2-based soft assignment and Hamming embedding. The novel similarity function achieves mean average precision that is superior to any result published in the literature on the standard Oxford 5k, Oxford 105k and Paris datasets/protocols. We study the effect of a fine quantization and very large vocabularies (up to 64 million words) and show that the performance of specific object retrieval increases with the size of the vocabulary. This observation is in contradiction with previously published results. We further demonstrate that the large vocabularies increase the speed of the tf-idf scoring step.
引用
收藏
页码:163 / 175
页数:13
相关论文
共 30 条
[1]  
Agarwal S, 2009, P ICCV KYOT
[2]  
Avrithis Y., 2012, P EUR C COMP VIS ECC
[3]  
Baeza-Yates R, 1999, MODERN INFORM RETRIE
[4]  
Cech J, 2008, P CVPR ANCH
[5]  
Chum O, 2007, P ICCV RIO DE JAN
[6]  
Chum O, 2009, P CVPR MIAM
[7]   Large-Scale Discovery of Spatially Related Images [J].
Chum, Ondrej ;
Matas, Jiri .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (02) :371-377
[8]  
Duda R.O., 1995, Pattern Classification and Scene Analysis, Vsecond
[9]  
Ferrari V, 2004, P ECCV PRAG
[10]  
Fraundorfer F, 2007, P CVPR MINN