Beyond the Euclidean distance: Creating effective visual codebooks using the histogram intersection kernel

Cited by: 134
Authors
Wu, Jianxin [1]
Rehg, James M. [1]
Affiliations
[1] Georgia Inst Technol, Ctr Robot & Intelligent Machines, Sch Interact Comp, Atlanta, GA 30332 USA
Source
2009 IEEE 12th International Conference on Computer Vision (ICCV) | 2009
DOI
10.1109/ICCV.2009.5459178
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Common visual codebook generation methods used in a Bag of Visual Words model, e.g., k-means or Gaussian Mixture Model, use the Euclidean distance to cluster features into visual code words. However, most popular visual descriptors are histograms of image measurements. It has been shown that the Histogram Intersection Kernel (HIK) is more effective than the Euclidean distance in supervised learning tasks with histogram features. In this paper, we demonstrate that HIK can also be used in an unsupervised manner to significantly improve the generation of visual codebooks. We propose a histogram kernel k-means algorithm which is easy to implement and runs almost as fast as k-means. The HIK codebook consistently achieves 2-4% higher recognition accuracy than k-means codebooks. In addition, we propose a one-class SVM formulation to create more effective visual code words which can achieve even higher accuracy. The proposed method has established new state-of-the-art performance numbers for 3 popular benchmark datasets on object and scene recognition. In addition, we show that the standard k-median clustering method can be used for visual codebook generation and can act as a compromise between HIK and k-means approaches.
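To make the idea in the abstract concrete, the following is a minimal sketch of clustering histograms with the histogram intersection kernel, K(x, y) = sum_d min(x_d, y_d), in place of the Euclidean distance. It is a naive kernel k-means on a precomputed Gram matrix, not the fast algorithm the paper proposes. The names hik, kernel_kmeans and init_labels are hypothetical, and the caller is assumed to supply the initial cluster assignment.

```python
import numpy as np

def hik(X, Y):
    """Histogram intersection kernel: K[i, j] = sum_d min(X[i, d], Y[j, d])."""
    return np.minimum(X[:, None, :], Y[None, :, :]).sum(axis=2)

def kernel_kmeans(K, init_labels, k, n_iter=50):
    """Naive kernel k-means on a precomputed n x n Gram matrix K.

    init_labels is the starting assignment (values in 0..k-1); this
    parameter is an assumption of the sketch, not taken from the paper.
    """
    n = K.shape[0]
    labels = np.asarray(init_labels).copy()
    self_sim = np.diag(K)
    for _ in range(n_iter):
        dist = np.full((n, k), np.inf)
        for c in range(k):
            members = labels == c
            m = members.sum()
            if m == 0:
                continue  # an empty cluster stays at infinite distance
            # Squared feature-space distance to the cluster mean:
            # K(x,x) - (2/m) sum_j K(x,j) + (1/m^2) sum_{j,l} K(j,l)
            dist[:, c] = (self_sim
                          - 2.0 * K[:, members].sum(axis=1) / m
                          + K[np.ix_(members, members)].sum() / m**2)
        new_labels = dist.argmin(axis=1)
        if np.array_equal(new_labels, labels):
            break  # assignments stopped changing
        labels = new_labels
    return labels
```

Each iteration of this sketch costs on the order of n^2 kernel lookups, so it only illustrates the objective. The abstract's claim of running almost as fast as k-means refers to the authors' own algorithm, which is not reproduced here.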
Pages: 630-637
Page count: 8