Image representations and feature selection for multimedia database search

被引:33
作者
Evgeniou, T
Pontil, M
Papageorgiou, C
Poggio, T
机构
[1] INSEAD, Technol Management Dept, F-77300 Fontainebleau, France
[2] UCL, Dept Comp Sci, London WC1E 6BT, England
[3] MIT, Ctr Biol & Computat Learning, Cambridge, MA 02142 USA
关键词
machine learning; object detection; support vector machines; image representation; multimedia data search;
D O I
10.1109/TKDE.2003.1209008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The success of a multimedia information system depends heavily on the way the data is represented. Although there are "natural" ways to represent numerical data, it is not clear what is a good way to represent multimedia data, such as images, video, or sound. In this paper, we investigate various image representations where the quality of the representation is judged based on how well a system for searching through an image database can perform-although the same techniques and representations can be used for other types of object detection tasks or multimedia data analysis problems. The system is based on a machine learning method used to develop object detection models from example images that can subsequently be used for examples to detect-search-images of a particular object in an image database. As a base classifier for the detection task, we use support vector machines (SVM), a kernel-based learning method. Within the framework of kernel classifiers, we investigate new image representations/kernels derived from probabilistic models of the class of images considered and present a new feature selection method which can be used to reduce the dimensionality of the image representation without significant losses in terms of the performance of the detection-search-system.
引用
收藏
页码:911 / 920
页数:10
相关论文
共 40 条
[31]   DETECTION OF THE MOVEMENTS OF PERSONS FROM A SPARSE SEQUENCE OF TV IMAGES [J].
TSUKIYAMA, T ;
SHIRAI, Y .
PATTERN RECOGNITION, 1985, 18 (3-4) :207-213
[32]  
Turk M. A., 1991, Proceedings 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (91CH2983-5), P586, DOI 10.1109/CVPR.1991.139758
[33]  
VAILLANT R, 1994, IEEE P VISUAL IMAGE, V141
[34]  
Vapnik V., 1998, STAT LEARNING THEORY, V1, P2
[35]   UNIFORM CONVERGENCE OF RELATIVE FREQUENCIES OF EVENTS TO THEIR PROBABILITIES [J].
VAPNIK, VN ;
CHERVONENKIS, AY .
THEORY OF PROBILITY AND ITS APPLICATIONS,USSR, 1971, 16 (02) :264-+
[36]  
VELTKAMP RC, 2000, UUCS200034 DEP COMP
[37]   THE IMPORTANCE OF SYMMETRY AND VIRTUAL VIEWS IN 3-DIMENSIONAL OBJECT RECOGNITION [J].
VETTER, T ;
POGGIO, T ;
BULTHOFF, HH .
CURRENT BIOLOGY, 1994, 4 (01) :18-23
[38]  
Wahba G., 1990, SERIES APPL MATH
[39]  
WREN C, 1995, 353 MIT MED LAB
[40]   FEATURE-EXTRACTION FROM FACES USING DEFORMABLE TEMPLATES [J].
YUILLE, AL ;
HALLINAN, PW ;
COHEN, DS .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 1992, 8 (02) :99-111