Learning by kernel polarization

被引:52
作者
Baram, Y [1 ]
机构
[1] Technion Israel Inst Technol, Dept Comp Sci, IL-32000 Haifa, Israel
关键词
D O I
10.1162/0899766053630341
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Kernels are key components of pattern recognition mechanisms. We propose a universal kernel optimality criterion, which is independent of the classifier to be used. Defining data polarization as a process by which points of different classes are driven to geometrically opposite locations in a confined domain, we propose selecting the kernel parameter values that polarize the data in the associated feature space. Conversely, the kernel is said to be polarized by the data. Kernel polarization gives rise to an unconstrained optimization problem. We show that complete kernel polarization yields consistent classification by kernel-sum classifiers. Tested on real-life data, polarized kernels demonstrate a clear advantage over the Euclidean distance in proximity classifiers. Embedded in a support vectors classifier, kernel polarization is found to yield about the same performance as exhaustive parameter search.
引用
收藏
页码:1264 / 1275
页数:12
相关论文
共 16 条
[1]  
[Anonymous], DOKL AKAD NAUK SSSR
[2]  
[Anonymous], 2001, NEURAL NETWORKS COMP
[3]   Classification by balanced binary representation [J].
Baram, Y .
NEUROCOMPUTING, 1996, 13 (2-4) :347-357
[4]  
Cristianini N, 2002, ADV NEUR IN, V14, P367
[5]  
Fukunaga K., 1990, INTRO STAT PATTERN R
[6]  
HERBRICH R, 2000, THESIS TU BERLIN
[7]  
Micchelli Charles A., 1986, S APPL MATH NEW YORK, Vvol. 36, P81
[8]   ESTIMATION OF A PROBABILITY DENSITY-FUNCTION AND MODE [J].
PARZEN, E .
ANNALS OF MATHEMATICAL STATISTICS, 1962, 33 (03) :1065-&
[9]  
Platt JC, 2002, ADV NEUR IN, V14, P1425
[10]   THE PERCEPTRON - A PROBABILISTIC MODEL FOR INFORMATION-STORAGE AND ORGANIZATION IN THE BRAIN [J].
ROSENBLATT, F .
PSYCHOLOGICAL REVIEW, 1958, 65 (06) :386-408