Support vector clustering

被引:923
作者
Ben-Hur, A
Horn, D
Siegelmann, HT
Vapnik, V
机构
[1] BIOwulf Technol, Berkeley, CA 94704 USA
[2] Tel Aviv Univ, Raymond & Beverly Sackler Fac Exact Sci, Sch Phys & Astron, IL-69978 Tel Aviv, Israel
[3] MIT, Informat & Decis Syst Lab, Cambridge, MA 02139 USA
[4] AT&T Labs Res, Red Bank, NJ 07701 USA
关键词
clustering; support vectors machines; Gaussian kernel;
D O I
10.1162/15324430260185565
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present a novel clustering method using the approach of support vector machines. Data points are mapped by means of a Gaussian kernel to a high dimensional feature space, where we search for the minimal enclosing sphere. This sphere, when mapped back to data space, can separate into several components, each enclosing a separate cluster of points. We present a simple algorithm for identifying these clusters. The width of the Gaussian kernel controls the scale at which the data is probed while the soft margin constant helps coping with outliers and overlapping clusters. The structure of a dataset is explored by varying the two parameters, maintaining a minimal number of support vectors to assure smooth cluster boundaries. We demonstrate the performance of our algorithm on several datasets.
引用
收藏
页码:125 / 137
页数:13
相关论文
共 22 条
[1]  
Ben-Hur A, 2000, INT C PATT RECOG, P724, DOI 10.1109/ICPR.2000.906177
[2]  
BENHUR A, 2001, ADV NEURAL INFORMATI, V13
[3]  
BENHUR A, 2002, PAC S BIOC
[4]  
Blake C.L., 1998, UCI repository of machine learning databases
[5]   Data clustering using a model granular magnet [J].
Blatt, M ;
Wiseman, S ;
Domany, E .
NEURAL COMPUTATION, 1997, 9 (08) :1805-1842
[6]   The use of multiple measurements in taxonomic problems [J].
Fisher, RA .
ANNALS OF EUGENICS, 1936, 7 :179-188
[7]  
Fletcher R., 1981, PRACTICAL METHODS OP
[8]  
Fukunaga K., 1990, INTRO STAT PATTERN R
[9]  
Hart, 2006, PATTERN CLASSIFICATI
[10]  
Jain K, 1988, Algorithms for clustering data