Massive data discrimination via linear support vector machines

被引：126

作者：

Bradley, PS

Mangasarian, OL

机构：

[1] Microsoft Corp, Res, Redmond, WA 98052 USA

[2] Univ Wisconsin, Dept Comp Sci, Madison, WI 53706 USA

来源：

OPTIMIZATION METHODS & SOFTWARE | 2000年 / 13卷 / 01期

基金：

美国国家科学基金会;

关键词：

support vector machines; linear programming chunking;

D O I：

10.1080/10556780008805771

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

A linear support vector machine formulation is used to generate a fast, finitely-terminating linear-programming algorithm for discriminating between two massive sets in n-dimensional space, where the number of points can be orders of magnitude larger than n. The algorithm creates a succession of sufficiently small linear programs that separate I:hunks of the data at a time. The key idea is that a small number of support vectors, corresponding to linear programming constraints with positive dual variables, are carried over between the successive small linear programs, each of which containing a chunk of the data. We prove that this procedure is monotonic and terminates in a finite number of steps at an exact solution that leads to an optimal separating plane for the entire dataset. Numerical results on fully dense publicly available datasets, numbering 20,000 to 1 million points in 32-dimensional space, confirm the theoretical results and demonstrate the ability to handle very large problems.

引用

页码：1 / 10

页数：10

共 22 条

[1] BENNETT KP, 1949, OPTIMIZATION METHODS, V1, P23
[2] BENNETT KP, 1998, UNPUB COMBINING SUPP
[3] BENNETT KP, 1997, P IJCNN 98 ANCH AL, P2396
[4] Bradley P. S., 1998, Machine Learning. Proceedings of the Fifteenth International Conference (ICML'98), P82
[5] A tutorial on Support Vector Machines for pattern recognition
Burges, CJC
[J]. DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) : 121 - 167
[6] Cherkassky V, 1997, IEEE Trans Neural Netw, V8, P1564, DOI 10.1109/TNN.1997.641482
[7] Chvatal V, 1983, Linear programming
[8] CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[9] DECOMPOSITION PRINCIPLE FOR LINEAR-PROGRAMS
DANTZIG, GB
WOLFE, P
[J]. OPERATIONS RESEARCH, 1960, 8 (01) : 101 - 111
[10] A LINEAR-PROGRAMMING APPROACH TO THE CUTTING-STOCK PROBLEM
GILMORE, PC
GOMORY, RE
[J]. OPERATIONS RESEARCH, 1961, 9 (06) : 849 - 859

← 1 2 3 →