A tutorial on Support Vector Machines for pattern recognition

Cited by: 11975

Authors:

Burges, CJC [1]

Affiliations:

[1] Lucent Technol, Bell Labs, Murray Hill, NJ 07974 USA

Source:

DATA MINING AND KNOWLEDGE DISCOVERY | 1998 / Vol. 2 / No. 2

Keywords:

Support Vector Machines; statistical learning theory; VC dimension; pattern recognition

DOI:

10.1023/A:1009715923555

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

The tutorial starts with an overview of the concepts of VC dimension and structural risk minimization. We then describe linear Support Vector Machines (SVMs) for separable and non-separable data, working through a non-trivial example in detail. We describe a mechanical analogy, and discuss when SVM solutions are unique and when they are global. We describe how support vector training can be practically implemented, and discuss in detail the kernel mapping technique which is used to construct SVM solutions which are nonlinear in the data. We show how Support Vector Machines can have very large (even infinite) VC dimension by computing the VC dimension for homogeneous polynomial and Gaussian radial basis function kernels. While very high VC dimension would normally bode ill for generalization performance, and while at present there exists no theory which shows that good generalization performance is guaranteed for SVMs, there are several arguments which support the observed high accuracy of SVMs, which we review. Results of some experiments which were inspired by these arguments are also presented. We give numerous examples and proofs of most of the key theorems. There is new material, and I hope that the reader will find that even old material is cast in a fresh light.

Citation:

Pages: 121-167

Page count: 47
