Multi-group support vector machines with measurement costs: A biobjective approach

被引:18
作者
Carrizosa, Emilio [1 ]
Martin-Barragan, Belen [2 ]
Morales, Dolores Romero [3 ]
机构
[1] Univ Seville, Fac Matemat, Seville, Spain
[2] Univ Carlos III Madrid, E-28903 Getafe, Spain
[3] Univ Oxford, Said Business Sch, Oxford OX1 2JD, England
关键词
multi-group classification; pareto optimality; biobjective mixed integer programming; feature cost; support vector machines;
D O I
10.1016/j.dam.2007.05.060
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Support Vector Machine has shown to have good performance in many practical classification settings. In this paper we propose, for multi-group classification, a biobjective optimization model in which we consider not only the generalization ability (modeled through the margin maximization), but also costs associated with the features. This cost is not limited to an economical payment, but can also refer to risk, computational effort, space requirements, etc. We introduce a Biobjective Mixed Integer Problem, for which Pareto optimal solutions are obtained. Those Pareto optimal solutions cot-respond to different classification rules, among which the user would choose the one yielding the most appropriate compromise between the cost and the expected misclassification rate. (C) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:950 / 966
页数:17
相关论文
共 36 条
[1]  
Allwein E. L., 2000, J MACHINE LEARNING R, V1, P113, DOI DOI 10.1162/15324430152733133
[2]  
[Anonymous], P 11 INT JOINT C ART
[3]  
APTE C, 2003, OR MS TODAY FEB
[4]  
BAYERZUBEK V, 2003, THESIS OREGON STATE
[5]  
Bennett KP, 1999, ADVANCES IN KERNEL METHODS, P307
[6]  
BENNETT KP, 1992, OPTIMIZATION METHODS, V1, P23, DOI DOI 10.1080/10556789208805504
[7]  
Blake C.L., 1998, UCI repository of machine learning databases
[8]   Logical analysis of numerical data [J].
Boros, E ;
Hammer, PL ;
Ibaraki, T ;
Kogan, A .
MATHEMATICAL PROGRAMMING, 1997, 79 (1-3) :163-190
[9]   Mathematical programming for data mining: Formulations and challenges [J].
Bradley, PS ;
Fayyad, UM ;
Mangasarian, OL .
INFORMS JOURNAL ON COMPUTING, 1999, 11 (03) :217-238
[10]  
BRADLEY PS, 2002, HDB MASSIVE DATASETS, P439