Orthogonal support vector machine for credit scoring

被引:40
作者
Han, Lu [1 ,2 ]
Han, Liyan [1 ]
Zhao, Hongwei [2 ]
机构
[1] Beihang Univ, Sch Econ & Management, Beijing 100191, Peoples R China
[2] Tsinghua Univ, PBC Sch Finance, Beijing 100083, Peoples R China
基金
中国国家自然科学基金;
关键词
Dimension curse; Orthogonal dimension reduction; Support vector machine; Logistic regression; Principal component analysis; Credit scoring; ARTIFICIAL NEURAL-NETWORKS; DIMENSIONALITY REDUCTION; BANKRUPTCY PREDICTION; LOGISTIC-REGRESSION; CLASSIFICATION; OPTIMIZATION;
D O I
10.1016/j.engappai.2012.10.005
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The most commonly used techniques for credit scoring is logistic regression, and more recent research has proposed that the support vector machine is a more effective method. However, both logistic regression and support vector machine suffers from curse of dimension. In this paper, we introduce a new way to address this problem which is defined as orthogonal dimension reduction. We discuss the related properties of this method in detail and test it against other common statistical approaches principal component analysis and hybridizing logistic regression to better solve and evaluate the data. With experiments on German data set, there is also an interesting phenomenon with respect to the use of support vector machine, which we define as 'Dimensional interference', and discuss in general. Based on the results of cross-validation, it can be found that through the use of logistic regression filtering the dummy variables and orthogonal extracting feature, the support vector machine not only reduces complexity and accelerates convergence, but also achieves better performance. Crown Copyright (C) 2012 Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:848 / 862
页数:15
相关论文
共 32 条
[1]   Credit risk measurement: Developments over the last 20 years [J].
Altman, EI ;
Saunders, A .
JOURNAL OF BANKING & FINANCE, 1997, 21 (11-12) :1721-1742
[2]  
Anderson T.W., 1962, INTRO MULTIVARIATE S
[3]  
[Anonymous], 2002, Principal components analysis
[4]  
[Anonymous], 1961, Adaptive Control Processes: a Guided Tour, DOI DOI 10.1515/9781400874668
[5]   The use of the area under the roc curve in the evaluation of machine learning algorithms [J].
Bradley, AP .
PATTERN RECOGNITION, 1997, 30 (07) :1145-1159
[6]   SUPPORT-VECTOR NETWORKS [J].
CORTES, C ;
VAPNIK, V .
MACHINE LEARNING, 1995, 20 (03) :273-297
[7]   An introduction to ROC analysis [J].
Fawcett, Tom .
PATTERN RECOGNITION LETTERS, 2006, 27 (08) :861-874
[8]   The use of multiple measurements in taxonomic problems [J].
Fisher, RA .
ANNALS OF EUGENICS, 1936, 7 :179-188
[9]  
Fukunaga K, 1990, INTRO STAT PATTERN R, V2nd
[10]  
Gestel T.V., 2003, Journal of Bank and Finance, V2, P73