Adaptive credit scoring with kernel learning methods

被引:53
作者
Yang, Yingxu [1 ]
机构
[1] SHS VIVEON, D-82152 Munich, Germany
关键词
risk management; credit scoring; kernel methods; machine learning;
D O I
10.1016/j.ejor.2006.10.066
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
Credit scoring is a method of modelling potential risk of credit applications. Traditionally, logistic regression and discriminant analysis are the most widely used approaches to create scoring models in the industry. However, these methods are associated with quite a few limitations, such as being instable with high-dimensional data and small sample size, intensive variable selection effort and incapability of efficiently handling non-linear features. Most importantly, based on these algorithms, it is difficult to automate the modelling process and when population changes occur, the static models usually fail to adapt and may need to be rebuilt from scratch. In the last few years, the kernel learning approach has been investigated to solve these problems. However, the existing applications of this type of methods (in particular the SVM) in credit scoring have all focused on the batch model and did not address the important problem of how to update the scoring model on-line. This paper presents a novel and practical adaptive scoring system based on an incremental kernel method. With this approach, the scoring model is adjusted according to an on-line update procedure that can always converge to the optimal solution without information loss or running into numerical difficulties. Non-linear features in the data are automatically included in the model through a kernel transformation. This approach does not require any variable reduction effort and is also robust for scoring data with a large number of attributes and highly unbalanced class distributions. Moreover, a new potential kernel function is introduced to further improve the predictive performance of the scoring model and a kernel attribute ranking technique is used that adds transparency in the final model. Experimental studies using real world data sets have demonstrated the effectiveness of the proposed method. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:1521 / 1536
页数:16
相关论文
共 36 条
[1]  
AIZERMAN MA, 1965, AUTOMAT REM CONTR+, V25, P821
[2]  
[Anonymous], 1989, Applied Logistic Regression
[3]  
[Anonymous], 1939, THESIS U CHICAGO ILL
[4]  
Back B, 1996, P 1 INT M ART INT AC, P337
[5]   Benchmarking state-of-the-art classification algorithms for credit scoring [J].
Baesens, B ;
Van Gestel, T ;
Viaene, S ;
Stepanova, M ;
Suykens, J ;
Vanthienen, J .
JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2003, 54 (06) :627-635
[6]  
BOMHARDT C, 2004, WEB INT IEEE WIC ACM
[7]  
Cauwenberghs G, 2001, ADV NEUR IN, V13, P409
[8]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[9]  
DIEHL CP, 2003, P IEEE INT JOINT C N
[10]   Accurate identification of alternatively spliced exons using support vector machine [J].
Dror, G ;
Sorek, R ;
Shamir, R .
BIOINFORMATICS, 2005, 21 (07) :897-901