Hybrid mining approach in the design of credit scoring models

被引:135
作者
Hsieh, NC [1 ]
机构
[1] Natl Taipei Coll Nursing, Dept Informat Management, Taipei 11257, Taiwan
关键词
data mining; credit scoring model; clustering; class-wise classification; neural network;
D O I
10.1016/j.eswa.2004.12.022
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unrepresentative data samples are likely to reduce the utility of data classifiers in practical application. This study presents a hybrid mining approach in the design of an effective credit scoring model, based on clustering and neural network techniques. We used clustering techniques to preprocess the input samples with the objective of indicating unrepresentative samples into isolated and inconsistent clusters, and used neural networks to construct the credit scoring model. The clustering stage involved a class-wise classification process. A self-organizing map clustering algorithm was used to automatically determine the number of clusters and the starting points of each cluster. Then, the K-means clustering algorithm was used to generate clusters of samples belonging to new classes and eliminate the unrepresentative samples from each class. In the neural network stage, samples with new class labels were used in the design of the credit scoring model. The proposed method demonstrates by two real world credit data sets that the hybrid mining approach can be used to build effective credit scoring models. (c) 2005 Elsevier Ltd. All rights reserved.
引用
收藏
页码:655 / 665
页数:11
相关论文
共 28 条
[1]  
[Anonymous], 1999, APPL MULTIVARIATE AN
[2]   Comparative performance of the FSCL neural net and K-means algorithm for market segmentation [J].
Balakrishnan, PV ;
Cooper, MC ;
Jacob, VS ;
Lewis, PA .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1996, 93 (02) :346-357
[3]   Credit scoring and rejected instances reassigning through evolutionary computation techniques [J].
Chen, MC ;
Huang, SH .
EXPERT SYSTEMS WITH APPLICATIONS, 2003, 24 (04) :433-441
[4]   A comparison of neural networks and linear scoring models in the credit union environment [J].
Desai, VS ;
Crook, JN ;
Overstreet, GA .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1996, 95 (01) :24-37
[5]   SOME APPLICATIONS OF CLUSTERING IN THE DESIGN OF NEURAL NETWORKS [J].
GOPALAKRISHNAN, M ;
SRIDHAR, V ;
KRISHNAMURTHY, H .
PATTERN RECOGNITION LETTERS, 1995, 16 (01) :59-65
[6]  
Hand D.J., 1981, DISCRIMINATION CLASS
[7]  
HORNIK K, 1989, NEURAL NETWORKS, V2, P336
[8]   Comparing performance of feedforward neural nets and K-means for cluster-based market segmentation [J].
Hruschka, H ;
Natter, M .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1999, 114 (02) :346-353
[9]   An integrated data mining and behavioral scoring model for analyzing bank customers [J].
Hsieh, NC .
EXPERT SYSTEMS WITH APPLICATIONS, 2004, 27 (04) :623-633
[10]   Progress in supervised neural networks [J].
Hush, Don R. ;
Horne, Bill G. .
IEEE SIGNAL PROCESSING MAGAZINE, 1993, 10 (01) :8-39