Combining cluster analysis with classifier ensembles to predict financial distress

被引:126
作者
Tsai, Chih-Fong [1 ]
机构
[1] Natl Cent Univ, Dept Informat Management, Taipei, Taiwan
关键词
Financial distress; Machine learning; Classifier ensembles; Hybrid classifiers; Bankruptcy prediction; Credit scoring; SUPPORT VECTOR MACHINES; BANKRUPTCY PREDICTION; NEURAL-NETWORKS; MINING APPROACH;
D O I
10.1016/j.inffus.2011.12.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The ability to accurately predict business failure is a very important issue in financial decision-making. Incorrect decision-making in financial institutions is very likely to cause financial crises and distress. Bankruptcy prediction and credit scoring are two important problems facing financial decision support. As many related studies develop financial distress models by some machine learning techniques, more advanced machine learning techniques, such as classifier ensembles and hybrid classifiers, have not been fully assessed. The aim of this paper is to develop a novel hybrid financial distress model based on combining the clustering technique and classifier ensembles. In addition, single baseline classifiers, hybrid classifiers, and classifier ensembles are developed for comparisons. In particular, two clustering techniques, Self-Organizing Maps (SOMs) and k-means and three classification techniques, logistic regression, multilayer-perceptron (MLP) neural network, and decision trees, are used to develop these four different types of bankruptcy prediction models. As a result, 21 different models are compared in terms of average prediction accuracy and Type I & II errors. By using five related datasets, combining Self-Organizing Maps (SOMs) with MLP classifier ensembles performs the best, which provides higher predication accuracy and lower Type I & II errors. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:46 / 58
页数:13
相关论文
共 39 条
[1]   Bankruptcy forecasting:: An empirical comparison of AdaBoost and neural networks [J].
Alfaro, Esteban ;
Garcia, Noelia ;
Gamez, Matias ;
Elizondo, David .
DECISION SUPPORT SYSTEMS, 2008, 45 (01) :110-122
[2]  
[Anonymous], 2001, Pattern Classification
[3]  
Haykin S., 1999, Neural Networks: A Comprehensive Foundation, DOI DOI 10.1017/S0269888998214044
[4]   Inferring descriptive and approximate fuzzy rules for credit scoring using evolutionary algorithms [J].
Hoffmann, F. ;
Baesens, B. ;
Mues, C. ;
Van Gestel, T. ;
Vanthienen, J. .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2007, 177 (01) :540-555
[5]   Hybrid mining approach in the design of credit scoring models [J].
Hsieh, NC .
EXPERT SYSTEMS WITH APPLICATIONS, 2005, 28 (04) :655-665
[6]   Predicting corporate financial distress based on integration of support vector machine and logistic regression [J].
Hua, Zhongsheng ;
Wang, Yu ;
Xu, Xiaoyan ;
Zhang, Bin ;
Liang, Liang .
EXPERT SYSTEMS WITH APPLICATIONS, 2007, 33 (02) :434-440
[7]   Credit scoring with a data mining approach based on support vector machines [J].
Huang, Cheng-Lung ;
Chen, Mu-Chen ;
Wang, Chieh-Jen .
EXPERT SYSTEMS WITH APPLICATIONS, 2007, 33 (04) :847-856
[8]   Credit rating analysis with support vector machines and neural networks: a market comparative study [J].
Huang, Z ;
Chen, HC ;
Hsu, CJ ;
Chen, WH ;
Wu, SS .
DECISION SUPPORT SYSTEMS, 2004, 37 (04) :543-558
[9]   Failure prediction with self organizing maps [J].
Huysmans, J ;
Baesens, B ;
Vanthienen, J ;
van Gestel, T .
EXPERT SYSTEMS WITH APPLICATIONS, 2006, 30 (03) :479-487
[10]   Data clustering: A review [J].
Jain, AK ;
Murty, MN ;
Flynn, PJ .
ACM COMPUTING SURVEYS, 1999, 31 (03) :264-323