Enhancing accuracy and interpretability of ensemble strategies in credit risk assessment. A correlated-adjusted decision forest proposal

被引:142
作者
Florez-Lopez, Raquel [1 ]
Manuel Ramon-Jeronimo, Juan [1 ]
机构
[1] Univ Pablo Olavide Seville, Dept Financial Econ & Accounting, Seville 41013, Spain
关键词
Ensemble strategies; Credit scoring; Decision forests; Diversity; Gradient boosting; Random forests; BANKRUPTCY PREDICTION; CLASSIFIER ENSEMBLES; RULE EXTRACTION; NEURAL-NETWORKS; MACHINE; PERFORMANCE; ALGORITHMS; SELECTION;
D O I
10.1016/j.eswa.2015.02.042
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
Credit risk assessment is a critical topic for finance activity and bankruptcy prediction that has been broadly explored using statistical models and Machine Learning methods. Recently, studies have suggested the use of ensemble strategies to enhance credit modelling performance. However, accuracy is obtained at the expense of interpretability, leading to the reluctance of financial industry to employ ensemble models in favour of simpler models. In this work we introduce an ensemble approach based on merged decision trees, the correlated-adjusted decision forest (CADF), to produce both accurate and comprehensible models. As main innovation, our proposal explores the combination of complementary sources of diversity as mechanisms to optimise model's structure, which leads to a manageable number of comprehensive decision rules without sacrificing performance. We evaluate our approach in comparison to individual classifiers and alternative ensemble strategies (gradient boosting, random forests). Empirical results suggest CADF is an encouraging solution for credit risk problems, being able to compete in accuracy with much complex proposals while producing a rule-based structure directly useful for managerial decisions. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:5737 / 5753
页数:17
相关论文
共 77 条
[1]
Improving experimental studies about ensembles of classifiers for bankruptcy prediction and credit scoring [J].
Abellan, Joaquin ;
Mantas, Carlos J. .
EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (08) :3825-3830
[2]
Bagging schemes on the presence of class noise in classification [J].
Abellan, Joaquin ;
Masegosa, Andres R. .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (08) :6827-6837
[3]
FINANCIAL RATIOS, DISCRIMINANT ANALYSIS AND PREDICTION OF CORPORATE BANKRUPTCY [J].
ALTMAN, EI .
JOURNAL OF FINANCE, 1968, 23 (04) :589-609
[4]
Benchmarking state-of-the-art classification algorithms for credit scoring [J].
Baesens, B ;
Van Gestel, T ;
Viaene, S ;
Stepanova, M ;
Suykens, J ;
Vanthienen, J .
JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2003, 54 (06) :627-635
[5]
Basel Committee on Banking Supervision (BCBS), 2011, Basel III: A global regulatory framework for more resilient banks and banking systems
[6]
SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669
[7]
Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[8]
Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[9]
An experimental comparison of classification algorithms for imbalanced credit scoring data sets [J].
Brown, Iain ;
Mues, Christophe .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (03) :3446-3453
[10]
Cestnik B., 1987, PROGR MACH LEARNING