Incorporating logistic regression to decision-theoretic rough sets for classifications

被引:162
作者
Liu, Dun [1 ]
Li, Tianrui [2 ]
Liang, Decui [1 ]
机构
[1] Southwest Jiaotong Univ, Sch Econ & Management, Chengdu 610031, Sichuan, Peoples R China
[2] Southwest Jiaotong Univ, Sch Informat Sci & Technol, Chengdu 610031, Sichuan, Peoples R China
基金
美国国家科学基金会; 高等学校博士学科点专项科研基金; 中国博士后科学基金;
关键词
Decision-theoretic rough sets; Binary logistic analysis; Multivariate logistic regression; Decision making; ATTRIBUTE REDUCTION; DISCRIMINANT-ANALYSIS; MODEL SELECTION; 3-WAY DECISION; RISK DECISION; DISCRETIZATION;
D O I
10.1016/j.ijar.2013.02.013
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Logistic regression analysis is an effective approach to the classification problem. However, it may lead to high misclassification rate in real decision procedures. Decision-Theoretic Rough Sets (DTRS) employs a three-way decision to avoid most direct misclassification. We integrate logistic regression and DTRS to provide a new classification approach. On one hand, DTRS is utilized to systematically calculate the corresponding thresholds with Bayesian decision procedure. On the other hand, logistic regression is employed to compute the conditional probability of the three-way decision. The empirical studies of corporate failure prediction and high school program choices' prediction validate the rationality and effectiveness of the proposed approach. (C) 2013 Elsevier Inc. All rights reserved.
引用
收藏
页码:197 / 210
页数:14
相关论文
共 69 条
[1]  
Abbas AR, 2009, LECT NOTES COMPUT SC, V5552, P192, DOI 10.1007/978-3-642-01510-6_22
[2]  
ABDELMONSEF MME, 2007, INT J MATH MATH SCI
[3]  
[Anonymous], 2012, ROUGH SETS SELECTED
[4]   Variable precision rough set theory and data discretisation: an application to corporate failure prediction [J].
Beynon, MJ ;
Peel, MJ .
OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 2001, 29 (06) :561-576
[5]  
Duda R.O., 1973, Pattern Classification and Scene Analysis, V3
[6]  
Dun Liu, 2012, Rough Sets and Knowledge Technology. Proceedings of the 7th International Conference, RSKT 2012, P389, DOI 10.1007/978-3-642-31900-6_48
[7]   Key concepts in model selection: Performance and generalizability [J].
Forster, MR .
JOURNAL OF MATHEMATICAL PSYCHOLOGY, 2000, 44 (01) :205-231
[8]   Bayesian model selection for logistic regression with misclassified outcomes [J].
Gerlach, Richard ;
Stamey, James .
STATISTICAL MODELLING, 2007, 7 (03) :255-273
[9]   Do statistical inferences allowing three alternative decisions give better feedback for environmentally precautionary decision-making? [J].
Goudey, Rob .
JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2007, 85 (02) :338-344
[10]  
He X. Q., 2007, MULTIVARIABLE STAT A