Data mining for credit card fraud: A comparative study

被引:400
作者
Bhattacharyya, Siddhartha [1 ]
Jha, Sanjeev [2 ]
Tharakunnel, Kurian [3 ]
Westland, J. Christopher [1 ]
机构
[1] Univ Illinois, Coll Business Adm, Dept Informat & Decis Sci MC 294, Chicago, IL 60607 USA
[2] Univ New Hampshire, Whittemore Sch Business & Econ, Dept Decis Sci, Durham, NH 03824 USA
[3] Millikin Univ, Tabor Sch Business, Decatur, IL 62522 USA
基金
美国国家科学基金会;
关键词
Credit card fraud detection; Data mining; Logistic regression; MULTINOMIAL LOGIT MODEL; INSURANCE FRAUD; DISCRETE-CHOICE;
D O I
10.1016/j.dss.2010.08.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Credit card fraud is a serious and growing problem. While predictive models for credit card fraud detection are in active use in practice, reported studies on the use of data mining approaches for credit card fraud detection are relatively few, possibly due to the lack of available data for research. This paper evaluates two advanced data mining approaches, support vector machines and random forests, together with the well-known logistic regression, as part of an attempt to better detect (and thus control and prosecute) credit card fraud. The study is based on real-life data of transactions from an international credit card operation. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:602 / 613
页数:12
相关论文
共 52 条
  • [21] Neural fraud detection in credit card operations
    Dorronsoro, JR
    Ginel, F
    Sanchez, C
    Cruz, CS
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1997, 8 (04): : 827 - 834
  • [22] *FAIRISSAC, 2009, FALC FRAUD MAN
  • [23] GHOSH S, 1994, P 27 ANN HAW INT C S, V3
  • [24] Misclassification of the dependent variable in a discrete-response setting
    Hausman, JA
    Abrevaya, J
    Scott-Morton, FM
    [J]. JOURNAL OF ECONOMETRICS, 1998, 87 (02) : 239 - 269
  • [25] Hosmer DW., 2000, Applied logistic regression, DOI DOI 10.1002/0471722146.CH4
  • [26] Binary choice models for rare events data: a crop insurance fraud application
    Jin, YF
    Rejesus, RM
    Little, BB
    [J]. APPLIED ECONOMICS, 2005, 37 (07) : 841 - 848
  • [27] Khoshgoftaar TM, 2007, P 19 IEEE INT C TOOL, V2
  • [28] Kuncheva L.I., 2005, Combining Pattern Classifiers, Methods and Algorithms
  • [29] Predicting customer retention and profitability by using random forests and regression forests techniques
    Larivière, B
    Van den Poel, D
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2005, 29 (02) : 472 - 484
  • [30] An extensive comparison of recent classification tools applied to microarray data
    Lee, JW
    Lee, JB
    Park, M
    Song, SH
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2005, 48 (04) : 869 - 885