Data mining for credit card fraud: A comparative study

被引:400
作者
Bhattacharyya, Siddhartha [1 ]
Jha, Sanjeev [2 ]
Tharakunnel, Kurian [3 ]
Westland, J. Christopher [1 ]
机构
[1] Univ Illinois, Coll Business Adm, Dept Informat & Decis Sci MC 294, Chicago, IL 60607 USA
[2] Univ New Hampshire, Whittemore Sch Business & Econ, Dept Decis Sci, Durham, NH 03824 USA
[3] Millikin Univ, Tabor Sch Business, Decatur, IL 62522 USA
基金
美国国家科学基金会;
关键词
Credit card fraud detection; Data mining; Logistic regression; MULTINOMIAL LOGIT MODEL; INSURANCE FRAUD; DISCRETE-CHOICE;
D O I
10.1016/j.dss.2010.08.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Credit card fraud is a serious and growing problem. While predictive models for credit card fraud detection are in active use in practice, reported studies on the use of data mining approaches for credit card fraud detection are relatively few, possibly due to the lack of available data for research. This paper evaluates two advanced data mining approaches, support vector machines and random forests, together with the well-known logistic regression, as part of an attempt to better detect (and thus control and prosecute) credit card fraud. The study is based on real-life data of transactions from an international credit card operation. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:602 / 613
页数:12
相关论文
共 52 条
  • [1] ALESKEROV E, 1998, P IEEE IAFE IEEE PIS, P220
  • [2] [Anonymous], 2008, PROC 25 INT C MACH L
  • [3] [Anonymous], COMPUTER FRAUD 0501
  • [4] [Anonymous], IEEE INT C NETW SENS
  • [5] [Anonymous], 2002, The handbook of brain theory and neural networks
  • [6] Detection of automobile insurance fraud with discrete choice models and misclassified claims
    Artís, M
    Ayuso, M
    Guillén, M
    [J]. JOURNAL OF RISK AND INSURANCE, 2002, 69 (03) : 325 - 340
  • [7] Modeling discrete choice with response error: Food stamp participation
    Bollinger, CR
    David, MH
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1997, 92 (439) : 827 - 835
  • [8] Bolton R. J., 2001, C CRED SCOR CRED CON
  • [9] Bolton RJ, 2002, STAT SCI, V17, P235
  • [10] Brause R., 1999, NEURAL DATA MINING C, P103