Forecasting dangerous inmate misconduct: An application of ensemble statistical procedures

被引:57
作者
Berk, Richard A. [1 ]
Kriegler, Brian [1 ]
Baek, Jong-Ho [1 ]
机构
[1] Univ Calif Los Angeles, Dept Stat, Los Angeles, CA 90095 USA
基金
美国国家科学基金会;
关键词
prison; incarceration; misconduct; classification;
D O I
10.1007/s10940-006-9005-z
中图分类号
DF [法律]; D9 [法律];
学科分类号
0301 ;
摘要
In this paper, we attempt to forecast which prison inmates are likely to engage in very serious misconduct while incarcerated. Such misconduct would usually be a major felony if committed outside of prison: drug trafficking, assault, rape, attempted murder and other crimes. The binary response variable is problematic because it is highly unbalanced. Using data from nearly 10,000 inmates held in facilities operated by the California Department of Corrections, we show that several popular classification procedures do no better than the marginal distribution unless the data are weighted in a fashion that compensates for the lack of balance. Then, random forests performs reasonably well, and better than CART or logistic regression. Although less than 3% of the inmates studied over 24 months were reported for very serious misconduct, we are able to correctly forecast such behavior about half the time.
引用
收藏
页码:131 / 145
页数:15
相关论文
共 23 条
  • [1] ALEXANDER J, 1992, HDB EVALUATING PRISO
  • [3] AUSTIN J, 1993, CLASSIFICATION INTER
  • [4] BAIRD C, 1993, OBJECTIVE CLASSIFICA
  • [5] An evaluation of California's inmate classification system using a generalized regression discontinuity design
    Berk, RA
    de Leeuw, J
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1999, 94 (448) : 1045 - 1052
  • [6] BERK RA, 2005, IN PRESS SOC METH RE
  • [7] BERK RA, 2003, J CRIMINOLOGY PUBLIC, V2, P215
  • [8] SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation
    Blewitt, Marnie E.
    Gendrel, Anne-Valerie
    Pang, Zhenyi
    Sparrow, Duncan B.
    Whitelaw, Nadia
    Craig, Jeffrey M.
    Apedaile, Anwyn
    Hilton, Douglas J.
    Dunwoodie, Sally L.
    Brockdorff, Neil
    Kay, Graham F.
    Whitelaw, Emma
    [J]. NATURE GENETICS, 2008, 40 (05) : 663 - 669
  • [9] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [10] Statistical modeling: The two cultures
    Breiman, L
    [J]. STATISTICAL SCIENCE, 2001, 16 (03) : 199 - 215