Calibration drift in regression and machine learning models for acute kidney injury

被引:196
作者
Davis, Sharon E. [1 ]
Lasko, Thomas A. [1 ]
Chen, Guanhua [2 ]
Siew, Edward D. [3 ,4 ,5 ]
Matheny, Michael E. [1 ,2 ,3 ,6 ]
机构
[1] Vanderbilt Univ, Sch Med, Dept Biomed Informat, Nashville, TN 37212 USA
[2] Vanderbilt Univ, Sch Med, Dept Biostat, Nashville, TN 37212 USA
[3] VA Tennessee Valley Healthcare Syst, Geriatr Res Educ & Clin Care Serv, Nashville, TN USA
[4] Vanderbilt Univ, Sch Med, Div Nephrol, Vanderbilt Ctr Kidney Dis, Nashville, TN 37212 USA
[5] Integrated Program AKI, Nashville, TN USA
[6] Vanderbilt Univ, Sch Med, Div Gen Internal Med, Nashville, TN 37212 USA
关键词
clinical prediction; machine learning; discrimination; calibration; acute kidney injury; clinical decision support; RISK-PREDICTION MODELS; ACUTE-RENAL-FAILURE; EXTERNAL VALIDATION; CARDIAC-SURGERY; STRATIFICATION MODELS; PROGNOSTIC MODELS; CARE UNITS; APACHE-III; PERFORMANCE; MORTALITY;
D O I
10.1093/jamia/ocx030
中图分类号
TP [自动化技术、计算机技术];
学科分类号
080201 [机械制造及其自动化];
摘要
Predictive analytics create opportunities to incorporate personalized risk estimates into clinical decision support. Models must be well calibrated to support decision-making, yet calibration deteriorates over time. This study explored the influence of modeling methods on performance drift and connected observed drift with data shifts in the patient population. Using 2003 admissions to Department of Veterans Affairs hospitals nationwide, we developed 7 parallel models for hospital-acquired acute kidney injury using common regression and machine learning methods, validating each over 9 subsequent years. Discrimination was maintained for all models. Calibration declined as all models increasingly overpredicted risk. However, the random forest and neural network models maintained calibration across ranges of probability, capturing more admissions than did the regression models. The magnitude of overprediction increased over time for the regression models while remaining stable and small for the machine learning models. Changes in the rate of acute kidney injury were strongly linked to increasing overprediction, while changes in predictor-outcome associations corresponded with diverging patterns of calibration drift across methods. Efficient and effective updating protocols will be essential for maintaining accuracy of, user confidence in, and safety of personalized risk predictions to support decision-making. Model updating protocols should be tailored to account for variations in calibration drift across methods and respond to periods of rapid performance drift rather than be limited to regularly scheduled annual or biannual intervals.
引用
收藏
页码:1052 / 1061
页数:10
相关论文
共 64 条
[1]
Implementing Electronic Health Care Predictive Analytics: Considerations And Challenges [J].
Amarasingham, Ruben ;
Patzer, Rachel E. ;
Huesch, Marco ;
Nguyen, Nam Q. ;
Xie, Bin .
HEALTH AFFAIRS, 2014, 33 (07) :1148-1154
[2]
Amarasinghe R., 2016, EUROPEAN J COMPUTER, V4, P1
[3]
Bishop CM, 1995, Neural Networks for Pattern Recognition
[4]
Reporting and Methods in Clinical Prediction Research: A Systematic Review [J].
Bouwmeester, Walter ;
Zuithoff, Nicolaas P. A. ;
Mallett, Susan ;
Geerlings, Mirjam I. ;
Vergouwe, Yvonne ;
Steyerberg, Ewout W. ;
Altman, Douglas G. ;
Moons, Karel G. M. .
PLOS MEDICINE, 2012, 9 (05)
[5]
A Combined Cardiorenal Assessment for the Prediction of Acute Kidney Injury in Lower Respiratory Tract Infections [J].
Breidthardt, Tobias ;
Christ-Crain, Mirjam ;
Stolz, Daiana ;
Bingisser, Roland ;
Drexler, Beatrice ;
Klima, Theresia ;
Balmelli, Catharina ;
Schuetz, Philipp ;
Haaf, Philip ;
Schaerer, Michael ;
Tamm, Michael ;
Mueller, Beat ;
Mueller, Christian .
AMERICAN JOURNAL OF MEDICINE, 2012, 125 (02) :168-175
[6]
Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[7]
Statistical modeling: The two cultures [J].
Breiman, L .
STATISTICAL SCIENCE, 2001, 16 (03) :199-215
[8]
Acute renal failure in intensive care units - Causes, outcome, and prognostic factors of hospital mortality: A prospective, multicenter study [J].
Brivet, FG ;
Kleinknecht, DJ ;
Loirat, P ;
Landais, PJM ;
Bedock, B ;
Bleichner, G ;
Richard, C ;
Coste, F ;
BrunBuisson, C ;
Sicot, C ;
Tenaillon, A ;
Gajdos, P ;
Blin, F ;
Saulnier, F ;
Agostini, MM ;
Nicolas, F ;
FeryLemonnier, E ;
Staikowski, F ;
Carlet, J ;
Guivarch, G ;
Fraisse, F ;
Ricome, J ;
Tempe, JD ;
Mezzarobba, P .
CRITICAL CARE MEDICINE, 1996, 24 (02) :192-198
[9]
Acute Kidney Injury Risk Prediction in Patients Undergoing Coronary Angiography in a National Veterans Health Administration Cohort With External Validation [J].
Brown, Jeremiah R. ;
MacKenzie, Todd A. ;
Maddox, Thomas M. ;
Fly, James ;
Tsai, Thomas T. ;
Plomondon, Mary E. ;
Nielson, Christopher D. ;
Siew, Edward D. ;
Resnic, Frederic S. ;
Baker, Clifton R. ;
Rumsfeld, John S. ;
Matheny, Michael E. .
JOURNAL OF THE AMERICAN HEART ASSOCIATION, 2015, 4 (12)
[10]
Long-term Risk of Mortality and Other Adverse Outcomes After Acute Kidney Injury: A Systematic Review and Meta-analysis [J].
Coca, Steven G. ;
Yusuf, Bushra ;
Shlipak, Michael G. ;
Garg, Amit X. ;
Parikh, Chirag R. .
AMERICAN JOURNAL OF KIDNEY DISEASES, 2009, 53 (06) :961-973