Analysis of Machine Learning Techniques for Heart Failure Readmissions

被引:251
作者
Mortazavi, Bobak J. [1 ,5 ,6 ]
Downing, Nicholas S. [1 ,5 ,7 ]
Bucholz, Emily M. [1 ,5 ,8 ]
Dharmarajan, Kumar [1 ,5 ]
Manhapra, Ajay [2 ,3 ]
Li, Shu-Xia [5 ]
Negahban, Sahand N. [6 ]
Krumholz, Harlan M. [1 ,4 ,5 ]
机构
[1] Yale Sch Med, Dept Internal Med, Sect Cardiovasc Med, 1 Church St,Suite 200, New Haven, CT 06510 USA
[2] Yale Sch Med, Dept Psychiat, New Haven, CT 06510 USA
[3] Yale Sch Med, Sect Gen Med, Dept Internal Med, New Haven, CT 06510 USA
[4] Yale Sch Publ Hlth, Dept Hlth Policy & Management, New Haven, CT USA
[5] Yale New Haven Hosp, Ctr Outcomes Res & Evaluat, 20 York St, New Haven, CT 06504 USA
[6] Yale Univ, Dept Stat, New Haven, CT USA
[7] Brigham & Womens Hosp, 75 Francis St, Boston, MA 02115 USA
[8] Boston Childrens Hosp, Boston, MA USA
来源
CIRCULATION-CARDIOVASCULAR QUALITY AND OUTCOMES | 2016年 / 9卷 / 06期
关键词
computers; heart failure; machine learning; meta-analysis; patient readmission; RISK PREDICTION MODELS; 30-DAY READMISSIONS; FEATURE-SELECTION; RANDOM FORESTS; HOSPITALIZATION; DEATH;
D O I
10.1161/CIRCOUTCOMES.116.003039
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background-The current ability to predict readmissions in patients with heart failure is modest at best. It is unclear whether machine learning techniques that address higher dimensional, nonlinear relationships among variables would enhance prediction. We sought to compare the effectiveness of several machine learning algorithms for predicting readmissions. Methods and Results-Using data from the Telemonitoring to Improve Heart Failure Outcomes trial, we compared the effectiveness of random forests, boosting, random forests combined hierarchically with support vector machines or logistic regression (LR), and Poisson regression against traditional LR to predict 30-and 180-day all-cause readmissions and readmissions because of heart failure. We randomly selected 50% of patients for a derivation set, and a validation set comprised the remaining patients, validated using 100 bootstrapped iterations. We compared C statistics for discrimination and distributions of observed outcomes in risk deciles for predictive range. In 30-day all-cause readmission prediction, the best performing machine learning model, random forests, provided a 17.8% improvement over LR (mean C statistics, 0.628 and 0.533, respectively). For readmissions because of heart failure, boosting improved the C statistic by 24.9% over LR (mean C statistic 0.678 and 0.543, respectively). For 30-day all-cause readmission, the observed readmission rates in the lowest and highest deciles of predicted risk with random forests (7.8% and 26.2%, respectively) showed a much wider separation than LR (14.2% and 16.4%, respectively). Conclusions-Machine learning methods improved the prediction of readmission after hospitalization for heart failure compared with LR and provided the greatest predictive range in observed readmission rates.
引用
收藏
页码:629 / +
页数:54
相关论文
共 43 条
[1]   An Automated Model to Identify Heart Failure Patients at Risk for 30-Day Readmission or Death Using Electronic Medical Record Data [J].
Amarasingham, Ruben ;
Moore, Billy J. ;
Tabak, Ying P. ;
Drazner, Mark H. ;
Clark, Christopher A. ;
Zhang, Song ;
Reed, W. Gary ;
Swanson, Timothy S. ;
Ma, Ying ;
Halm, Ethan A. .
MEDICAL CARE, 2010, 48 (11) :981-988
[2]   Data-Driven Decisions for Reducing Readmissions for Heart Failure: General Methodology and Case Study [J].
Bayati, Mohsen ;
Braverman, Mark ;
Gillam, Michael ;
Mack, Karen M. ;
Ruiz, George ;
Smith, Mark S. ;
Horvitz, Eric .
PLOS ONE, 2014, 9 (10)
[3]   Aggregate features and ADABOOST for music classification [J].
Bergstra, James ;
Casagrande, Norman ;
Erhan, Dumitru ;
Eck, Douglas ;
Kegl, Balazs .
MACHINE LEARNING, 2006, 65 (2-3) :473-484
[4]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[5]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[6]   Randomized trial of telemonitoring to improve heart failure outcomes (Tele-HF): Study design [J].
Chaudhry, Sarwat I. ;
Barton, Barbara ;
Mattera, Jennifer ;
Spertus, John ;
Krumholz, Harlan M. .
JOURNAL OF CARDIAC FAILURE, 2007, 13 (09) :709-714
[7]   Telemonitoring in Patients with Heart Failure [J].
Chaudhry, Sarwat I. ;
Mattera, Jennifer A. ;
Curtis, Jeptha P. ;
Spertus, John A. ;
Herrin, Jeph ;
Lin, Zhenqiu ;
Phillips, Christopher O. ;
Hodshon, Beth V. ;
Cooper, Lawton S. ;
Krumholz, Harlan M. .
NEW ENGLAND JOURNAL OF MEDICINE, 2010, 363 (24) :2301-2309
[8]  
Chen YW, 2006, STUD FUZZ SOFT COMP, V207, P315
[9]   The Analysis of Count Data: A Gentle Introduction to Poisson Regression and Its Alternatives [J].
Coxe, Stefany ;
West, Stephen G. ;
Aiken, Leona S. .
JOURNAL OF PERSONALITY ASSESSMENT, 2009, 91 (02) :121-136
[10]  
Culp M, 2006, J STAT SOFTW, V17, P1