Applying Classification Trees to Hospital Administrative Data to Identify Patients with Lower Gastrointestinal Bleeding

被引:11
作者
Siddique, Juned [1 ]
Ruhnke, Gregory W. [2 ]
Flores, Andrea [2 ]
Prochaska, Micah T. [2 ]
Paesch, Elizabeth [2 ]
Meltzer, David O. [2 ]
Whelan, Chad T. [3 ]
机构
[1] Northwestern Univ, Feinberg Sch Med, Dept Prevent Med, Chicago, IL 60611 USA
[2] Univ Chicago, Dept Med, Chicago, IL 60637 USA
[3] Loyola Univ, Stritch Sch Med, Dept Med, Maywood, IL 60153 USA
来源
PLOS ONE | 2015年 / 10卷 / 09期
关键词
EPIDEMIOLOGY; OUTCOMES;
D O I
10.1371/journal.pone.0138987
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background Lower gastrointestinal bleeding (LGIB) is a common cause of acute hospitalization. Currently, there is no accepted standard for identifying patients with LGIB in hospital administrative data. The objective of this study was to develop and validate a set of classification algorithms that use hospital administrative data to identify LGIB. Methods Our sample consists of patients admitted between July 1, 2001 and June 30, 2003 (derivation cohort) and July 1, 2003 and June 30, 2005 (validation cohort) to the general medicine inpatient service of the University of Chicago Hospital, a large urban academic medical center. Confirmed cases of LGIB in both cohorts were determined by reviewing the charts of those patients who had at least 1 of 36 principal or secondary International Classification of Diseases, Ninth revision, Clinical Modification (ICD-9-CM) diagnosis codes associated with LGIB. Classification trees were used on the data of the derivation cohort to develop a set of decision rules for identifying patients with LGIB. These rules were then applied to the validation cohort to assess their performance. Results Three classification algorithms were identified and validated: a high specificity rule with 80.1% sensitivity and 95.8% specificity, a rule that balances sensitivity and specificity (87.8% sensitivity, 90.9% specificity), and a high sensitivity rule with 100% sensitivity and 91.0% specificity. Conclusion These classification algorithms can be used in future studies to evaluate resource utilization and assess outcomes associated with LGIB without the use of chart review.
引用
收藏
页数:15
相关论文
共 17 条
[1]   Population fluctuations affect inference in ecological networks of multi-species interactions [J].
Wells, Konstans ;
Feldhaar, Heike ;
O'Hara, Robert B. .
OIKOS, 2014, 123 (05) :589-598
[2]  
[Anonymous], 2012, R LANG ENV STAT COMP
[3]  
[Anonymous], 2019, Statistical learning with sparsity: the lasso and generalizations
[4]  
Berk RA, 2008, SPRINGER SER STAT, P1, DOI 10.1007/978-0-387-77501-2_1
[5]   Statistical modeling: The two cultures [J].
Breiman, L .
STATISTICAL SCIENCE, 2001, 16 (03) :199-215
[6]  
Breiman L, 1984, OLSHEN STONE CLASSIF, DOI 10.1201/9781315139470
[7]   Ascertainment of Colonoscopy Indication Using Administrative Data [J].
Fisher, Deborah A. ;
Grubber, Janet M. ;
Castor, John M. ;
Coffman, Cynthia J. .
DIGESTIVE DISEASES AND SCIENCES, 2010, 55 (06) :1721-1725
[8]   A decision-theoretic generalization of on-line learning and an application to boosting [J].
Freund, Y ;
Schapire, RE .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1997, 55 (01) :119-139
[9]  
Hastie T., 2009, ELEMENTS STAT LEARNI, V2
[10]  
Longstreth GF, 1997, AM J GASTROENTEROL, V92, P419