Prediction of Chronic Obstructive Pulmonary Disease (COPD) in Asthma Patients Using Electronic Medical Records

被引:87
作者
Himes, Blanca E. [1 ,2 ,3 ,4 ,6 ]
Dal, Yi [5 ]
Kohane, Isaac S. [1 ,2 ,3 ]
Weiss, Scott T. [3 ,4 ,6 ]
Ramoni, Marco F. [1 ,2 ,3 ]
机构
[1] MIT, Harvard Mit Div Hlth Sci & Technol, Cambridge, MA 02139 USA
[2] Harvard Univ, Childrens Hosp, Sch Med, Informat Program, Boston, MA 02115 USA
[3] Healthcare Ctr Personalized Genet Med, Boston, MA USA
[4] Harvard Univ, Sch Med, Boston, MA USA
[5] Wellesley Coll, Wellesley, MA 02181 USA
[6] Brigham & Womens Hosp, Channing Lab, Boston, MA 02115 USA
关键词
BAYESIAN NETWORKS; RISK; IDENTIFICATION; CANCER;
D O I
10.1197/jamia.M2846
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: Identify clinical factors that modulate the risk of progression to COPD among asthma patients using data extracted from electronic medical records. Design: Demographic information and comorbidities from adult asthma patients who were observed for at least 5 years with initial observation dates between 1988 and 1998, were extracted from electronic medical records of the Partners Healthcare System Using, tools of the National Center for Biomedical Computing "Informatics, for Integrating Biology to the Bedside" (i2b2). Measurements: A predictive model of COPD was constructed from a set of 9,349 patients (843 cases, 8,506 controls) using Bayesian networks. The model's predictive accuracy was tested using it to predict COPD in a future independent set of asthma patients (992 patients; 46 cases, 946 controls), who had initial observation dates between 1999 and 2002. Results: A Bayesian network model composed of age, sex, race, smoking history, and 8 comorbidity variables is able to predict COPD in the independent set of patients with an accuracy of 83.3%, computed as the area Under the Receiver Operating Characteristic curve (AUROC). Conclusions: Our results demonstrate that data extracted from electronic medical records can be used to create predictive models. With improvements in data extraction and inclusion of more variables, such models InaV prove to be clinically useful. J Am Med Inform Assoc. 2009;16:371-379, DOI 10.1197/jamia.M2846.
引用
收藏
页码:371 / 379
页数:9
相关论文
共 40 条
[1]  
[Anonymous], 2007, GLOBAL STRATEGY DIAG
[2]  
[Anonymous], 2007, R LANG ENV STAT COMP
[3]   Using literature and data to learn Bayesian networks as clinical models of ovarian tumors [J].
Antal, P ;
Fannes, G ;
Timmerman, D ;
Moreau, Y ;
De Moor, B .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2004, 30 (03) :257-281
[4]   EFFECTS OF SMOKING INTERVENTION AND THE USE OF AN INHALED ANTICHOLINERGIC BRONCHODILATOR ON THE RATE OF DECLINE OF FEV(1) - THE LUNG HEALTH STUDY [J].
ANTHONISEN, NR ;
CONNETT, JE ;
KILEY, JP ;
ALTOSE, MD ;
BAILEY, WC ;
BUIST, AS ;
CONWAY, WA ;
ENRIGHT, PL ;
KANNER, RE ;
OHARA, P ;
OWENS, GR ;
SCANLON, PD ;
TASHKIN, DP ;
WISE, RA .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 1994, 272 (19) :1497-1505
[5]  
*COMM QUAL HLTH CA, 2001, CROSS QUAL CHASM NEW
[6]   A BAYESIAN METHOD FOR THE INDUCTION OF PROBABILISTIC NETWORKS FROM DATA [J].
COOPER, GF ;
HERSKOVITS, E .
MACHINE LEARNING, 1992, 9 (04) :309-347
[7]  
CUNNINGHAM H, 1997, TIPSTER TEXT PROGR P
[8]   COMPARING THE AREAS UNDER 2 OR MORE CORRELATED RECEIVER OPERATING CHARACTERISTIC CURVES - A NONPARAMETRIC APPROACH [J].
DELONG, ER ;
DELONG, DM ;
CLARKEPEARSON, DI .
BIOMETRICS, 1988, 44 (03) :837-845
[9]   Histamine airway hyper-responsiveness and mortality from chronic obstructive pulmonary disease: a cohort study [J].
Hospers, JJ ;
Postma, DS ;
Rijcken, B ;
Weiss, ST ;
Schouten, JP .
LANCET, 2000, 356 (9238) :1313-1317
[10]  
I2B2, INF INT BIOL BEDS