Predictive modeling of depression and anxiety using electronic health records and a novel machine learning approach with artificial intelligence

被引:104
作者
Nemesure, Matthew D. [1 ,2 ]
Heinz, Michael V. [1 ,3 ]
Huang, Raphael [1 ]
Jacobson, Nicholas C. [1 ,2 ,4 ,5 ]
机构
[1] Dartmouth Coll, Geisel Sch Med, Ctr Technol & Behav Hlth, 46 Centerra Pkwy, Lebanon, NH 03766 USA
[2] Dartmouth Coll, Quantitat Biomed Sci Program, 1 Med Ctr Dr, Lebanon, NH 03766 USA
[3] Dartmouth Hitchcock Med Ctr, 1 Med Ctr Dr, Lebanon, NH 03766 USA
[4] Dartmouth Coll, Geisel Sch Med, Dept Biomed Data Sci, 1 Med Ctr Dr, Lebanon, NH 03766 USA
[5] Dartmouth Coll, Geisel Sch Med, Dept Psychiat, Lebanon, NH 03766 USA
关键词
MENTAL-HEALTH; UNTREATED DEPRESSION; SOCIOECONOMIC-STATUS; MEDICAL-RECORDS; DISORDERS; SAMPLE; RISK; ASSOCIATIONS; COMORBIDITY; DIAGNOSIS;
D O I
10.1038/s41598-021-81368-4
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Generalized anxiety disorder (GAD) and major depressive disorder (MDD) are highly prevalent and impairing problems, but frequently go undetected, leading to substantial treatment delays. Electronic health records (EHRs) collect a great deal of biometric markers and patient characteristics that could foster the detection of GAD and MDD in primary care settings. We approached the problem of predicting MDD and GAD using a novel machine learning pipeline to re-analyze data from an observational study. The pipeline constitutes an ensemble of algorithmically distinct machine learning methods, including deep learning. A sample of 4,184 undergraduate students completed the study, undergoing a general health screening and completing a psychiatric assessment for MDD and GAD. After explicitly excluding all psychiatric information, 59 biomedical and demographic features from the general health survey in addition to a set of engineered features were used for model training. We assessed the model's performance on a held-out test set and found an AUC of 0.73 (sensitivity: 0.66, specificity: 0.7) and 0.67 (sensitivity: 0.55, specificity: 0.7) for GAD, and MDD, respectively. Additionally, we used advanced techniques (SHAP values) to illuminate which features had the greatest impact on prediction for each disease. The top predictive features for MDD were being satisfied with living conditions and having public health insurance. The top predictive features for GAD were vaccinations being up to date and marijuana use. Our results indicate moderate predictive performance for the application of machine learning methods in detection of GAD and MDD based on EHR data. By identifying important predictors of GAD and MDD, these results may be used in future research to aid in the early detection of MDD and GAD.
引用
收藏
页数:9
相关论文
共 41 条
[1]   Comorbidity of Generalized Anxiety Disorder and Substance Use Disorders: Results From the National Epidemiologic Survey on Alcohol and Related Conditions [J].
Alegria, Analucia A. ;
Hasin, Deborah S. ;
Nunes, Edward V. ;
Liu, Shang-Min ;
Davies, Carrie ;
Grant, Bridget F. ;
Blanco, Carlos .
JOURNAL OF CLINICAL PSYCHIATRY, 2010, 71 (09) :1187-1195
[2]   Treatment gap for anxiety disorders is global: Results of the World Mental Health Surveys in 21 countries [J].
Alonso, Jordi ;
Liu, Zhaorui ;
Evans-Lacko, Sara ;
Sadikova, Ekaterina ;
Sampson, Nancy ;
Chatterji, Somnath ;
Abdulmalik, Jibril ;
Aguilar-Gaxiola, Sergio ;
Al-Hamzawi, Ali ;
Andrade, Laura H. ;
Bruffaerts, Ronny ;
Cardoso, Graca ;
Cia, Alfredo ;
Florescu, Silvia ;
de Girolamo, Giovanni ;
Gureje, Oye ;
Haro, Josep M. ;
He, Yanling ;
de Jonge, Peter ;
Karam, Elie G. ;
Kawakami, Norito ;
Kovess-Masfety, Viviane ;
Lee, Sing ;
Levinson, Daphna ;
Medina-Mora, Maria Elena ;
Navarro-Mateu, Fernando ;
Pennell, Beth-Ellen ;
Piazza, Marina ;
Posada-Villa, Jose ;
ten Have, Margreet ;
Zarkov, Zahari ;
Kessler, Ronald C. ;
Thornicroft, Graham .
DEPRESSION AND ANXIETY, 2018, 35 (03) :195-208
[3]   The prevalence and correlates of depression, anxiety, and stress in a sample of college students [J].
Beiter, R. ;
Nash, R. ;
McCrady, M. ;
Rhoades, D. ;
Linscomb, M. ;
Clarahan, M. ;
Sammut, S. .
JOURNAL OF AFFECTIVE DISORDERS, 2015, 173 :90-96
[4]   Perinatal Risks of Untreated Depression During Pregnancy [J].
Bonari, Lori ;
Pinto, Natasha ;
Ahn, Eric ;
Einarson, Adrienne ;
Steiner, Meir ;
Koren, Gideon .
CANADIAN JOURNAL OF PSYCHIATRY-REVUE CANADIENNE DE PSYCHIATRIE, 2004, 49 (11) :726-735
[5]  
Bystritsky Alexander, 2013, P T, V38, P30
[6]   Disentangling the relationship between delinquency and hyperactivity, low achievement, depression, and low socioeconomic status: Analysis of repeated longitudinal data [J].
Defoe, Ivy N. ;
Farrington, David P. ;
Loeber, Rolf .
JOURNAL OF CRIMINAL JUSTICE, 2013, 41 (02) :100-107
[7]   Epidemiologic evidence for the relation between socioeconomic status and depression, obesity, and diabetes [J].
Everson, SA ;
Maty, SC ;
Lynch, JW ;
Kaplan, GA .
JOURNAL OF PSYCHOSOMATIC RESEARCH, 2002, 53 (04) :891-895
[8]   Major depressive disorder [J].
Fava, M ;
Kendler, KS .
NEURON, 2000, 28 (02) :335-341
[9]   Is pulse pressure useful in predicting risk for coronary heart disease? The Framingham Heart Study [J].
Franklin, SS ;
Khan, SA ;
Wong, ND ;
Larson, MG ;
Levy, D .
CIRCULATION, 1999, 100 (04) :354-360
[10]  
Friedrich M J, 2017, JAMA, V317, P1517, DOI 10.1001/jama.2017.3826