Learning vector representation of medical objects via EMR-driven nonnegative restricted Boltzmann machines (eNRBM)

被引:120
作者
Truyen Tran [1 ,2 ]
Tu Dinh Nguyen [1 ]
Dinh Phung [1 ]
Venkatesh, Svetha [1 ]
机构
[1] Deakin Univ, Ctr Pattern Recognit & Data Analyt, Geelong, Vic 3217, Australia
[2] Curtin Univ, Dept Comp, Perth, WA 6845, Australia
关键词
Electronic medical records; Vector representation; Medical objects embedding; Feature grouping; Suicide risk stratification; SUICIDAL-BEHAVIOR; MENTAL-HEALTH; RISK-FACTORS; COMORBIDITY; RECORDS; PREVALENCE; DEPRESSION; DISORDER; VALIDITY; DISEASE;
D O I
10.1016/j.jbi.2015.01.012
中图分类号
TP39 [计算机的应用];
学科分类号
080201 [机械制造及其自动化];
摘要
Electronic medical record (EMR) offers promises for novel analytics. However, manual feature engineering from EMR is labor intensive because EMR is complex - it contains temporal, mixed-type and multimodal data packed in irregular episodes. We present a computational framework to harness EMR with minimal human supervision via restricted Boltzmann machine (RBM). The framework derives a new representation of medical objects by embedding them in a low-dimensional vector space. This new representation facilitates algebraic and statistical manipulations such as projection onto 2D plane (thereby offering intuitive visualization), object grouping (hence enabling automated phenotyping), and risk stratification. To enhance model interpretability, we introduced two constraints into model parameters: (a) nonnegative coefficients, and (b) structural smoothness. These result in a novel model called eNRBM (EMR-driven nonnegative RBM). We demonstrate the capability of the eNRBM on a cohort of 7578 mental health patients under suicide risk assessment. The derived representation not only shows clinically meaningful feature grouping but also facilitates short-term risk stratification. The F-scores, 0.21 for moderate-risk and 0.36 for high-risk, are significantly higher than those obtained by clinicians and competitive with the results obtained by support vector machines. (C) 2015 Elsevier Inc. All rights reserved.
引用
收藏
页码:96 / 105
页数:10
相关论文
共 48 条
[1]
[Anonymous], P 5 AS C MACH LEARN
[2]
[Anonymous], P 3 AS C MACH LEARN
[3]
[Anonymous], INT C MACH LEARN ICM
[4]
Representation Learning: A Review and New Perspectives [J].
Bengio, Yoshua ;
Courville, Aaron ;
Vincent, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828
[5]
Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[6]
Twelve-Month Prevalence of and Risk Factors for Suicide Attempts in the World Health Organization World Mental Health Surveys [J].
Borges, Guilherme ;
Nock, Matthew K. ;
Haro Abad, Josep M. ;
Hwang, Irving ;
Sampson, Nancy A. ;
Alonso, Jordi ;
Andrade, Laura Helena ;
Angermeyer, Matthias C. ;
Beautrais, Annette ;
Bromet, Evelyn ;
Bruffaerts, Ronny ;
de Girolamo, Giovanni ;
Florescu, Silvia ;
Gureje, Oye ;
Hu, Chiyi ;
Karam, Elie G. ;
Kovess-Masfety, Viviane ;
Lee, Sing ;
Levinson, Daphna ;
Elena Medina-Mora, Maria ;
Ormel, Johan ;
Posada-Villa, Jose ;
Sagar, Rajesh ;
Tomov, Toma ;
Uda, Hidenori ;
Williams, David R. ;
Kessler, Ronald C. .
JOURNAL OF CLINICAL PSYCHIATRY, 2010, 71 (12) :1617-1628
[7]
Risk factors for suicide in psychiatric outpatients: A 20-year prospective study [J].
Brown, GK ;
Beck, AT ;
Steer, RA ;
Grisham, JR .
JOURNAL OF CONSULTING AND CLINICAL PSYCHOLOGY, 2000, 68 (03) :371-377
[8]
Data-driven approach for creating synthetic electronic medical records [J].
Buczak, Anna L. ;
Babin, Steven ;
Moniz, Linda .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2010, 10
[9]
Relationships between obesity and DSM-IV major depressive disorder, suicide ideation, and suicide attempts:: Results from a general population study [J].
Carpenter, KM ;
Hasin, DS ;
Allison, DB ;
Faith, MS .
AMERICAN JOURNAL OF PUBLIC HEALTH, 2000, 90 (02) :251-257
[10]
LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)