Natural language processing to extract symptoms of severe mental illness from clinical text: the Clinical Record Interactive Search Comprehensive Data Extraction (CRIS-CODE) project

被引:133
作者
Jackson, Richard G. [1 ]
Patel, Rashmi [1 ]
Jayatilleke, Nishamali [1 ]
Kolliakou, Anna [1 ]
Ball, Michael [1 ]
Gorrell, Genevieve [2 ]
Roberts, Angus [2 ]
Dobson, Richard J. [1 ]
Stewart, Robert [1 ]
机构
[1] Kings Coll London, Inst Psychiat Psychol & Neurosci, London, England
[2] Univ Sheffield, Dept Comp Sci, Sheffield, S Yorkshire, England
来源
BMJ OPEN | 2017年 / 7卷 / 01期
基金
英国医学研究理事会;
关键词
Natural Language Processing; Serious Mental Illness; Symptomatology; MENTAL HEALTH; clinical informatics; ELECTRONIC HEALTH RECORDS; PERSONALITY-DISORDER; BIPOLAR DISORDER; SECONDARY USE; SCALE; REPRESENTATION; VALIDATION; SYSTEM;
D O I
10.1136/bmjopen-2016-012012
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Objectives We sought to use natural language processing to develop a suite of language models to capture key symptoms of severe mental illness (SMI) from clinical text, to facilitate the secondary use of mental healthcare data in research. Design Development and validation of information extraction applications for ascertaining symptoms of SMI in routine mental health records using the Clinical Record Interactive Search (CRIS) data resource; description of their distribution in a corpus of discharge summaries. Setting Electronic records from a large mental healthcare provider serving a geographic catchment of 1.2 million residents in four boroughs of south London, UK. Participants The distribution of derived symptoms was described in 23128 discharge summaries from 7962 patients who had received an SMI diagnosis, and 13496 discharge summaries from 7575 patients who had received a non-SMI diagnosis. Outcome measures Fifty SMI symptoms were identified by a team of psychiatrists for extraction based on salience and linguistic consistency in records, broadly categorised under positive, negative, disorganisation, manic and catatonic subgroups. Text models for each symptom were generated using the TextHunter tool and the CRIS database. Results We extracted data for 46 symptoms with a median F1 score of 0.88. Four symptom models performed poorly and were excluded. From the corpus of discharge summaries, it was possible to extract symptomatology in 87% of patients with SMI and 60% of patients with non-SMI diagnosis. Conclusions This work demonstrates the possibility of automatically extracting a broad range of SMI symptoms from English text discharge summaries for patients with an SMI diagnosis. Descriptive data also indicated that most symptoms cut across diagnoses, rather than being restricted to particular groups.
引用
收藏
页数:10
相关论文
共 37 条
[1]   On the spectrum [J].
Adam, David .
NATURE, 2013, 496 (7446) :416-418
[2]  
Andreasen N.C., 1983, Scale for the Assessment of Positive Symptoms (SAPS)
[3]  
Antolík J, 2005, ST HEAL T, V116, P817
[4]   VALIDATION OF THE 16-ITEM NEGATIVE SYMPTOM ASSESSMENT [J].
AXELROD, BN ;
GOLDMAN, RS ;
ALPHS, LD .
JOURNAL OF PSYCHIATRIC RESEARCH, 1993, 27 (03) :253-258
[5]   Validation of Electronic Health Record Phenotyping of Bipolar Disorder Cases and Controls [J].
Castro, Victor M. ;
Minnier, Jessica ;
Murphy, Shawn N. ;
Kohane, Isaac ;
Churchill, Susanne E. ;
Gainer, Vivian ;
Cai, Tianxi ;
Hoffnagle, Alison G. ;
Dai, Yael ;
Block, Stefanie ;
Weill, Sydney R. ;
Nadal-Vicens, Mireya ;
Pollastri, Alisha R. ;
Rosenquist, J. Niels ;
Goryachev, Sergey ;
Ongur, Dost ;
Sklar, Pamela ;
Perlis, Roy H. ;
Smoller, Jordan W. .
AMERICAN JOURNAL OF PSYCHIATRY, 2015, 172 (04) :363-372
[6]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[7]   Overcoming barriers to NLP for clinical text: the role of shared tasks and the need for additional creative solutions [J].
Chapman, Wendy W. ;
Nadkarni, Prakash M. ;
Hirschman, Lynette ;
D'Avolio, Leonard W. ;
Savova, Guergana K. ;
Uzuner, Ozlem .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2011, 18 (05) :540-543
[8]   OPENNESS TO EXPERIENCE, INTELLECT, SCHIZOTYPAL PERSONALITY DISORDER, AND PSYCHOTICISM: RESOLVING THE CONTROVERSY [J].
Chmielewski, Michael ;
Bagby, R. Michael ;
Markon, Kristian ;
Ring, Angela J. ;
Ryder, Andrew G. .
JOURNAL OF PERSONALITY DISORDERS, 2014, 28 (04) :483-499
[9]   Integrating psychopathological dimensions in functional psychoses: a hierarchical approach [J].
Cuesta, MJ ;
Peralta, V .
SCHIZOPHRENIA RESEARCH, 2001, 52 (03) :215-229
[10]   Combining dimensional and categorical representation of psychosis: the way forward for DSM-V and ICD-11? [J].
Demjaha, A. ;
Morgan, K. ;
Morgan, C. ;
Landau, S. ;
Dean, K. ;
Reichenberg, A. ;
Sham, P. ;
Fearon, P. ;
Hutchinson, G. ;
Jones, P. B. ;
Murray, R. M. ;
Dazzan, P. .
PSYCHOLOGICAL MEDICINE, 2009, 39 (12) :1943-1955