Validation of Electronic Health Record Phenotyping of Bipolar Disorder Cases and Controls

被引:87
作者
Castro, Victor M.
Minnier, Jessica
Murphy, Shawn N.
Kohane, Isaac
Churchill, Susanne E.
Gainer, Vivian
Cai, Tianxi
Hoffnagle, Alison G.
Dai, Yael
Block, Stefanie
Weill, Sydney R.
Nadal-Vicens, Mireya
Pollastri, Alisha R.
Rosenquist, J. Niels
Goryachev, Sergey
Ongur, Dost
Sklar, Pamela
Perlis, Roy H.
Smoller, Jordan W. [1 ]
机构
[1] Partners HealthCare Syst, Res Informat Syst & Comp, Boston, MA 02199 USA
关键词
RHEUMATOID-ARTHRITIS; MAJOR DEPRESSION; MEDICAL-RECORDS; LARGE-SCALE; RISK; RELIABILITY; DISCOVERY; LOCI;
D O I
10.1176/appi.ajp.2014.14030423
中图分类号
R749 [精神病学];
学科分类号
100205 ;
摘要
Objective: The study was designed to validate use of electronic health records (EHRs) for diagnosing bipolar disorder and classifying control subjects. Method: EHR data were obtained from a health care system of more than 4.6 million patients spanning more than 20 years. Experienced clinicians reviewed charts to identify text features and coded data consistent or inconsistent with a diagnosis of bipolar disorder. Natural language processing was used to train a diagnostic algorithm with 95% specificity for classifying bipolar disorder. Filtered coded data were used to derive three additional classification rules for case subjects and one for control subjects. The positive predictive value (PPV) of EHR-based bipolar disorder and subphenotype diagnoses was calculated against diagnoses from direct semi-structured interviews of 190 patients by trained clinicians blind to EHR diagnosis. Results: The PPV of bipolar disorder defined by natural language processing was 0.85. Coded classification based on strict filtering achieved a value of 0.79, but classifications based on less stringent criteria performed less well. No EHR-classified control subject received a diagnosis of bipolar disorder on the basis of direct interview (PPV=1.0). For most subphenotypes, values exceeded 0.80. The EHR-based classifications were used to accrue 4,500 bipolar disorder cases and 5,000 controls for genetic analyses. Conclusions: Semiautomated mining of EHRs can be used to ascertain bipolar disorder patients and control subjects with high specificity and predictive value compared with diagnostic interviews. EHRs provide a powerful resource for high-throughput phenotyping for genetic and clinical research.
引用
收藏
页码:363 / 372
页数:10
相关论文
共 32 条
[21]   DSM-5 Field Trials in the United States and Canada, Part II: Test-Retest Reliability of Selected Categorical Diagnoses [J].
Regier, Darrel A. ;
Narrow, William E. ;
Clarke, Diana E. ;
Kraemer, Helena C. ;
Kuramoto, S. Janet ;
Kuhl, Emily A. ;
Kupfer, David J. .
AMERICAN JOURNAL OF PSYCHIATRY, 2013, 170 (01) :59-70
[22]   Genome-wide association analysis identifies 13 new risk loci for schizophrenia [J].
Ripke, Stephan ;
O'Dushlaine, Colm ;
Chambert, Kimberly ;
Moran, Jennifer L. ;
Kaehler, Anna K. ;
Akterin, Susanne ;
Bergen, Sarah E. ;
Collins, Ann L. ;
Crowley, James J. ;
Fromer, Menachem ;
Kim, Yunjung ;
Lee, Sang Hong ;
Magnusson, Patrik K. E. ;
Sanchez, Nick ;
Stahl, Eli A. ;
Williams, Stephanie ;
Wray, Naomi R. ;
Xia, Kai ;
Bettella, Francesco ;
Borglum, Anders D. ;
Bulik-Sullivan, Brendan K. ;
Cormican, Paul ;
Craddock, Nick ;
de Leeuw, Christiaan ;
Durmishi, Naser ;
Gill, Michael ;
Golimbet, Vera ;
Hamshere, Marian L. ;
Holmans, Peter ;
Hougaard, David M. ;
Kendler, Kenneth S. ;
Lin, Kuang ;
Morris, Derek W. ;
Mors, Ole ;
Mortensen, Preben B. ;
Neale, Benjamin M. ;
O'Neill, Francis A. ;
Owen, Michael J. ;
Milovancevic, Milica Pejovic ;
Posthuma, Danielle ;
Powell, John ;
Richards, Alexander L. ;
Riley, Brien P. ;
Ruderfer, Douglas ;
Rujescu, Dan ;
Sigurdsson, Engilbert ;
Silagadze, Teimuraz ;
Smit, August B. ;
Stefansson, Hreinn ;
Steinberg, Stacy .
NATURE GENETICS, 2013, 45 (10) :1150-+
[23]   Genome-wide association study identifies five new schizophrenia loci [J].
Ripke, Stephan ;
Sanders, Alan R. ;
Kendler, Kenneth S. ;
Levinson, Douglas F. ;
Sklar, Pamela ;
Holmans, Peter A. ;
Lin, Dan-Yu ;
Duan, Jubao ;
Ophoff, Roel A. ;
Andreassen, Ole A. ;
Scolnick, Edward ;
Cichon, Sven ;
Clair, David St. ;
Corvin, Aiden ;
Gurling, Hugh ;
Werge, Thomas ;
Rujescu, Dan ;
Blackwood, Douglas H. R. ;
Pato, Carlos N. ;
Malhotra, Anil K. ;
Purcell, Shaun ;
Dudbridge, Frank ;
Neale, Benjamin M. ;
Rossin, Lizzy ;
Visscher, Peter M. ;
Posthuma, Danielle ;
Ruderfer, Douglas M. ;
Fanous, Ayman ;
Stefansson, Hreinn ;
Steinberg, Stacy ;
Mowry, Bryan J. ;
Golimbet, Vera ;
De Hert, Marc ;
Jonsson, Erik G. ;
Bitter, Istvan ;
Pietilainen, Olli P. H. ;
Collier, David A. ;
Tosato, Sarah ;
Agartz, Ingrid ;
Albus, Margot ;
Alexander, Madeline ;
Amdur, Richard L. ;
Amin, Farooq ;
Bass, Nicholas ;
Bergen, Sarah E. ;
Black, Donald W. ;
Borglum, Anders D. ;
Brown, Matthew A. ;
Bruggeman, Richard ;
Buccola, Nancy G. .
NATURE GENETICS, 2011, 43 (10) :969-976
[24]   Robust Replication of Genotype-Phenotype Associations across Multiple Diseases in an Electronic Medical Record [J].
Ritchie, Marylyn D. ;
Denny, Joshua C. ;
Crawford, Dana C. ;
Ramirez, Andrea H. ;
Weiner, Justin B. ;
Pulley, Jill M. ;
Basford, Melissa A. ;
Brown-Gentry, Kristin ;
Balser, Jeffrey R. ;
Masys, Daniel R. ;
Haines, Jonathan L. ;
Roden, Dan M. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2010, 86 (04) :560-572
[25]   Diagnostic Consistency of Major Depression With Psychosis Across 10 Years [J].
Ruggero, Camilo J. ;
Kotov, Roman ;
Carlson, Gabrielle A. ;
Tanenberg-Karant, Marsha ;
Gonzalez, David A. ;
Bromet, Evelyn J. .
JOURNAL OF CLINICAL PSYCHIATRY, 2011, 72 (09) :1207-1213
[26]   A genome- and phenome-wide association study to identify genetic variants influencing platelet count and volume and their pleiotropic effects [J].
Shameer, Khader ;
Denny, Joshua C. ;
Ding, Keyue ;
Jouni, Hayan ;
Crosslin, David R. ;
de Andrade, Mariza ;
Chute, Christopher G. ;
Peissig, Peggy ;
Pacheco, Jennifer A. ;
Li, Rongling ;
Bastarache, Lisa ;
Kho, Abel N. ;
Ritchie, Marylyn D. ;
Masys, Daniel R. ;
Chisholm, Rex L. ;
Larson, Eric B. ;
McCarty, Catherine A. ;
Roden, Dan M. ;
Jarvik, Gail P. ;
Kullo, Iftikhar J. .
HUMAN GENETICS, 2014, 133 (01) :95-109
[27]   Diagnostic reliability of bipolar II disorder [J].
Simpson, SG ;
McMahon, FJ ;
McInnis, MG ;
MacKinnon, DF ;
Edwin, D ;
Folstein, SE ;
DePaulo, JR .
ARCHIVES OF GENERAL PSYCHIATRY, 2002, 59 (08) :736-740
[28]   Detecting Drug Interactions From Adverse-Event Reports: Interaction Between Paroxetine and Pravastatin Increases Blood Glucose Levels [J].
Tatonetti, N. P. ;
Denny, J. C. ;
Murphy, S. N. ;
Fernald, G. H. ;
Krishnan, G. ;
Castro, V. ;
Yue, P. ;
Tsau, P. S. ;
Kohane, I. ;
Roden, D. M. ;
Altman, R. B. .
CLINICAL PHARMACOLOGY & THERAPEUTICS, 2011, 90 (01) :133-142
[29]   Facilitating pharmacogenetic studies using electronic health records and natural-language processing: a case study of warfarin [J].
Xu, Hue ;
Jiang, Min ;
Oetjens, Matt ;
Bowton, Erica A. ;
Ramirez, Andrea H. ;
Jeff, Janina M. ;
Basford, Melissa A. ;
Pulley, Jill M. ;
Cowan, James D. ;
Wang, Xiaoming ;
Ritchie, Marylyn D. ;
Masys, Daniel R. ;
Roden, Dan M. ;
Crawford, Dana C. ;
Denny, Joshua C. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2011, 18 (04) :387-391
[30]   Meta-analysis of genome-wide association data and large-scale replication identifies additional susceptibility loci for type 2 diabetes [J].
Zeggini, Eleftheria ;
Scott, Laura J. ;
Saxena, Richa ;
Voight, Benjamin F. ;
Marchini, Jonathan L. ;
Hu, Tianle ;
de Bakker, Paul I. W. ;
Abecasis, Goncalo R. ;
Almgren, Peter ;
Andersen, Gitte ;
Ardlie, Kristin ;
Bostroem, Kristina Bengtsson ;
Bergman, Richard N. ;
Bonnycastle, Lori L. ;
Borch-Johnsen, Knut ;
Burtt, Noel P. ;
Chen, Hong ;
Chines, Peter S. ;
Daly, Mark J. ;
Deodhar, Parimal ;
Ding, Chia-Jen ;
Doney, Alex S. F. ;
Duren, William L. ;
Elliott, Katherine S. ;
Erdos, Michael R. ;
Frayling, Timothy M. ;
Freathy, Rachel M. ;
Gianniny, Lauren ;
Grallert, Harald ;
Grarup, Niels ;
Groves, Christopher J. ;
Guiducci, Candace ;
Hansen, Torben ;
Herder, Christian ;
Hitman, Graham A. ;
Hughes, Thomas E. ;
Isomaa, Bo ;
Jackson, Anne U. ;
Jorgensen, Torben ;
Kong, Augustine ;
Kubalanza, Kari ;
Kuruvilla, Finny G. ;
Kuusisto, Johanna ;
Langenberg, Claudia ;
Lango, Hana ;
Lauritzen, Torsten ;
Li, Yun ;
Lindgren, Cecilia M. ;
Lyssenko, Valeriya ;
Marvelle, Amanda F. .
NATURE GENETICS, 2008, 40 (05) :638-645