Facilitating pharmacogenetic studies using electronic health records and natural-language processing: a case study of warfarin

Cited by: 55
Authors
Xu, Hua [1]
Jiang, Min [1]
Oetjens, Matt [2]
Bowton, Erica A. [3]
Ramirez, Andrea H. [4]
Jeff, Janina M. [3]
Basford, Melissa A. [3]
Pulley, Jill M. [3]
Cowan, James D. [3]
Wang, Xiaoming [3]
Ritchie, Marylyn D. [1, 2]
Masys, Daniel R. [1]
Roden, Dan M. [4, 5]
Crawford, Dana C. [2]
Denny, Joshua C. [1, 4]
Affiliations
[1] Vanderbilt Univ, Sch Med, Dept Biomed Informat, Nashville, TN 37232 USA
[2] Vanderbilt Univ, Sch Med, Ctr Human Genet Res, Dept Mol Physiol & Biophys, Nashville, TN 37232 USA
[3] Vanderbilt Univ, Sch Med, Vanderbilt Inst Clin & Translat Res, Nashville, TN 37232 USA
[4] Vanderbilt Univ, Sch Med, Dept Med, Nashville, TN 37232 USA
[5] Vanderbilt Univ, Sch Med, Dept Pharmacol, Nashville, TN 37232 USA
Keywords
MEDICATION INFORMATION EXTRACTION; CLINICAL TEXT; DNA BIOBANK; SYSTEM; IDENTIFICATION; ASSOCIATION; NARRATIVES; PREDICT; VKORC1; TOOL;
DOI
10.1136/amiajnl-2011-000208
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Objective: DNA biobanks linked to comprehensive electronic health record systems are potentially powerful resources for pharmacogenetic studies. This study sought to develop natural-language-processing algorithms to extract drug-dose information from clinical text, and to assess how well such tools can automate data extraction for pharmacogenetic studies.
Materials and methods: A manually validated warfarin pharmacogenetic study identified a cohort of 1125 patients with a stable warfarin dose, of whom 776 were managed by Coumadin Clinic physicians and the remaining 349 were managed by their own providers. The authors developed two algorithms to extract weekly warfarin doses from the two data sets: a regular-expression-based program for semistructured Coumadin Clinic notes, and an advanced weekly dose calculator, built on an existing medication information extraction system (MedEx), for narrative providers' notes. The performance of the weekly dose-extraction program was evaluated against a gold standard of manually curated weekly doses; precision, recall, F-measure, and overall accuracy were reported. The authors then tested for association between the automatically extracted stable weekly dose of warfarin and four known genetic variants in the VKORC1 and CYP2C9 genes, using linear regression adjusted for age, gender, and body mass index.
Results: The evaluation showed that the MedEx-based system could determine patients' weekly warfarin doses with 99.7% recall, 90.8% precision, and 93.8% accuracy. Using the automatically extracted weekly doses, the authors replicated the previously known associations between stable warfarin dose and genetic variants in VKORC1 and CYP2C9.
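The evaluation metrics named in the abstract can be illustrated with a minimal sketch. It uses the standard definitions of precision, recall, and balanced F-measure; the abstract does not say how a correct extraction was counted or report an F-measure value, so the function names, the counts, and the derived F-measure below are illustrative assumptions rather than the authors' evaluation code.

```python
def f_measure(precision: float, recall: float) -> float:
    """Balanced F-measure: harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def precision_recall_f(tp: int, fp: int, fn: int) -> tuple[float, float, float]:
    """Compute precision, recall, and F-measure from raw counts.

    tp: doses extracted and matching the gold standard (assumed definition)
    fp: doses extracted but not matching the gold standard
    fn: gold-standard doses the system failed to extract
    """
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall, f_measure(precision, recall)


# Hypothetical counts, not taken from the study.
example_p, example_r, example_f = precision_recall_f(tp=90, fp=10, fn=5)

# F-measure implied by the precision and recall quoted in the abstract
# (derived here; the abstract itself does not state this number).
implied_f = f_measure(precision=0.908, recall=0.997)
```

Under these definitions, the precision and recall quoted for the MedEx-based system imply a balanced F-measure of roughly 0.95.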
Pages: 387-391
Page count: 5