Evaluation of matched control algorithms in EHR-based phenotyping studies: A case study of inflammatory bowel disease comorbidities

被引:17
作者
Castro, Victor M. [1 ]
Apperson, W. Kay [1 ]
Gainer, Vivian S. [1 ]
Ananthakrishnan, Ashwin N. [3 ]
Goodson, Alyssa P. [1 ]
Wang, Taowei D. [1 ]
Herrick, Christopher D. [1 ]
Murphy, Shawn N. [1 ,2 ]
机构
[1] Partners HealthCare, Partners Res Informat Syst & Comp, Boston, MA 02129 USA
[2] Massachusetts Gen Hosp, Dept Neurol, Comp Sci Lab, Boston, MA 02114 USA
[3] Massachusetts Gen Hosp, Gastrointestinal Unit, Boston, MA 02114 USA
基金
美国国家卫生研究院;
关键词
EHR; Controls; Matching; Comorbidity; Inflammatory bowel disease; ELECTRONIC HEALTH RECORDS; INFORMATICS; BIAS;
D O I
10.1016/j.jbi.2014.08.012
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The success of many population studies is determined by proper matching of cases to controls. Some of the confounding and bias that afflict electronic health record (EHR)-based observational studies may be reduced by creating effective methods for finding adequate controls. We implemented a method to match case and control populations to compensate for sparse and unequal data collection practices common in EHR data. We did this by matching the healthcare utilization of patients after observing that more complete data was collected on high healthcare utilization patients vs. low healthcare utilization patients. In our results, we show that many of the anomalous differences in population comparisons are mitigated using this matching method compared to other traditional age and gender-based matching. As an example, the comparison of the disease associations of ulcerative colitis and Crohn's disease show differences that are not present when the controls are chosen in a random or even a matched age/gender/race algorithm. In conclusion, the use of healthcare utilization-based matching algorithms to find adequate controls greatly enhanced the accuracy of results in EHR studies. Full source code and documentation of the control matching methods is available at https://community.i2b2.org/wiki/display/conmat/. (C) 2014 Elsevier Inc. All rights reserved.
引用
收藏
页码:105 / 111
页数:7
相关论文
共 19 条
[1]   Improving Case Definition of Crohn's Disease and Ulcerative Colitis in Electronic Medical Records Using Natural Language Processing: A Novel Informatics Approach [J].
Ananthakrishnan, Ashwin N. ;
Cai, Tianxi ;
Savova, Guergana ;
Cheng, Su-Chun ;
Chen, Pei ;
Perez, Raul Guzman ;
Gainer, Vivian S. ;
Murphy, Shawn N. ;
Szolovits, Peter ;
Xia, Zongqi ;
Shaw, Stanley ;
Churchill, Susanne ;
Karlson, Elizabeth W. ;
Kohane, Isaac ;
Plenge, Robert M. ;
Liao, Katherine P. .
INFLAMMATORY BOWEL DISEASES, 2013, 19 (07) :1411-1420
[2]   Confounding Control in Healthcare Database Research Challenges and Potential Approaches [J].
Brookhart, M. Alan ;
Sturmer, Til ;
Glynn, Robert J. ;
Rassen, Jeremy ;
Schneeweiss, Sebastian .
MEDICAL CARE, 2010, 48 (06) :S114-S120
[3]   QT interval and antidepressant use: a cross sectional study of electronic health records [J].
Castro, Victor M. ;
Clements, Caitlin C. ;
Murphy, Shawn N. ;
Gainer, Vivian S. ;
Fava, Maurizio ;
Weilburg, Jeffrey B. ;
Erb, Jane L. ;
Churchill, Susanne E. ;
Kohane, Isaac S. ;
Iosifescu, Dan V. ;
Smoller, Jordan W. ;
Perlis, Roy H. .
BMJ-BRITISH MEDICAL JOURNAL, 2013, 346
[4]   Bias correction to secondary trait analysis with casecontrol design [J].
Chen, Hua Yun ;
Kittles, Rick ;
Zhang, Wei .
STATISTICS IN MEDICINE, 2013, 32 (09) :1494-1508
[5]   Chapter 13: Mining Electronic Health Records in the Genomics Era [J].
Denny, Joshua C. .
PLOS COMPUTATIONAL BIOLOGY, 2012, 8 (12)
[6]  
Des Roches CM, 2013, HLTH AFF
[7]  
Fleiss JL., 2003, STAT METHODS RATES P, DOI DOI 10.1002/0471445428
[8]   The Promise of Electronic Records Around the Corner or Down the Road? [J].
Jha, Ashish K. .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2011, 306 (08) :880-881
[9]   Use of Electronic Health Records in U. S. Hospitals [J].
Jha, Ashish K. ;
DesRoches, Catherine M. ;
Campbell, Eric G. ;
Donelan, Karen ;
Rao, Sowmya R. ;
Ferris, Timothy G. ;
Shields, Alexandra ;
Rosenbaum, Sara ;
Blumenthal, David .
NEW ENGLAND JOURNAL OF MEDICINE, 2009, 360 (16) :1628-1638
[10]   Using electronic health records to drive discovery in disease genomics [J].
Kohane, Isaac S. .
NATURE REVIEWS GENETICS, 2011, 12 (06) :417-428