Application of Big Data analysis in gastrointestinal research

被引:42
作者
Cheung, Ka-Shing [1 ,2 ]
Leung, Wai K. [1 ]
Seto, Wai-Kay [1 ,2 ]
机构
[1] Univ Hong Kong, Queen Mary Hosp, Dept Med, 102 Pokfulam Rd, Hong Kong, Peoples R China
[2] Univ Hong Kong, Shenzhen Hosp, Dept Med, Shenzhen 518053, Guangdong, Peoples R China
关键词
Healthcare dataset; Epidemiology; Gastric cancer; Inflammatory bowel disease; Colorectal cancer; Hepatocellular carcinoma; Gastrointestinal bleeding; INFLAMMATORY-BOWEL-DISEASE; PROPENSITY SCORE METHODS; PROTON PUMP INHIBITOR; ADVERSE DRUG EVENTS; HEPATITIS-B; GASTRIC-CANCER; HEPATOCELLULAR-CARCINOMA; COLORECTAL-CANCER; AUTOMATED IDENTIFICATION; NUCLEOS(T)IDE ANALOG;
D O I
10.3748/wjg.v25.i24.2990
中图分类号
R57 [消化系及腹部疾病];
学科分类号
摘要
Big Data, which are characterized by certain unique traits like volume, velocity and value, have revolutionized the research of multiple fields including medicine. Big Data in health care are defined as large datasets that are collected routinely or automatically, and stored electronically. With the rapidly expanding volume of health data collection, it is envisioned that the Big Data approach can improve not only individual health, but also the performance of health care systems. The application of Big Data analysis in the field of gastroenterology and hepatology research has also opened new research approaches. While it retains most of the advantages and avoids some of the disadvantages of traditional observational studies (case-control and prospective cohort studies), it allows for phenomapping of disease heterogeneity, enhancement of drug safety, as well as development of precision medicine, prediction models and personalized treatment. Unlike randomized controlled trials, it reflects the real-world situation and studies patients who are often under-represented in randomized controlled trials. However, residual and/or unmeasured confounding remains a major concern, which requires meticulous study design and various statistical adjustment methods. Other potential drawbacks include data validity, missing data, incomplete data capture due to the unavailability of diagnosis codes for certain clinical situations, and individual privacy. With continuous technological advances, some of the current limitations with Big Data may be further minimized. This review will illustrate the use of Big Data research on gastrointestinal and liver diseases using recently published examples.
引用
收藏
页码:2990 / 3008
页数:19
相关论文
共 102 条
[1]  
[Anonymous], ORIGINS BIG DATA ETY
[2]  
[Anonymous], 1994, IARC MONOGRAPHS EVAL, V61, P177
[3]  
[Anonymous], 3D DATA MANAGEMENT C
[4]  
[Anonymous], 2016, Study on Big Data in Public Health, Telemedicine and Healthcare: Final Report, DOI [DOI 10.2875/734795, 10.2875/734795]
[5]  
[Anonymous], 2017, BMJ OPEN, DOI DOI 10.1136/bmjopen-2017-017739
[6]  
Bellazzi R, 2014, Yearb Med Inform, V9, P8, DOI 10.15265/IY-2014-0024
[7]   Confounding Control in Healthcare Database Research Challenges and Potential Approaches [J].
Brookhart, M. Alan ;
Sturmer, Til ;
Glynn, Robert J. ;
Rassen, Jeremy ;
Schneeweiss, Sebastian .
MEDICAL CARE, 2010, 48 (06) :S114-S120
[8]   The Association Between Arthralgia and Vedolizumab Using Natural Language Processing [J].
Cai, Tianrun ;
Lin, Tzu-Chieh ;
Bond, Allison ;
Huang, Jie ;
Kane-Wanger, Gwendolyn ;
Cagan, Andrew ;
Murphy, Shawn N. ;
Ananthakrishnan, Ashwin N. ;
Liao, Katherine P. .
INFLAMMATORY BOWEL DISEASES, 2018, 24 (10) :2242-2246
[9]   Helicobacter pylori infection and gastric cardia cancer: systematic review and meta-analysis [J].
Cavaleiro-Pinto, Marlene ;
Peleteiro, Barbara ;
Lunet, Nuno ;
Barros, Henrique .
CANCER CAUSES & CONTROL, 2011, 22 (03) :375-387
[10]   Current questions for the treatment of advanced gastric cancer [J].
Cervantes, A. ;
Roda, D. ;
Tarazona, N. ;
Rosello, S. ;
Perez-Fidalgo, J. A. .
CANCER TREATMENT REVIEWS, 2013, 39 (01) :60-67