Application of Big Data analysis in gastrointestinal research

被引:42
作者
Cheung, Ka-Shing [1 ,2 ]
Leung, Wai K. [1 ]
Seto, Wai-Kay [1 ,2 ]
机构
[1] Univ Hong Kong, Queen Mary Hosp, Dept Med, 102 Pokfulam Rd, Hong Kong, Peoples R China
[2] Univ Hong Kong, Shenzhen Hosp, Dept Med, Shenzhen 518053, Guangdong, Peoples R China
关键词
Healthcare dataset; Epidemiology; Gastric cancer; Inflammatory bowel disease; Colorectal cancer; Hepatocellular carcinoma; Gastrointestinal bleeding; INFLAMMATORY-BOWEL-DISEASE; PROPENSITY SCORE METHODS; PROTON PUMP INHIBITOR; ADVERSE DRUG EVENTS; HEPATITIS-B; GASTRIC-CANCER; HEPATOCELLULAR-CARCINOMA; COLORECTAL-CANCER; AUTOMATED IDENTIFICATION; NUCLEOS(T)IDE ANALOG;
D O I
10.3748/wjg.v25.i24.2990
中图分类号
R57 [消化系及腹部疾病];
学科分类号
摘要
Big Data, which are characterized by certain unique traits like volume, velocity and value, have revolutionized the research of multiple fields including medicine. Big Data in health care are defined as large datasets that are collected routinely or automatically, and stored electronically. With the rapidly expanding volume of health data collection, it is envisioned that the Big Data approach can improve not only individual health, but also the performance of health care systems. The application of Big Data analysis in the field of gastroenterology and hepatology research has also opened new research approaches. While it retains most of the advantages and avoids some of the disadvantages of traditional observational studies (case-control and prospective cohort studies), it allows for phenomapping of disease heterogeneity, enhancement of drug safety, as well as development of precision medicine, prediction models and personalized treatment. Unlike randomized controlled trials, it reflects the real-world situation and studies patients who are often under-represented in randomized controlled trials. However, residual and/or unmeasured confounding remains a major concern, which requires meticulous study design and various statistical adjustment methods. Other potential drawbacks include data validity, missing data, incomplete data capture due to the unavailability of diagnosis codes for certain clinical situations, and individual privacy. With continuous technological advances, some of the current limitations with Big Data may be further minimized. This review will illustrate the use of Big Data research on gastrointestinal and liver diseases using recently published examples.
引用
收藏
页码:2990 / 3008
页数:19
相关论文
共 102 条
[81]   A review of approaches to identifying patient phenotype cohorts using electronic health records [J].
Shivade, Chaitanya ;
Raghavan, Preethi ;
Fosler-Lussier, Eric ;
Embi, Peter J. ;
Elhadad, Noemie ;
Johnson, Stephen B. ;
Lai, Albert M. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2014, 21 (02) :221-230
[82]   Incidence and clinical impact of perianal disease in patients with ulcerative colitis: A nationwide population-based study [J].
Song, Eun Mi ;
Lee, Ho-Su ;
Kim, Ye-Jee ;
Oh, Eun Hye ;
Ham, Nam Seok ;
Kim, Jeongseok ;
Hwang, Sung Wook ;
Park, Sang Hyoung ;
Yang, Dong-Hoon ;
Ye, Byong Duk ;
Byeon, Jeong-Sik ;
Myung, Seung-Jae ;
Yang, Suk-Kyun .
JOURNAL OF GASTROENTEROLOGY AND HEPATOLOGY, 2019, 34 (06) :1011-1017
[83]   Fiber Intake and Survival After Colorectal Cancer Diagnosis [J].
Song, Mingyang ;
Wu, Kana ;
Meyerhardt, Jeffrey A. ;
Ogino, Shuji ;
Wang, Molin ;
Fuchs, Charles S. ;
Giovannucci, Edward L. ;
Chan, Andrew T. .
JAMA ONCOLOGY, 2018, 4 (01) :71-79
[84]   A review of the application of propensity score methods yielded increasing use, advantages in specific settings, but not substantially different estimates compared with conventional multivariable methods [J].
Stürmer, T ;
Joshi, M ;
Glynn, RJ ;
Avorn, J ;
Rothman, KJ ;
Schneeweiss, S .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2006, 59 (05) :437-447
[85]   Gastric cancer [J].
Van Cutsem, Eric ;
Sagaert, Xavier ;
Topal, Baki ;
Haustermans, Karin ;
Prenen, Hans .
LANCET, 2016, 388 (10060) :2654-2664
[86]   Ethical Challenges of Big Data in Public Health [J].
Vayena, Effy ;
Salathe, Marcel ;
Madoff, Lawrence C. ;
Brownstein, John S. .
PLOS COMPUTATIONAL BIOLOGY, 2015, 11 (02)
[87]   Predicting corticosteroid-free endoscopic remission with vedolizumab in ulcerative colitis [J].
Waljee, A. K. ;
Liu, B. ;
Sauder, K. ;
Zhu, J. ;
Govani, S. M. ;
Stidham, R. W. ;
Higgins, P. D. R. .
ALIMENTARY PHARMACOLOGY & THERAPEUTICS, 2018, 47 (06) :763-772
[88]   Predicting Hospitalization and Outpatient Corticosteroid Use in Inflammatory Bowel Disease Patients Using Machine Learning [J].
Waljee, Akbar K. ;
Lipson, Rachel ;
Wiitala, Wyndy L. ;
Zhang, Yiwei ;
Liu, Boang ;
Zhu, Ji ;
Wallace, Beth ;
Govani, Shail M. ;
Stidham, Ryan W. ;
Hayward, Rodney ;
Higgins, Peter D. R. .
INFLAMMATORY BOWEL DISEASES, 2018, 24 (01) :45-53
[89]   Machine Learning Algorithms for Objective Remission and Clinical Outcomes with Thiopurines [J].
Waljee, Akbar K. ;
Sauder, Kay ;
Patel, Anand ;
Segar, Sandeep ;
Liu, Boang ;
Zhang, Yiwei ;
Zhu, Ji ;
Stidham, Ryan W. ;
Balis, Ulysses ;
Higgins, Peter D. R. .
JOURNAL OF CROHNS & COLITIS, 2017, 11 (07) :801-810
[90]   A method for systematic discovery of adverse drug events from clinical notes [J].
Wang, Guan ;
Jung, Kenneth ;
Winnenburg, Rainer ;
Shah, Nigam H. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2015, 22 (06) :1196-1204