Generalizability issues with deep learning models in medicine and their potential solutions: illustrated with cone-beam computed tomography (CBCT) to computed tomography (CT) image conversion

被引：34

作者：

Liang, Xiao

Dan Nguyen

Jiang, Steve B. ^{[1
]}

机构：

[1] Univ Texas Southwestern Med Ctr Dallas, Med Artificial Intelligence & Automat Lab, Dallas, TX 75390 USA

来源：

MACHINE LEARNING-SCIENCE AND TECHNOLOGY | 2021年 / 2卷 / 01期

关键词：

deep learning; medicine; generalizability; transfer learning;

D O I：

10.1088/2632-2153/abb214

中图分类号：

TP18 [人工智能理论];

学科分类号：

140502 [人工智能];

摘要：

Generalizability is a concern when applying a deep learning (DL) model trained on one dataset to other datasets. It is challenging to demonstrate a DL model's generalizability efficiently and sufficiently before implementing the model in clinical practice. Training a universal model that works anywhere, anytime, for anybody is unrealistic. In this work, we demonstrate the generalizability problem, then explore potential solutions based on transfer learning by using the cone-beam computed tomography (CBCT) to computed tomography (CT) image conversion task as the testbed. Previous works only studied on one or two anatomical sites and used images from the same vendor's scanners. Here, we investigated how a model trained for one machine and one anatomical site works on other machines and other anatomical sites. We trained a model on CBCT images acquired from one vendor's scanners for head and neck cancer patients and applied it to images from another vendor's scanners and for prostate, pancreatic, and cervical cancer patients. We found that generalizability could be a significant problem for this particular application when applying a trained DL model to datasets from another vendor's scanners. We then explored three practical solutions based on transfer learning to solve this generalization problem: the target model, which is trained on a target dataset from scratch; the combined model, which is trained on both source and target datasets from scratch; and the adapted model, which fine-tunes the trained source model to a target dataset. We found that when there are sufficient data in the target dataset, all three models can achieve good performance. When the target dataset is limited, the adapted model works the best, which indicates that using the fine-tuning strategy to adapt the trained model to an unseen target dataset is a viable and easy way to implement DL models in the clinic.

引用

页数：12

共 13 条

[1]

Al-Obaidi FE., 2015, AM J SIGN PROCESS, V5, P51, DOI DOI 10.5923/J.AJSP.20150503.01

[2]

[Anonymous], 2019, ARXIV190105773

[3]

[Anonymous], 2012, PRACTICAL RECOMMENDA

[4]

Assessing Radiology Research on Artificial Intelligence: A Brief Guide for Authors, Reviewers, and Readers-From the Radiology Editorial Board [J].

Bluemke, David A. ;

Moy, Linda ;

Bredella, Miriam A. ;

Ertl-Wagner, Birgit B. ;

Fowler, Kathryn J. ;

Goh, Vicky J. ;

Halpern, Elkan F. ;

Hess, Christopher P. ;

Schiebler, Mark L. ;

Weiss, Clifford R. .

RADIOLOGY, 2020, 294 (03) :487-489

[5]

Goodfellow I, 2016, ADAPT COMPUT MACH LE, P1

[6]

Paired cycle-GAN-based image correction for quantitative cone-beam computed tomography [J].

Harms, Joseph ;

Lei, Yang ;

Wang, Tonghe ;

Zhang, Rongxiao ;

Zhou, Jun ;

Tang, Xiangyang ;

Curran, Walter J. ;

Liu, Tian ;

Yang, Xiaofeng .

MEDICAL PHYSICS, 2019, 46 (09) :3998-4009

[7]

Generating synthesized computed tomography (CT) from cone-beam computed tomography (CBCT) using CycleGAN for adaptive radiation therapy [J].

Liang, Xiao ;

Chen, Liyuan ;

Dan Nguyen ;

Zhou, Zhiguo ;

Gu, Xuejun ;

Yang, Ming ;

Wang, Jing ;

Jiang, Steve .

PHYSICS IN MEDICINE AND BIOLOGY, 2019, 64 (12)

[8]

International evaluation of an AI system for breast cancer screening [J].

McKinney, Scott Mayer ;

Sieniek, Marcin ;

Godbole, Varun ;

Godwin, Jonathan ;

Antropova, Natasha ;

Ashrafian, Hutan ;

Back, Trevor ;

Chesus, Mary ;

Corrado, Greg C. ;

Darzi, Ara ;

Etemadi, Mozziyar ;

Garcia-Vicente, Florencia ;

Gilbert, Fiona J. ;

Halling-Brown, Mark ;

Hassabis, Demis ;

Jansen, Sunny ;

Karthikesalingam, Alan ;

Kelly, Christopher J. ;

King, Dominic ;

Ledsam, Joseph R. ;

Melnick, David ;

Mostofi, Hormuz ;

Peng, Lily ;

Reicher, Joshua Jay ;

Romera-Paredes, Bernardino ;

Sidebottom, Richard ;

Suleyman, Mustafa ;

Tse, Daniel ;

Young, Kenneth C. ;

De Fauw, Jeffrey ;

Shetty, Shravya .

NATURE, 2020, 577 (7788) :89-+

[9]

Rajpurkar P., 2020, ARXIV200211379

[10]

Reed R.D., 1999, Neural Smithing: Supervised Learning in Feedforward Artificial Neural Networks

← 1 2 →