Convolutional Neural Networks for Medical Image Analysis: Full Training or Fine Tuning?

被引：1979

作者：

Tajbakhsh, Nima ^{[1
]}

Shin, Jae Y. ^{[1
]}

Gurudu, Suryakanth R. ^{[2
]}

Hurst, R. Todd ^{[3
]}

Kendall, Christopher B. ^{[3
]}

Gotway, Michael B. ^{[4
]}

Liang, Jianming ^{[1
]}

机构：

[1] Arizona State Univ, Dept Biomed Informat, Scottsdale, AZ 85259 USA

[2] Mayo Clin, Div Gastroenterol & Hepatol, Scottsdale, AZ 85259 USA

[3] Mayo Clin, Div Cardiovasc Dis, Scottsdale, AZ 85259 USA

[4] Mayo Clin, Dept Radiol, Scottsdale, AZ 85259 USA

来源：

IEEE TRANSACTIONS ON MEDICAL IMAGING | 2016年 / 35卷 / 05期

关键词：

Carotid intima-media thickness; computer-aided detection; convolutional neural networks; deep learning; fine-tuning; medical image analysis; polyp detection; pulmonary embolism detection; video quality assessment; MISS RATE; PATTERN-RECOGNITION; COLORECTAL-CANCER; TEXTURE FEATURES; ULTRASOUND; SEGMENTATION; COLONOSCOPY; POLYPS; MICROCALCIFICATIONS; SHIFT;

D O I：

10.1109/TMI.2016.2535302

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Training a deep convolutional neural network (CNN) from scratch is difficult because it requires a large amount of labeled training data and a great deal of expertise to ensure proper convergence. A promising alternative is to fine-tune a CNN that has been pre-trained using, for instance, a large set of labeled natural images. However, the substantial differences between natural and medical images may advise against such knowledge transfer. In this paper, we seek to answer the following central question in the context of medical image analysis: Can the use of pre-trained deep CNNs with sufficient fine-tuning eliminate the need for training a deep CNN from scratch? To address this question, we considered four distinct medical imaging applications in three specialties (radiology, cardiology, and gastroenterology) involving classification, detection, and segmentation from three different imaging modalities, and investigated how the performance of deep CNNs trained from scratch compared with the pre-trained CNNs fine-tuned in a layer-wise manner. Our experiments consistently demonstrated that 1) the use of a pre-trained CNN with adequate fine-tuning outperformed or, in the worst case, performed as well as a CNN trained from scratch; 2) fine-tuned CNNs were more robust to the size of training sets than CNNs trained from scratch; 3) neither shallow tuning nor deep tuning was the optimal choice for a particular application; and 4) our layer-wise fine-tuning scheme could offer a practical way to reach the best performance for the application at hand based on the amount of available data.

引用

页码：1299 / 1312

页数：14

共 76 条

[1] Convolutional Neural Networks for Speech Recognition
Abdel-Hamid, Ossama
Mohamed, Abdel-Rahman
Jiang, Hui
Deng, Li
Penn, Gerald
Yu, Dong
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (10) : 1533 - 1545
[2] Color and position versus texture features for endoscopic polyp detection
Alexandre, Luis A.
Nobre, Nuno
Casteleiro, Joao
[J]. BMEI 2008: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS, VOL 2, 2008, : 38 - +
[3] [Anonymous], P SPIE MED IMAG
[4] [Anonymous], P MICCAI
[5] [Anonymous], 2015, IEEE I CONF COMP VIS, DOI DOI 10.1109/ICCV.2015.123
[6] [Anonymous], AM J ROENTGENOL
[7] [Anonymous], GOING DEEPE IN PRESS
[8] [Anonymous], VERY DEEP C IN PRESS
[9] [Anonymous], P IEEE C COMP VIS PA
[10] [Anonymous], P SPIE MED IMAG

← 1 2 3 4 5 6 7 8 →