Training and Validating a Deep Convolutional Neural Network for Computer-Aided Detection and Classification of Abnormalities on Frontal Chest Radiographs

被引:191
作者
Cicero, Mark [1 ]
Bilbily, Alexander [1 ]
Dowdell, Tim [1 ]
Gray, Bruce [1 ]
Perampaladas, Kuhan [2 ]
Barfett, Joseph [1 ]
机构
[1] St Michaels Hosp, Dept Med Imaging, 30 Bond St, Toronto, ON M5B 1W8, Canada
[2] Univ Toronto, Dept Pharmaceut Sci, Toronto, ON, Canada
关键词
machine learning; neural network; x-ray; cardiomegaly; pleural effusion; pulmonary edema; pneumothorax; pneumonia;
D O I
10.1097/RLI.0000000000000341
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Objectives: Convolutional neural networks (CNNs) are a subtype of artificial neural network that have shown strong performance in computer vision tasks including image classification. To date, there has been limited application of CNNs to chest radiographs, the most frequently performed medical imaging study. We hypothesize CNNs can learn to classify frontal chest radiographs according to common findings from a sufficiently large data set. Materials and Methods: Our institution's research ethics board approved a single-center retrospective review of 35,038 adult posterior-anterior chest radiographs and final reports performed between 2005 and 2015 (56% men, average age of 56, patient type: 24% inpatient, 39% outpatient, 37% emergency department) with a waiver for informed consent. The GoogLeNet CNN was trained using 3 graphics processing units to automatically classify radiographs as normal (n = 11,702) or into 1 or more of cardiomegaly (n = 9240), consolidation (n = 6788), pleural effusion (n = 7786), pulmonary edema (n = 1286), or pneumothorax (n = 1299). The network's performancewas evaluated using receiver operating curve analysis on a test set of 2443 radiographs with the criterion standard being board-certified radiologist interpretation. Results: Using 256 x 256-pixel images as input, the network achieved an overall sensitivity and specificity of 91% with an area under the curve of 0.964 for classifying a study as normal (n = 1203). For the abnormal categories, the sensitivity, specificity, and area under the curve, respectively, were 91%, 91%, and 0.962 for pleural effusion (n = 782), 82%, 82%, and 0.868 for pulmonary edema (n = 356), 74%, 75%, and 0.850 for consolidation (n = 214), 81%, 80%, and 0.875 for cardiomegaly (n = 482), and 78%, 78%, and 0.861 for pneumothorax (n = 167). Conclusions: Current deep CNN architectures can be trained with modest-sized medical data sets to achieve clinically useful performance at detecting and excluding common pathology on chest radiographs.
引用
收藏
页码:281 / 287
页数:7
相关论文
共 16 条
[1]   DIAGNOSTIC-TESTS-2 - PREDICTIVE VALUES .4. [J].
ALTMAN, DG ;
BLAND, JM .
BRITISH MEDICAL JOURNAL, 1994, 309 (6947) :102-102
[2]  
[Anonymous], 13112524 ARXIV
[3]  
[Anonymous], 2015, 151203385 ARXIV
[4]  
[Anonymous], 160406154 ARXIV
[5]  
[Anonymous], 2014, 13126199V4 ARXIV
[6]  
[Anonymous], 2013, McKinsey Quart
[7]  
[Anonymous], 160302518V2 ARXIV
[8]  
[Anonymous], 2015, SPIE MED IMAGING
[9]   Automated Classification of Usual Interstitial Pneumonia Using Regional Volumetric Texture Analysis in High-Resolution Computed Tomography [J].
Depeursinge, Adrien ;
Chin, Anne S. ;
Leung, Ann N. ;
Terrone, Donato ;
Bristow, Michael ;
Rosen, Glenn ;
Rubin, Daniel L. .
INVESTIGATIVE RADIOLOGY, 2015, 50 (04) :261-267
[10]   Automatic Detection of Cerebral Microbleeds From MR Images via 3D Convolutional Neural Networks [J].
Dou, Qi ;
Chen, Hao ;
Yu, Lequan ;
Zhao, Lei ;
Qin, Jing ;
Wang, Defeng ;
Mok, Vincent C. T. ;
Shi, Lin ;
Heng, Pheng-Ann .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2016, 35 (05) :1182-1195