Spine-GAN: Semantic segmentation of multiple spinal structures

被引:148
作者
Han, Zhongyi [1 ,2 ,3 ,4 ]
Wei, Benzheng [1 ,2 ]
Mercado, Ashley [3 ,4 ]
Leung, Stephanie [3 ,4 ]
Li, Shuo [3 ,4 ]
机构
[1] Shandong Univ Tradit Chinese Med, Coll Sci & Technol, Jinan, SD, Peoples R China
[2] Shandong Univ Tradit Chinese Med, Computat Med Lab, Jinan, SD, Peoples R China
[3] DIG, London, ON, Canada
[4] Western Univ, Dept Med Imaging, London, ON, Canada
关键词
Spine; Magnetic resonance imaging; Segmentation; Classification; Generative adversarial network; LSTM; Autoencoder; Computer-aided detection and diagnosis; MRI GRADING SYSTEM; INTERVERTEBRAL FORAMEN; VERTEBRA DETECTION; LUMBAR; CT; PATHOGENESIS; NETWORKS; STENOSIS; DISCS; PIXEL;
D O I
10.1016/j.media.2018.08.005
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
Spinal clinicians still rely on laborious workloads to conduct comprehensive assessments of multiple spinal structures in MRIs, in order to detect abnormalities and discover possible pathological factors. The objective of this work is to perform automated segmentation and classification (i.e., normal and abnormal) of intervertebral discs, vertebrae, and neural foramen in MRIs in one shot, which is called semantic segmentation that is extremely urgent to assist spinal clinicians in diagnosing neural foraminal stenosis, disc degeneration, and vertebral deformity as well as discovering possible pathological factors. However, no work has simultaneously achieved the semantic segmentation of intervertebral discs, vertebrae, and neural foramen due to three-fold unusual challenges: I) Multiple tasks, i.e., simultaneous semantic segmentation of multiple spinal structures, are more difficult than individual tasks; 2) Multiple targets: average 21 spinal structures per MRI require automated analysis yet have high variety and variability; 3) Weak spatial correlations and subtle differences between normal and abnormal structures generate dynamic complexity and indeterminacy. In this paper, we propose a Recurrent Generative Adversarial Network called Spine-GAN for resolving above-aforementioned challenges. Firstly, Spine-GAN explicitly solves the high variety and variability of complex spinal structures through an atrous convolution (i.e., convolution with holes) autoencoder module that is capable of obtaining semantic task-aware representation and preserving fine-grained structural information. Secondly, Spine-GAN dynamically models the spatial pathological correlations between both normal and abnormal structures thanks to a specially designed long short-term memory module. Thirdly, Spine-GAN obtains reliable performance and efficient generalization by leveraging a discriminative network that is capable of correcting predicted errors and global-level contiguity. Extensive experiments on MRIs of 253 patients have demonstrated that Spine-GAN achieves high pixel accuracy of 96.2%, Dice coefficient of 87.1%, Sensitivity of 89.1% and Specificity of 86.0%, which reveals its effectiveness and potential as a clinical tool. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:23 / 35
页数:13
相关论文
共 75 条
[1]
Abadi M., 2016, TENSORFLOW LARGESCAL
[2]
Toward a clinical lumbar CAD: herniation diagnosis [J].
Alomari, Raja' S. ;
Corso, Jason J. ;
Chaudhary, Vipin ;
Dhillon, Gurmeet .
INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2011, 6 (01) :119-126
[3]
Labeling of Lumbar Discs Using Both Pixel- and Object-Level Features With a Two-Level Probabilistic Model [J].
Alomari, Raja' S. ;
Corso, Jason J. ;
Chaudhary, Vipin .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2011, 30 (01) :1-10
[4]
[Anonymous], 2015, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2015.7298977
[5]
[Anonymous], 2017, IEEE transactions on pattern analysis and machine intelligence, DOI [10.1109/TPAMI.2016.2644615, DOI 10.1109/TPAMI.2016.2644615]
[6]
[Anonymous], 2017, ARXIV170102870
[7]
Cai Y, 2017, P SOC PHOTO-OPT INS
[8]
Multi-Modality Vertebra Recognition in Arbitrary Views Using 3D Deformable Hierarchical Model [J].
Cai, Yunliang ;
Osman, Said ;
Sharma, Manas ;
Landis, Mark ;
Li, Shuo .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2015, 34 (08) :1676-1693
[9]
Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[10]
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848