Spine-GAN: Semantic segmentation of multiple spinal structures

Cited by: 148

Authors:

Han, Zhongyi ^{[1,2,3,4]}

Wei, Benzheng ^{[1,2]}

Mercado, Ashley ^{[3,4]}

Leung, Stephanie ^{[3,4]}

Li, Shuo ^{[3,4]}

Affiliations:

[1] Shandong Univ Tradit Chinese Med, Coll Sci & Technol, Jinan, SD, Peoples R China

[2] Shandong Univ Tradit Chinese Med, Computat Med Lab, Jinan, SD, Peoples R China

[3] DIG, London, ON, Canada

[4] Western Univ, Dept Med Imaging, London, ON, Canada

Source:

MEDICAL IMAGE ANALYSIS | 2018 / Vol. 50

Keywords:

Spine; Magnetic resonance imaging; Segmentation; Classification; Generative adversarial network; LSTM; Autoencoder; Computer-aided detection and diagnosis; MRI GRADING SYSTEM; INTERVERTEBRAL FORAMEN; VERTEBRA DETECTION; LUMBAR; CT; PATHOGENESIS; NETWORKS; STENOSIS; DISCS; PIXEL

DOI:

10.1016/j.media.2018.08.005

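Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification code:

140502 [Artificial intelligence]

Abstract: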

Spinal clinicians still rely on laborious manual work to conduct comprehensive assessments of multiple spinal structures in MRIs, in order to detect abnormalities and discover possible pathological factors. The objective of this work is to perform automated segmentation and classification (i.e., normal and abnormal) of intervertebral discs, vertebrae, and neural foramen in MRIs in one shot. This task, called semantic segmentation, is urgently needed to assist spinal clinicians in diagnosing neural foraminal stenosis, disc degeneration, and vertebral deformity, as well as in discovering possible pathological factors. However, no work has simultaneously achieved the semantic segmentation of intervertebral discs, vertebrae, and neural foramen, owing to three unusual challenges: 1) Multiple tasks, i.e., simultaneous semantic segmentation of multiple spinal structures, are more difficult than individual tasks; 2) Multiple targets: an average of 21 spinal structures per MRI require automated analysis yet have high variety and variability; 3) Weak spatial correlations and subtle differences between normal and abnormal structures generate dynamic complexity and indeterminacy. In this paper, we propose a Recurrent Generative Adversarial Network called Spine-GAN for resolving the aforementioned challenges. Firstly, Spine-GAN explicitly addresses the high variety and variability of complex spinal structures through an atrous convolution (i.e., convolution with holes) autoencoder module that is capable of obtaining semantic task-aware representations and preserving fine-grained structural information. Secondly, Spine-GAN dynamically models the spatial pathological correlations between normal and abnormal structures thanks to a specially designed long short-term memory module. Thirdly, Spine-GAN obtains reliable performance and efficient generalization by leveraging a discriminative network that is capable of correcting prediction errors and global-level contiguity. Extensive experiments on MRIs of 253 patients have demonstrated that Spine-GAN achieves a high pixel accuracy of 96.2%, a Dice coefficient of 87.1%, a Sensitivity of 89.1%, and a Specificity of 86.0%, which reveals its effectiveness and potential as a clinical tool. © 2018 Elsevier B.V. All rights reserved.
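
For readers unfamiliar with the four metrics quoted in the abstract, the following is a minimal sketch of how they are commonly computed for a single binary mask. It uses the standard textbook definitions; the abstract does not state the exact formulas or the multi-class averaging used in the paper, so the function name `binary_segmentation_metrics`, the flat 0/1 mask layout, and the toy data are all assumptions made for illustration only.

```python
def binary_segmentation_metrics(pred, truth):
    """Standard metrics for two flat binary masks (1 = structure, 0 = background).

    Hypothetical helper for illustration; not taken from the paper.
    """
    if len(pred) != len(truth):
        raise ValueError("masks must have the same number of pixels")

    # Count the four confusion-matrix cells pixel by pixel.
    tp = sum(1 for p, t in zip(pred, truth) if p == 1 and t == 1)
    tn = sum(1 for p, t in zip(pred, truth) if p == 0 and t == 0)
    fp = sum(1 for p, t in zip(pred, truth) if p == 1 and t == 0)
    fn = sum(1 for p, t in zip(pred, truth) if p == 0 and t == 1)

    def ratio(num, den):
        # Return NaN when a metric is undefined (empty denominator).
        return num / den if den else float("nan")

    return {
        "pixel_accuracy": ratio(tp + tn, tp + tn + fp + fn),
        "dice": ratio(2 * tp, 2 * tp + fp + fn),
        "sensitivity": ratio(tp, tp + fn),
        "specificity": ratio(tn, tn + fp),
    }


# Toy 8-pixel example (made-up data, not from the study).
pred = [1, 1, 0, 0, 1, 0, 0, 0]
truth = [1, 0, 0, 0, 1, 1, 0, 0]
metrics = binary_segmentation_metrics(pred, truth)
print(metrics)
```

In a multi-structure setting such as the one the abstract describes, one plausible approach would be to compute these values per class and then average them, but that choice is not specified here.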

Citation

Pages: 23-35

Page count: 13
