Toward a standard for the evaluation of PET-Auto-Segmentation methods following the recommendations of AAPM task group No. 211: Requirements and implementation

被引:30
作者
Berthon, Beatrice [1 ]
Spezi, Emiliano [1 ]
Galavis, Paulina
Shepherd, Tony [4 ]
Apte, Aditya
Hatt, Mathieu
Fayad, Hadi [3 ,4 ]
De Bernardi, Elisabetta
Soffientini, Chiara D. [4 ]
Schmidtlein, C. Ross
El Naqa, Issam [3 ,4 ]
Jeraj, Robert [4 ]
Lu, Wei [4 ]
Das, Shiva
Zaidi, Habib [3 ]
Mawlawi, Osama R. [2 ,4 ]
Visvikis, Dimitris
Lee, John A.
Kirov, Assen S.
机构
[1] ESPCI Paris, PSL Res Univ, CNRS UMR 7587, Inst Langevin, F-75012 Paris, France
[2] Cardiff Univ, Sch Engn, Cardiff CF24 3AA, Wales
[3] Cardiff Univ, Sch Engn, Cardiff CF24 3AA, Wales
[4] New York Univ, Dept Radiat Oncol, Langone Med Ctr, New York, NY 10016 USA
基金
瑞士国家科学基金会;
关键词
conformity index; outlining assessment; PET; CT; PET segmentation; AUTOMATIC SEGMENTATION; IMAGE SEGMENTATION; STRATEGIES; ALGORITHM; HEAD;
D O I
10.1002/mp.12312
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
100231 [临床病理学]; 100902 [航空航天医学];
摘要
Purpose: The aim of this paper is to define the requirements and describe the design and implementation of a standard benchmark tool for evaluation and validation of PET-auto-segmentation (PET-AS) algorithms. This work follows the recommendations of Task Group 211 (TG211) appointed by the American Association of Physicists in Medicine (AAPM). Methods: The recommendations published in the AAPM TG211 report were used to derive a set of required features and to guide the design and structure of a benchmarking software tool. These items included the selection of appropriate representative data and reference contours obtained from established approaches and the description of available metrics. The benchmark was designed in a way that it could be extendable by inclusion of bespoke segmentation methods, while maintaining its main purpose of being a standard testing platform for newly developed PET-AS methods. An example of implementation of the proposed framework, named PETASset, was built. In this work, a selection of PET-AS methods representing common approaches to PET image segmentation was evaluated within PETASset for the purpose of testing and demonstrating the capabilities of the software as a benchmark platform. Results: A selection of clinical, physical, and simulated phantom data, including "best estimates" reference contours from macroscopic specimens, simulation template, and CT scans was built into the PETASset application database. Specific metrics such as Dice Similarity Coefficient (DSC), Positive Predictive Value (PPV), and Sensitivity (S), were included to allow the user to compare the results of any given PET-AS algorithm to the reference contours. In addition, a tool to generate structured reports on the evaluation of the performance of PET-AS algorithms against the reference contours was built. The variation of the metric agreement values with the reference contours across the PET-AS methods evaluated for demonstration were between 0.51 and 0.83, 0.44 and 0.86, and 0.61 and 1.00 for DSC, PPV, and the S metric, respectively. Examples of agreement limits were provided to show how the software could be used to evaluate a new algorithm against the existing state-of-the art. Conclusions: PETASset provides a platform that allows standardizing the evaluation and comparison of different PET-AS methods on a wide range of PET datasets. The developed platform will be available to users willing to evaluate their PET-AS methods and contribute with more evaluation datasets. (C) 2017 The Authors. Medical Physics published by Wiley Periodicals, Inc. on behalf of American Association of Physicists in Medicine.
引用
收藏
页码:4098 / 4111
页数:14
相关论文
共 26 条
[1]
Tools for consensus analysis of experts' contours for radiotherapy structure definitions [J].
Allozi, Rawon ;
Li, X. Allen ;
White, Julia ;
Apte, Aditya ;
Tai, An ;
Michalski, Jeff M. ;
Bosch, Walter R. ;
El Naqa, Issam .
RADIOTHERAPY AND ONCOLOGY, 2010, 97 (03) :572-578
[2]
Head and neck target delineation using a novel PET automatic segmentation algorithm [J].
Berthon, B. ;
Evans, M. ;
Marshall, C. ;
Palaniappan, N. ;
Cole, N. ;
Jayaprakasam, V. ;
Rackley, T. ;
Spezi, E. .
RADIOTHERAPY AND ONCOLOGY, 2017, 122 (02) :242-247
[3]
Influence of cold walls on PET image quantification and volume segmentation: A phantom study [J].
Berthon, B. ;
Marshall, C. ;
Edwards, A. ;
Evans, M. ;
Spezi, E. .
MEDICAL PHYSICS, 2013, 40 (08)
[4]
ATLAAS: an automatic decision tree-based learning algorithm for advanced image segmentation in positron emission tomography [J].
Berthon, Beatrice ;
Marshall, Christopher ;
Evans, Mererid ;
Spezi, Emiliano .
PHYSICS IN MEDICINE AND BIOLOGY, 2016, 61 (13) :4855-4869
[5]
PETSTEP: Generation of synthetic PET lesions for fast evaluation of segmentation methods [J].
Berthon, Beatrice ;
Haggstrom, Ida ;
Apte, Aditya ;
Beattie, Bradley J. ;
Kirov, Assen S. ;
Humm, John L. ;
Marshall, Christopher ;
Spezi, Emiliano ;
Larsson, Anne ;
Schmidtlein, C. Ross .
PHYSICA MEDICA-EUROPEAN JOURNAL OF MEDICAL PHYSICS, 2015, 31 (08) :969-980
[6]
Positron Emission Tomography Imaging of Cancer Biology: Current Status and Future Prospects [J].
Chen, Kai ;
Chen, Xiaoyuan .
SEMINARS IN ONCOLOGY, 2011, 38 (01) :70-86
[7]
Tri-dimensional automatic segmentation of PET volumes based on measured source-to-background ratios:: influence of reconstruction algorithms [J].
Daisne, JF ;
Sibomana, M ;
Bol, A ;
Doumont, T ;
Lonneux, M ;
Grégoire, V .
RADIOTHERAPY AND ONCOLOGY, 2003, 69 (03) :247-250
[8]
CERR: A computational environment for radiotherapy research [J].
Deasy, JO ;
Blanco, AI ;
Clark, VH .
MEDICAL PHYSICS, 2003, 30 (05) :979-985
[9]
DUBUISSON MP, 1994, INT C PATT RECOG, P566, DOI 10.1109/ICPR.1994.576361
[10]
Technology insight: PET and PET/CT in head and neck tumor staging and radiation therapy planning [J].
Frank, SJ ;
Chao, KSC ;
Schwartz, DL ;
Weber, RS ;
Apisarnthanarax, S ;
Macapinlac, HA .
NATURE CLINICAL PRACTICE ONCOLOGY, 2005, 2 (10) :526-533