Stability of FDG-PET Radiomics features: An integrated analysis of test-retest and inter-observer variability

Cited by: 351
Authors
Leijenaar, Ralph T. H. [1]
Carvalho, Sara [1]
Velazquez, Emmanuel Rios [1]
Van Elmpt, Wouter J. C. [1]
Parmar, Chintan [1]
Hoekstra, Otto S. [2]
Hoekstra, Corneline J. [3]
Boellaard, Ronald [2]
Dekker, Andre L. A. J. [1]
Gillies, Robert J. [4]
Aerts, Hugo J. W. L. [1, 5, 6]
Lambin, Philippe [1]
Affiliations
[1] MUMC, Dept Radiat Oncol MAASTRO, GROW Sch Oncol & Dev Biol, Maastricht, Netherlands
[2] Vrije Univ Amsterdam Med Ctr, Dept Radiol & Nucl Med, Amsterdam, Netherlands
[3] Jeroen Bosch Med Ctr, Dept Nucl Med, 's-Hertogenbosch, Netherlands
[4] Univ S Florida, Coll Med, H Lee Moffitt Canc Ctr & Res Inst, Dept Canc Imaging & Metab, Tampa, FL 33612 USA
[5] Harvard Univ, Brigham & Womens Hosp, Dana Farber Canc Inst, Dept Radiat Oncol, Med Sch, Boston, MA 02115 USA
[6] Harvard Univ, Brigham & Womens Hosp, Dana Farber Canc Inst, Dept Radiol, Med Sch, Boston, MA 02115 USA
Keywords
CELL LUNG-CANCER; STANDARDIZED UPTAKE VALUE; RESPONSE ASSESSMENT; TEXTURAL FEATURES; F-18-FDG PET; RADIOTHERAPY; IMAGES; TUMOR; REPEATABILITY; CT
DOI
10.3109/0284186X.2013.812798
Chinese Library Classification number
R73 [Oncology]
Discipline classification code
100214
Abstract
Purpose. Besides basic measurements such as the maximum standardized uptake value (SUVmax) or SUVmean derived from 18F-FDG positron emission tomography (PET) scans, more advanced quantitative imaging features (i.e. "Radiomics" features) are increasingly investigated for treatment monitoring, outcome prediction, or as potential biomarkers. Given these prospective applications of Radiomics features, it is a requisite that they provide robust and reliable measurements. The aim of our study was therefore to perform an integrated stability analysis of a large number of PET-derived features in non-small cell lung carcinoma (NSCLC), based on both a test-retest and an inter-observer setup.
Methods. Eleven NSCLC patients were included in the test-retest cohort. Patients underwent repeated PET imaging within a one-day interval, before any treatment was delivered. Lesions were delineated by applying a threshold of 50% of the maximum uptake value within the tumor. Twenty-three NSCLC patients were included in the inter-observer cohort. Patients underwent a diagnostic whole-body PET-computed tomography (CT) scan. Lesions were manually delineated based on fused PET-CT, using a standardized clinical delineation protocol. Delineation was performed independently by five observers, blinded to each other. Fifteen first-order statistics, 39 descriptors of intensity volume histograms, eight geometric features and 44 textural features were extracted. For every feature, test-retest and inter-observer stability were assessed with the intra-class correlation coefficient (ICC) and the coefficient of variability, normalized to mean and range. Similarity between the test-retest and inter-observer stability rankings of features was assessed with Spearman's rank correlation coefficient.
Results. The majority of assessed features had both a high test-retest (71%) and a high inter-observer (91%) stability in terms of their ICC. Overall, features that were more stable in repeated PET imaging were also found to be more robust against inter-observer variability.
Conclusion. The results suggest that further research into quantitative imaging features is warranted for more advanced applications of PET imaging, such as treatment monitoring, outcome prediction, or imaging biomarkers.
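The three computational steps named in the Methods (delineation at 50% of the maximum uptake, per-feature ICC, and Spearman's rank correlation between stability rankings) can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not state which ICC form was used, so the one-way random-effects ICC(1,1) below is an assumption, and all function names (`threshold_mask`, `icc_oneway`, `average_ranks`, `spearman`) are hypothetical.

```python
from statistics import mean


def threshold_mask(suv, fraction=0.5):
    """Mark voxels at or above `fraction` of the maximum uptake value."""
    cutoff = fraction * max(suv)
    return [v >= cutoff for v in suv]


def icc_oneway(ratings):
    """One-way random-effects ICC(1,1); the ICC form is an assumption.

    `ratings` holds one row per lesion and one column per repeat
    (scan or observer); every row must have the same length k >= 2.
    """
    n, k = len(ratings), len(ratings[0])
    grand = mean(v for row in ratings for v in row)
    row_means = [mean(row) for row in ratings]
    # Mean squares between lesions and within lesions (one-way ANOVA).
    ms_between = k * sum((m - grand) ** 2 for m in row_means) / (n - 1)
    ms_within = sum(
        (v - m) ** 2 for row, m in zip(ratings, row_means) for v in row
    ) / (n * (k - 1))
    return (ms_between - ms_within) / (ms_between + (k - 1) * ms_within)


def average_ranks(values):
    """Ranks starting at 1, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        for idx in order[i:j + 1]:
            ranks[idx] = (i + j) / 2 + 1
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman's rho: Pearson correlation of the rank-transformed data."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5
```

In such a setup, `icc_oneway` would be called once per feature on the test-retest values and once on the five observers' values, and `spearman` would then compare the two resulting lists of per-feature ICCs. A library routine such as SciPy's `scipy.stats.spearmanr` could replace the hand-written rank correlation.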
Pages: 1391-1397
Number of pages: 7