False Discovery Rates in PET and CT Studies with Texture Features: A Systematic Review

被引:279
作者
Chalkidou, Anastasia [1 ]
O'Doherty, Michael J. [1 ]
Marsden, Paul K. [1 ]
机构
[1] St Thomas Hosp, Kings Coll London, Div Imaging Sci & Biomed Engn, London SE1 7EH, England
来源
PLOS ONE | 2015年 / 10卷 / 05期
基金
英国工程与自然科学研究理事会;
关键词
SMALL LUNG ADENOCARCINOMA; GROUND-GLASS OPACITY; TUMOR HETEROGENEITY; ESOPHAGEAL CANCER; PROGNOSTIC VALUE; FDG-PET; INTRATUMORAL HETEROGENEITY; COMPUTED-TOMOGRAPHY; POTENTIAL MARKER; PREDICTION;
D O I
10.1371/journal.pone.0124165
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Purpose A number of recent publications have proposed that a family of image-derived indices, called texture features, can predict clinical outcome in patients with cancer. However, the vestigation of multiple indices on a single data set can lead to significant inflation of type-I errors. We report a systematic review of the type-I error inflation in such studies and review the evidence regarding associations between patient outcome and texture features derived from positron emission tomography (PET) or computed tomography (CT) images. Methods For study identification PubMed and Scopus were searched (1/2000-9/2013) using combinations of the keywords texture, prognostic, predictive and cancer. Studies were divided into three categories according to the sources of the type-I error inflation and the use or not of an independent validation dataset. For each study, the true type-I error probability and the adjusted level of significance were estimated using the optimum cut-off approach correction, and the Benjamini-Hochberg method. To demonstrate explicitly the variable selection bias in these studies, we re-analyzed data from one of the published studies, but using 100 random variables substituted for the original image-derived indices. The significance of the random variables as potential predictors of outcome was examined using the analysis methods used in the identified studies. Results Fifteen studies were identified. After applying appropriate statistical corrections, an average type-I error probability of 76% (range: 34-99%) was estimated with the majority of published results not reaching statistical significance. Only 3/15 studies used a validation dataset. For the 100 random variables examined, 10% proved to be significant predictors of survival when subjected to ROC and multiple hypothesis testing analysis. Conclusions We found insufficient evidence to support a relationship between PET or CT texture features and patient survival. Further fit for purpose validation of these image-derived biomarkers should be supported by appropriate biological and statistical evidence before their association with patient outcome is investigated in prospective studies.
引用
收藏
页数:18
相关论文
共 70 条
[1]   Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach [J].
Aerts, Hugo J. W. L. ;
Velazquez, Emmanuel Rios ;
Leijenaar, Ralph T. H. ;
Parmar, Chintan ;
Grossmann, Patrick ;
Cavalho, Sara ;
Bussink, Johan ;
Monshouwer, Rene ;
Haibe-Kains, Benjamin ;
Rietveld, Derek ;
Hoebers, Frank ;
Rietbergen, Michelle M. ;
Leemans, C. Rene ;
Dekker, Andre ;
Quackenbush, John ;
Gillies, Robert J. ;
Lambin, Philippe .
NATURE COMMUNICATIONS, 2014, 5
[2]   DANGERS OF USING OPTIMAL CUTPOINTS IN THE EVALUATION OF PROGNOSTIC FACTORS [J].
ALTMAN, DG ;
LAUSEN, B ;
SAUERBREI, W ;
SCHUMACHER, M .
JOURNAL OF THE NATIONAL CANCER INSTITUTE, 1994, 86 (11) :829-835
[3]  
Altman DG, 2000, STAT MED, V19, P453, DOI 10.1002/(SICI)1097-0258(20000229)19:4<453::AID-SIM350>3.3.CO
[4]  
2-X
[5]   TEXTURAL FEATURES CORRESPONDING TO TEXTURAL PROPERTIES [J].
AMADASUN, M ;
KING, R .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1989, 19 (05) :1264-1274
[6]   Testing multiple statistical hypotheses resulted in spurious associations: A study of astrological signs and health [J].
Austin, Peter C. ;
Mamdani, Muhammad M. ;
Juurlink, David N. ;
Hux, Janet E. .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2006, 59 (09) :964-969
[7]   What you see may not be what you get: A brief, nontechnical introduction to overfitting in regression-type models [J].
Babyak, MA .
PSYCHOSOMATIC MEDICINE, 2004, 66 (03) :411-421
[8]   Predicting Future Morphological Changes of Lesions from Radiotracer Uptake in 18F-FDG-PET Images [J].
Bagci, Ulas ;
Yao, Jianhua ;
Miller-Jaster, Kirsten ;
Chen, Xinjian ;
Mollura, Daniel J. .
PLOS ONE, 2013, 8 (02)
[9]   Reproducibility and Prognosis of Quantitative Features Extracted from CT Images [J].
Balagurunathan, Yoganand ;
Gu, Yuhua ;
Wang, Hua ;
Kumar, Virendra ;
Grove, Olya ;
Hawkins, Sam ;
Kim, Jongphil ;
Goldgof, Dmitry B. ;
Hall, Lawrence O. ;
Gatenby, Robert A. ;
Gillies, Robert J. .
TRANSLATIONAL ONCOLOGY, 2014, 7 (01) :72-87
[10]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300