Enhanced secondary analysis of survival data: reconstructing the data from published Kaplan-Meier survival curves

被引:1862
作者
Guyot, Patricia [1 ,2 ]
Ades, A. E. [1 ]
Ouwens, Mario J. N. M. [2 ]
Welton, Nicky J. [1 ]
机构
[1] Univ Bristol, Sch Social & Community Med, Bristol BS8 2PS, Avon, England
[2] Mapi Consultancy, NL-3955 AX Houten, Netherlands
基金
英国医学研究理事会;
关键词
Survival analysis; Individual Patient Data; Kaplan-Meier; algorithm; life-table; Cost-Effectiveness Analysis; Health Technology Assessment; DATA METAANALYSIS; MEDIAN SURVIVAL; META-ANALYSIS; CANCER; REPRODUCIBILITY; REPEATABILITY;
D O I
10.1186/1471-2288-12-9
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
100404 [儿少卫生与妇幼保健学];
摘要
Background: The results of Randomized Controlled Trials (RCTs) on time-to-event outcomes that are usually reported are median time to events and Cox Hazard Ratio. These do not constitute the sufficient statistics required for meta-analysis or cost-effectiveness analysis, and their use in secondary analyses requires strong assumptions that may not have been adequately tested. In order to enhance the quality of secondary data analyses, we propose a method which derives from the published Kaplan Meier survival curves a close approximation to the original individual patient time-to-event data from which they were generated. Methods: We develop an algorithm that maps from digitised curves back to KM data by finding numerical solutions to the inverted KM equations, using where available information on number of events and numbers at risk. The reproducibility and accuracy of survival probabilities, median survival times and hazard ratios based on reconstructed KM data was assessed by comparing published statistics (survival probabilities, medians and hazard ratios) with statistics based on repeated reconstructions by multiple observers. Results: The validation exercise established there was no material systematic error and that there was a high degree of reproducibility for all statistics. Accuracy was excellent for survival probabilities and medians, for hazard ratios reasonable accuracy can only be obtained if at least numbers at risk or total number of events are reported. Conclusion: The algorithm is a reliable tool for meta-analysis and cost-effectiveness analyses of RCTs reporting time-to-event data. It is recommended that all RCTs should report information on numbers at risk and total number of events alongside KM curves.
引用
收藏
页数:13
相关论文
共 34 条
[31]
Confidence intervals in repeatability and reproducibility using the Bootstrap method [J].
Wang, FK ;
Eldon, YL .
TOTAL QUALITY MANAGEMENT & BUSINESS EXCELLENCE, 2003, 14 (03) :341-354
[32]
A GENERAL PARAMETRIC APPROACH TO THE METAANALYSIS OF RANDOMIZED CLINICAL-TRIALS [J].
WHITEHEAD, A ;
WHITEHEAD, J .
STATISTICS IN MEDICINE, 1991, 10 (11) :1665-1677
[33]
Individual patient data meta-analysis of randomized anti-epilepatic drug monotherapy trials [J].
Williamson, PR ;
Marson, AG ;
Tudur, C ;
Hutton, JL ;
Chadwich, D .
JOURNAL OF EVALUATION IN CLINICAL PRACTICE, 2000, 6 (02) :205-214
[34]
Aggregate data meta-analysis with time-to-event outcomes [J].
Williamson, PR ;
Smith, CT ;
Hutton, JL ;
Marson, AG .
STATISTICS IN MEDICINE, 2002, 21 (22) :3337-3351