Modeling causes of death: an integrated approach using CODEm

被引:365
作者
Foreman, Kyle J. [1 ]
Lozano, Rafael [1 ]
Lopez, Alan D. [2 ]
Murray, Christopher J. L. [1 ]
机构
[1] Univ Washington, Inst Hlth Metr & Evaluat, Seattle, WA 98121 USA
[2] Univ Queensland, Sch Populat Hlth, Herston, Qld 4006, Australia
关键词
cause of death; ensemble models; predictive validity; spatial-temporal models; maternal mortality; Global Burden of Disease; PUBLIC-HEALTH UTILITY; VARIABLE SELECTION; SYSTEMATIC ANALYSIS; MATERNAL DEATHS; CODING CHANGES; 187; COUNTRIES; MORTALITY; CERTIFICATES; TRENDS; ENSEMBLE;
D O I
10.1186/1478-7954-10-1
中图分类号
R1 [预防医学、卫生学];
学科分类号
100235 [预防医学];
摘要
Background: Data on causes of death by age and sex are a critical input into health decision-making. Priority setting in public health should be informed not only by the current magnitude of health problems but by trends in them. However, cause of death data are often not available or are subject to substantial problems of comparability. We propose five general principles for cause of death model development, validation, and reporting. Methods: We detail a specific implementation of these principles that is embodied in an analytical tool - the Cause of Death Ensemble model (CODEm) - which explores a large variety of possible models to estimate trends in causes of death. Possible models are identified using a covariate selection algorithm that yields many plausible combinations of covariates, which are then run through four model classes. The model classes include mixed effects linear models and spatial-temporal Gaussian Process Regression models for cause fractions and death rates. All models for each cause of death are then assessed using out-of-sample predictive validity and combined into an ensemble with optimal out-of-sample predictive performance. Results: Ensemble models for cause of death estimation outperform any single component model in tests of root mean square error, frequency of predicting correct temporal trends, and achieving 95% coverage of the prediction interval. We present detailed results for CODEm applied to maternal mortality and summary results for several other causes of death, including cardiovascular disease and several cancers. Conclusions: CODEm produces better estimates of cause of death trends than previous methods and is less susceptible to bias in model specification. We demonstrate the utility of CODEm for the estimation of several major causes of death.
引用
收藏
页数:23
相关论文
共 93 条
[1]
New estimates of maternal mortality and how to interpret them: choice or confusion? [J].
AbouZahr, Carla .
REPRODUCTIVE HEALTH MATTERS, 2011, 19 (37) :117-128
[2]
Improving the public health utility of global cardiovascular mortality data: the rise of ischemic heart disease [J].
Ahern, Ryan M. ;
Lozano, Rafael ;
Naghavi, Mohsen ;
Foreman, Kyle ;
Gakidou, Emmanuela ;
Murray, Christopher J. L. .
POPULATION HEALTH METRICS, 2011, 9
[3]
An integrated hydrologic Bayesian multimodel combination framework: Confronting input, parameter, and model structural uncertainty in hydrologic prediction [J].
Ajami, Newsha K. ;
Duan, Qingyun ;
Sorooshian, Soroosh .
WATER RESOURCES RESEARCH, 2007, 43 (01)
[4]
RELATIONSHIP BETWEEN VARIABLE SELECTION AND DATA AUGMENTATION AND A METHOD FOR PREDICTION [J].
ALLEN, DM .
TECHNOMETRICS, 1974, 16 (01) :125-127
[5]
Anderson R N, 2001, Natl Vital Stat Rep, V49, P1
[6]
[Anonymous], 1997, INT CLASSIFICATION D
[7]
Bell Robert M, 2007, SIGKDD Explorations, V9, P75
[8]
Bell Robert M., BELLKOR SOLUTION NET
[9]
Bell RobertM., 2010, Chance, V23, P24
[10]
Exposing misclassified HIV/AIDS deaths in South Africa [J].
Birnbaum, Jeanette Kurian ;
Murray, Christopher J. L. ;
Lozano, Rafael .
BULLETIN OF THE WORLD HEALTH ORGANIZATION, 2011, 89 (04) :278-285