A multiple imputation approach to disclosure limitation for high-age individuals in longitudinal studies

被引:4
作者
An, Di [1 ]
Little, Roderick J. A. [2 ]
McNally, James W. [3 ]
机构
[1] Merck & Co Inc, Merck Res Labs, Upper Gwynedd, PA 19454 USA
[2] Univ Michigan, Dept Biostat, Ann Arbor, MI 48109 USA
[3] Univ Michigan, Inst Social Res, Ann Arbor, MI 48109 USA
关键词
confidentiality; disclosure protection; longitudinal data; multiple imputation; survival analysis; MICRODATA;
D O I
10.1002/sim.3974
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Disclosure limitation is an important consideration in the release of public use data sets. It is particularly challenging for longitudinal data sets, since information about an individual accumulates with repeated measures over time. Research on disclosure limitation methods for longitudinal data has been very limited. We consider here problems created by high ages in cohort studies. Because of the risk of disclosure, ages of very old respondents can often not be released; in particular, this is a specific stipulation of the Health Insurance Portability and Accountability Act (HIPAA) for the release of health data for individuals. Top-coding of individuals beyond a certain age is a standard way of dealing with this issue, and it may be adequate for cross-sectional data, when a modest number of cases are affected. However, this approach leads to serious loss of information in longitudinal studies when individuals have been followed for many years. We propose and evaluate an alternative to top-coding for this situation based on multiple imputation (MI). This MI method is applied to a survival analysis of simulated data, and data from the Charleston Heart Study (CHS), and is shown to work well in preserving the relationship between hazard and covariates. Copyright (C) 2010 John Wiley & Sons, Ltd.
引用
收藏
页码:1769 / 1778
页数:10
相关论文
共 16 条
[1]   Multiple imputation: an alternative to top coding for statistical disclosure control [J].
An, Di ;
Little, Roderick J. A. .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2007, 170 :923-940
[2]  
[Anonymous], 2003, Survey Methodology
[3]  
[Anonymous], J ROYAL STAT SOC B
[4]  
Drechsler J, 2008, LECT NOTES COMPUT SC, V5262, P227
[5]  
LITTLE R. J., 2019, Statistical analysis with missing data, V793
[6]  
Little R.J.A., 1993, J. Off. Stat, V9, P407
[7]  
LITTLE RJA, 2004, APPL BAYESIAN MODELI, P141
[8]  
Nietert P.J., 2000, CHARLESTON HEART STU
[9]  
Raghunathan TE., 2003, Journal of Official Statistics, V19, P1
[10]  
Reiter J. P., 2002, J. Off. Statist., V18, P531