THE UNSEEN SAMPLE IN COHORT STUDIES - ESTIMATION OF ITS SIZE AND EFFECT

被引:23
作者
HOOVER, DR
MUNOZ, A
CAREY, V
ODAKA, N
TAYLOR, JMG
CHMIEL, JS
ARMSTRONG, J
VERMUND, SH
机构
[1] Departments of Epidemiology and Biostatistics, Johns Hopkins School of Public Health, Baltimore, Maryland, 21205
[2] Department of Biostatistics, UCLA School of Public Health and Jonsson Comprehensive Cancer Center, Los Angeles, California, 90024-1714
[3] Cancer Center Biometry Section, Northwestern University Medical School, Chicago, Illinois, 60611-4402
[4] Departments of Infectious Diseases and Microbiology, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, Pennsylvania, 15261
[5] Epidemiology Branch, National Institute of Allergy and Infectious Diseases, Bethesda, Maryland, 20892
关键词
D O I
10.1002/sim.4780101212
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Recruitment of disease-free subjects into cohort studies and measurement of their time from exposure/infection to disease selectively excludes individuals (the unseen sample) who had earlier exposure and who have shorter times to disease. The unseen and observed samples may differ in other characteristics in addition to incubation period and exposure/infection time. For data with known truncation times, we develop non-parametric maximum likelihood estimates of the size, exposure/infection dates and distribution of incubation time in the unseen simple. We provide procedures to estimate and compensate for the biasing effects due to exclusion of the unseen sample in descriptive and survival analysis. We give consistency properties of these estimates and assess variability using bootstrap methods. One can use imputation to derive the above estimates from data with unknown truncation times that have been estimated parametrically. Application is made to an AIDS cohort study of over 5000 homosexual men.. Important estimates obtained from this application are the annual seroconversion rates from 1978 to 1983, not otherwise obtainable in this study population.
引用
收藏
页码:1993 / 2003
页数:11
相关论文
共 14 条
[1]  
Kaslow R.A., Ostrow D.G., Detels R., Phair J.P., Polk B.F., Rinaldo C.R., The Multicenter AIDS Cohort Study: rationale, organization and selected characteristics of the participants, American Journal of Epidemiology, 126, pp. 310-318, (1987)
[2]  
Chmiel J.S., Detels R., Kaslow R.A., Van Raden M., Kingsley L.A., Brookmeyer R., Factors associated with prevalent human immunodeficiency virus (HIV) infection in the Multicenter AIDS Cohort Study, American Journal of Epidemiology, 128, pp. 568-577, (1987)
[3]  
Kingsley L.A., Detels R., Kaslow R., Polk B.F., Rinaldo C.R., Chmiel J., Detre K., Kelsey S.F., Odaka N., Ostrow D., Van Raden M., Visscher B., Risk factors for seroconversion to human immunodeficiency virus among male homosexuals. Results from the Multicenter AIDS Cohort Study, Lancet, 1, pp. 345-349, (1987)
[4]  
Winkelstein W., Lyman D.M., Padian N., Grant R., Samuel M., Wiley J.A., Anderson R.E., Lang W., Riggs J., Levy J.A., Sexual practices and risk of infection by the human immunodeficiency virus: the San Francisco Men's Health Study, Journal of the American Medical Association, 257, pp. 321-325, (1987)
[5]  
Munoz A., Wang M-C., Bass S., Taylor J.M.G., Kingsley L.A., Chmiel J.S., Polk B.F., Acquired immunodeficiency syndrome (AIDS) ‐ free time after human immunodeficiency virus type 1 (HIV‐1) seroconversion in homosexual men, American Journal of Epidemiology, 130, pp. 530-539, (1989)
[6]  
Taylor J., Munoz A., Bass S.M., Saah A.J., Chmiel J.S., Kingsley L.A., Estimating the distribution of time from HIV seroconversion to AIDS using multiple imputation, Statistics in Medicine, 9, pp. 505-514, (1990)
[7]  
Woodroofe M., Estimating a distribution function with truncated data, The Annals of Statistics, 13, pp. 163-177, (1985)
[8]  
Turnbull B., The empirical distribution function with arbitrarily grouped, censored and truncated data, Journal of the Royal Statistical Society, Series B, 38, pp. 290-295, (1976)
[9]  
Wang M-C., Jewell N., Tsai W-Y., Asymptotic properties of the product limit estimate under random truncation, The Annals of Statistics, 14, pp. 1597-1605, (1986)
[10]  
Efron B., The two sample problem with censored data, Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, 4, pp. 831-853, (1967)