Estimating disease prevalence in two-phase studies

被引:52
作者
Alonzo, TA
Pepe, MS
Lumley, T
机构
[1] Univ So Calif, Keck Sch Med, Childrens Oncol Grp, Arcadia, CA 91066 USA
[2] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
关键词
double sampling; imputation; inverse probability weighting; mean score; semiparametric; surrogate outcome; verification bias;
D O I
10.1093/biostatistics/4.2.313
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Disease prevalence is ideally estimated using a 'gold standard' to ascertain true disease status on all subjects in a population of interest. In practice, however, the gold standard may be too costly or invasive to be applied to all subjects, in which case a two-phase design is often employed. Phase I data consisting of inexpensive and non-invasive screening tests on all study subjects are used to determine the subjects that receive the gold standard in the second phase. Naive estimates of prevalence in two-phase studies can be biased (verification bias). Imputation and re-weighting estimators are often used to avoid this bias. We contrast the forms and attributes of the various prevalence estimators. Distribution theory and simulation studies are used to investigate their bias and efficiency. We conclude that the semiparametric efficient approach is the preferred method for prevalence estimation in two-phase studies. It is more robust and comparable in its efficiency to imputation and other re-weighting estimators. It is also easy to implement. We use this approach to examine the prevalence of depression in adolescents with data from the Great Smoky Mountain Study.
引用
收藏
页码:313 / 326
页数:14
相关论文
共 23 条
[1]  
ANGOLD A, 1995, PSYCHOL MED, V25, P739, DOI 10.1017/S003329170003498X
[2]  
[Anonymous], 1987, DIAGN STAT MAN MENT
[3]   ASSESSMENT OF DIAGNOSTIC-TESTS WHEN DISEASE VERIFICATION IS SUBJECT TO SELECTION BIAS [J].
BEGG, CB ;
GREENES, RA .
BIOMETRICS, 1983, 39 (01) :207-215
[4]   INFORMATION AND ASYMPTOTIC EFFICIENCY IN PARAMETRIC NONPARAMETRIC MODELS [J].
BEGUN, JM ;
HALL, WJ ;
HUANG, WM ;
WELLNER, JA .
ANNALS OF STATISTICS, 1983, 11 (02) :432-452
[5]   Analysis of longitudinal binary data from multiphase sampling [J].
Clayton, D ;
Spiegelhalter, D ;
Dunn, G ;
Pickles, A .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1998, 60 :71-87
[6]  
Costello EJ, 1996, ARCH GEN PSYCHIAT, V53, P1129
[7]  
Gao SJ, 2000, STAT MED, V19, P2101, DOI 10.1002/1097-0258(20000830)19:16<2101::AID-SIM523>3.0.CO
[8]  
2-G
[9]  
Hastie T., 1990, Generalized additive model
[10]   A GENERALIZATION OF SAMPLING WITHOUT REPLACEMENT FROM A FINITE UNIVERSE [J].
HORVITZ, DG ;
THOMPSON, DJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1952, 47 (260) :663-685