On estimating diagnostic accuracy from studies with multiple raters and partial gold standard evaluation

被引:26
作者
Albert, Paul S. [1 ]
Dodd, Lori E. [1 ]
机构
[1] NCI, Div Canc Treatment & Diag, Biometr Branch, Bethesda, MD 20892 USA
关键词
diagnostic error; latent class models; misclassification; semilatent class models;
D O I
10.1198/016214507000000329
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We are often interested in estimating sensitivity and specificity of a group of raters or a set of new diagnostic tests in situations in which gold standard evaluation is expensive or invasive. Numerous authors have proposed latent modeling approaches for estimating diagnostic error without a gold standard. Albert and Dodd showed that, when modeling without a gold standard, estimates of diagnostic error can be biased when the dependence structure between tests is misspecified. In addition, they showed that choosing between different models for this dependence structure is difficult in most practical situations. While these results caution against using these latent class models, the difficulties of obtaining gold standard verification remain a practical reality. We extend two classes of models to provide a compromise that collects gold standard information on a subset of subjects but incorporates information from both the verified and nonverified subjects during estimation. We examine the robustness of diagnostic error estimation with this approach and show that choosing between competing models is easier in this context. In our analytic work and simulations, we consider situations in which verification is completely at random as well as settings in which the probability of verification depends on the actual test results. We apply our methodological work to a study designed to estimate the diagnostic error of digital radiography for gastric cancer.
引用
收藏
页码:61 / 73
页数:13
相关论文
共 24 条
[1]   A cautionary note on the robustness of latent class models for estimating diagnostic error without a gold standard [J].
Albert, PS ;
Dodd, LE .
BIOMETRICS, 2004, 60 (02) :427-435
[2]   Latent class modeling approaches for assessing diagnostic error without a gold standard: With applications to p53 immunohistochemical assays in bladder tumors [J].
Albert, PS ;
McShane, LM ;
Shih, JH .
BIOMETRICS, 2001, 57 (02) :610-619
[3]  
Alonzo TA, 1999, STAT MED, V18, P2987, DOI 10.1002/(SICI)1097-0258(19991130)18:22<2987::AID-SIM205>3.0.CO
[4]  
2-B
[5]  
BAHADUR RR, 1961, STUDIES ITEM ANAL PR, P169
[6]   EVALUATING MULTIPLE DIAGNOSTIC-TESTS - WITH PARTIAL VERIFICATION [J].
BAKER, SG .
BIOMETRICS, 1995, 51 (01) :330-337
[7]   ASSESSMENT OF DIAGNOSTIC-TESTS WHEN DISEASE VERIFICATION IS SUBJECT TO SELECTION BIAS [J].
BEGG, CB ;
GREENES, RA .
BIOMETRICS, 1983, 39 (01) :207-215
[8]  
COX DR, 1972, J ROY STAT SOC C-APP, V21, P113, DOI 10.2307/2346482
[9]  
Efron B., 1993, INTRO BOOTSTRAP MONO, DOI DOI 10.1201/9780429246593
[10]   Misspecified maximum likelihood estimates and generalised linear mixed models [J].
Heagerty, PJ ;
Kurland, BF .
BIOMETRIKA, 2001, 88 (04) :973-985