Hui and Walter's latent-class reference-free approach may be more useful in assessing agreement than diagnostic performance

被引:16
作者
Bertrand, P
Bénichou, J
Grenier, P
Chastang, C
机构
[1] Fac Med Tours, Lab Biostat Epidemiol & Informat Med, F-37032 Tours, France
[2] CHU Rouen, Unite Biostat, F-76031 Rouen, France
[3] Grp Hosp Pitie Salpetriere, Serv Radiol, F-75651 Paris, France
[4] Inst Natl Sante & Rech Med, INSERM U494, F-75651 Paris, France
[5] Hop St Louis, Dept Biostat & Med Informat, F-75475 Paris, France
关键词
estimation; sensitivity; specificity; latent class model; observer agreement; reliability; conditional independence assumption;
D O I
10.1016/j.jclinepi.2004.10.021
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background and Objective: Hui and Walter developed a latent class approach to assess the accuracy of a diagnostic procedure when no reference test is available. Our objective was to compare sensitivity and specificity estimates obtained with this reference-free approach and standard approaches, and to examine how and why they differed on a computerized tomography (CT) scan case study. Study Design and Setting: We compared two sets of sensitivity and specificity estimates from four radiologists independently assessing tumoral and lymph node extension of 85 lung cancer patients with preoperative thoracic CT scan, those obtained relative to pathology findings from surgical specimens (reference set), and those derived from Hui and Walter's approach. Results: The two sets of estimates significantly and markedly differed from each other. From simulations, we found that small-sample bias in Hui and Walter's estimates could be a major factor in explaining this difference. Furthermore, errors in pathology findings could account for part of this difference. Finally, our analyses revealed that the latent classes may differ intrinsically from the reference classes as defined from pathology findings and may have a different interpretation. Conclusion: Diagnostic parameters estimated with respect to latent classes may be more useful in providing a complete assessment of interobserver agreement than in assessing diagnostic performance. (c) 2005 Elsevier Inc. All rights reserved.
引用
收藏
页码:688 / 700
页数:13
相关论文
共 41 条
[1]  
Agresti A, 1992, Stat Methods Med Res, V1, P201, DOI 10.1177/096228029200100205
[2]   SURVIVAL DETERMINANTS IN EXTENSIVE-STAGE NON-SMALL-CELL LUNG-CANCER - THE SOUTHWEST-ONCOLOGY-GROUP EXPERIENCE [J].
ALBAIN, KS ;
CROWLEY, JJ ;
LEBLANC, M ;
LIVINGSTON, RB .
JOURNAL OF CLINICAL ONCOLOGY, 1991, 9 (09) :1618-1626
[3]  
BERTRAND P, 1994, REV EPIDEMIOL SANTE, V42, P502
[4]   REFERENCE TEST ERRORS BIAS THE EVALUATION OF DIAGNOSTIC-TESTS FOR ISCHEMIC HEART-DISEASE [J].
BOYKO, EJ ;
ALDERMAN, BW ;
BARON, AE .
JOURNAL OF GENERAL INTERNAL MEDICINE, 1988, 3 (05) :476-481
[5]   A COEFFICIENT OF AGREEMENT FOR NOMINAL SCALES [J].
COHEN, J .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1960, 20 (01) :37-46
[6]  
Dawid Alexander Philip, 1979, Journal of the Royal Statistical Society, V1979, P20, DOI [10.2307/2346806, DOI 10.2307/2346806]
[7]   USING LATENT CLASS MODELS TO CHARACTERIZE AND ASSESS RELATIVE ERROR IN DISCRETE MEASUREMENTS [J].
ESPELAND, MA ;
HANDELMAN, SL .
BIOMETRICS, 1989, 45 (02) :587-599
[9]  
Formann A K, 1996, Stat Methods Med Res, V5, P179, DOI 10.1177/096228029600500205
[10]   MEASUREMENT ERRORS IN CARIES DIAGNOSIS - SOME FURTHER LATENT CLASS MODELS [J].
FORMANN, AK .
BIOMETRICS, 1994, 50 (03) :865-871