Hui and Walter's latent-class reference-free approach may be more useful in assessing agreement than diagnostic performance

被引：16

作者：

Bertrand, P

Bénichou, J

Grenier, P

Chastang, C

机构：

[1] Fac Med Tours, Lab Biostat Epidemiol & Informat Med, F-37032 Tours, France

[2] CHU Rouen, Unite Biostat, F-76031 Rouen, France

[3] Grp Hosp Pitie Salpetriere, Serv Radiol, F-75651 Paris, France

[4] Inst Natl Sante & Rech Med, INSERM U494, F-75651 Paris, France

[5] Hop St Louis, Dept Biostat & Med Informat, F-75475 Paris, France

来源：

JOURNAL OF CLINICAL EPIDEMIOLOGY | 2005年 / 58卷 / 07期

关键词：

estimation; sensitivity; specificity; latent class model; observer agreement; reliability; conditional independence assumption;

D O I：

10.1016/j.jclinepi.2004.10.021

中图分类号：

R19 [保健组织与事业（卫生事业管理）];

学科分类号：

摘要：

Background and Objective: Hui and Walter developed a latent class approach to assess the accuracy of a diagnostic procedure when no reference test is available. Our objective was to compare sensitivity and specificity estimates obtained with this reference-free approach and standard approaches, and to examine how and why they differed on a computerized tomography (CT) scan case study. Study Design and Setting: We compared two sets of sensitivity and specificity estimates from four radiologists independently assessing tumoral and lymph node extension of 85 lung cancer patients with preoperative thoracic CT scan, those obtained relative to pathology findings from surgical specimens (reference set), and those derived from Hui and Walter's approach. Results: The two sets of estimates significantly and markedly differed from each other. From simulations, we found that small-sample bias in Hui and Walter's estimates could be a major factor in explaining this difference. Furthermore, errors in pathology findings could account for part of this difference. Finally, our analyses revealed that the latent classes may differ intrinsically from the reference classes as defined from pathology findings and may have a different interpretation. Conclusion: Diagnostic parameters estimated with respect to latent classes may be more useful in providing a complete assessment of interobserver agreement than in assessing diagnostic performance. (c) 2005 Elsevier Inc. All rights reserved.

引用

页码：688 / 700

页数：13

共 41 条

[11] COMPARISON OF A SCREENING TEST AND A REFERENCE TEST IN EPIDEMIOLOGIC STUDIES .2. A PROBABILISTIC MODEL FOR COMPARISON OF DIAGNOSTIC TESTS [J].