Hui and Walter's latent-class reference-free approach may be more useful in assessing agreement than diagnostic performance

被引:16
作者
Bertrand, P
Bénichou, J
Grenier, P
Chastang, C
机构
[1] Fac Med Tours, Lab Biostat Epidemiol & Informat Med, F-37032 Tours, France
[2] CHU Rouen, Unite Biostat, F-76031 Rouen, France
[3] Grp Hosp Pitie Salpetriere, Serv Radiol, F-75651 Paris, France
[4] Inst Natl Sante & Rech Med, INSERM U494, F-75651 Paris, France
[5] Hop St Louis, Dept Biostat & Med Informat, F-75475 Paris, France
关键词
estimation; sensitivity; specificity; latent class model; observer agreement; reliability; conditional independence assumption;
D O I
10.1016/j.jclinepi.2004.10.021
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background and Objective: Hui and Walter developed a latent class approach to assess the accuracy of a diagnostic procedure when no reference test is available. Our objective was to compare sensitivity and specificity estimates obtained with this reference-free approach and standard approaches, and to examine how and why they differed on a computerized tomography (CT) scan case study. Study Design and Setting: We compared two sets of sensitivity and specificity estimates from four radiologists independently assessing tumoral and lymph node extension of 85 lung cancer patients with preoperative thoracic CT scan, those obtained relative to pathology findings from surgical specimens (reference set), and those derived from Hui and Walter's approach. Results: The two sets of estimates significantly and markedly differed from each other. From simulations, we found that small-sample bias in Hui and Walter's estimates could be a major factor in explaining this difference. Furthermore, errors in pathology findings could account for part of this difference. Finally, our analyses revealed that the latent classes may differ intrinsically from the reference classes as defined from pathology findings and may have a different interpretation. Conclusion: Diagnostic parameters estimated with respect to latent classes may be more useful in providing a complete assessment of interobserver agreement than in assessing diagnostic performance. (c) 2005 Elsevier Inc. All rights reserved.
引用
收藏
页码:688 / 700
页数:13
相关论文
共 41 条
[11]   COMPARISON OF A SCREENING TEST AND A REFERENCE TEST IN EPIDEMIOLOGIC STUDIES .2. A PROBABILISTIC MODEL FOR COMPARISON OF DIAGNOSTIC TESTS [J].
GART, JJ ;
BUCK, AA .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 1966, 83 (03) :593-+
[12]  
GOODMAN LA, 1974, BIOMETRIKA, V61, P215, DOI 10.1093/biomet/61.2.215
[13]  
Grenier P, 1989, DIAGN INTERVENT RADI, V1, P23
[14]   ANALYSIS OF CATEGORICAL DATA BY LINEAR MODELS [J].
GRIZZLE, JE ;
STARMER, CF ;
KOCH, GG .
BIOMETRICS, 1969, 25 (03) :489-&
[15]  
Guggenmoos-Holzmann I, 1998, STAT MED, V17, P797, DOI 10.1002/(SICI)1097-0258(19980430)17:8<797::AID-SIM776>3.0.CO
[16]  
2-G
[17]   The meaning of kappa: Probabilistic concepts of reliability and validity revisited [J].
GuggenmoosHolzmann, I .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 1996, 49 (07) :775-782
[18]   GOODNESS-OF-FIT TESTS FOR DISCRETE DATA - REVIEW AND AN APPLICATION TO A HEALTH IMPAIRMENT SCALE [J].
HORN, SD .
BIOMETRICS, 1977, 33 (01) :237-248
[19]   Latent class analysis of child behavior checklist attention problems [J].
Hudziak, JJ ;
Wadsworth, ME ;
Heath, AC ;
Achenbach, TM .
JOURNAL OF THE AMERICAN ACADEMY OF CHILD AND ADOLESCENT PSYCHIATRY, 1999, 38 (08) :985-991
[20]   ESTIMATING THE ERROR RATES OF DIAGNOSTIC-TESTS [J].
HUI, SL ;
WALTER, SD .
BIOMETRICS, 1980, 36 (01) :167-171