Study design for calibration of clinical examiners measuring periodontal parameters

被引:36
作者
Hill, Elizabeth G.
Slate, Elizabeth H.
Wiegand, Ryan E.
Grossi, Sara G.
Salinas, Carlos F.
机构
[1] Med Univ S Carolina, Dept Biostat Bioinformat & Epidemiol, Charleston, SC 29425 USA
[2] SUNY Buffalo, Sch Dent Med, Buffalo, NY 14260 USA
[3] Med Univ S Carolina, Coll Dent Med, Charleston, SC 29425 USA
关键词
calibration; correlation of data; periodontal diseases; periodontal pocket; reproducibility of results;
D O I
10.1902/jop.2006.050395
中图分类号
R78 [口腔科学];
学科分类号
1003 ;
摘要
Background: We present an approach to examiner calibration study design where the number of calibration subjects is based on a specified margin of error (half-width of the 95% confidence interval [CI]) of the percentage of agreement (exact and within I mm) for both intra- and interexaminer reliability assessments. Methods: An experienced standard examiner (S) trained three dental hygienists (A, B, and C) in correct procedures for obtaining a variety of periodontal measures. Duplicate measurements of probing depth (PD [mm]) and the free gingival margin to the cemento-enamel junction (CEJ-GM [mm]) were obtained in a pilot study to design a formal examiner calibration study, where sample sizes were adjusted for the effects of within-subject clustering of binary indices of agreement. Results: Within-subject clustering of agreement indices resulted in an approximate four-fold increase in the variance of the estimates of percentage of agreement with the standard. PD and CEJ-GM percentage of exact agreement measurements (95% CI) for each examiner-standard pair, respectively, were as follows: AS = 55% (48%, 61%) and 70% (62%, 78%); BS = 52% (45%, 59%) and 73% (63%, 82%); and CS = 55% (50%, 61%) and 72% (65%, 79%). The corresponding 95% CIs unadjusted for the effects of clustering underestimated the margin of error associated with the estimates of exact agreement by as much as 57% for PD and 68% for CEJ-GM. Conclusion: Failure to account for dependence among site-level agreement indices results in a false sense of precision in the resulting reliability estimates and can lead to faulty inference.
引用
收藏
页码:1129 / 1141
页数:13
相关论文
共 31 条
[1]   REPRODUCIBILITY OF PROBING ATTACHMENT LEVEL MEASUREMENTS [J].
BADERSTEN, A ;
NILVEUS, R ;
EGELBERG, J .
JOURNAL OF CLINICAL PERIODONTOLOGY, 1984, 11 (07) :475-485
[2]   INTRACLASS CORRELATION COEFFICIENT AS A MEASURE OF RELIABILITY [J].
BARTKO, JJ .
PSYCHOLOGICAL REPORTS, 1966, 19 (01) :3-&
[3]   ON THE CHOICE OF COMPUTATIONAL UNIT IN STATISTICAL-ANALYSIS [J].
BLOMQVIST, N .
JOURNAL OF CLINICAL PERIODONTOLOGY, 1985, 12 (10) :873-876
[4]  
Cicchetti D. V., 1971, American Journal of EEG Technology, V11, P101
[5]  
COCHRAN WG, 1977, SAMPLING TECHNIQUES, P240
[7]   A COEFFICIENT OF AGREEMENT FOR NOMINAL SCALES [J].
COHEN, J .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1960, 20 (01) :37-46
[8]   EQUIVALENCE OF WEIGHTED KAPPA AND INTRACLASS CORRELATION COEFFICIENT AS MEASURES OF RELIABILITY [J].
FLEISS, JL ;
COHEN, J .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1973, 33 (03) :613-619
[9]   A STUDY OF INTER-EXAMINER AND INTRA-EXAMINER RELIABILITY OF POCKET DEPTH AND ATTACHMENT LEVEL [J].
FLEISS, JL ;
MANN, J ;
PAIK, M ;
GOULTCHIN, J ;
CHILTON, NW .
JOURNAL OF PERIODONTAL RESEARCH, 1991, 26 (02) :122-128
[10]   A RE-EXAMINATION OF WITHIN-MOUTH CORRELATIONS OF ATTACHMENT LEVEL AND OF CHANGE IN ATTACHMENT LEVEL [J].
FLEISS, JL ;
WALLENSTEIN, S ;
CHILTON, NW ;
GOODSON, JM .
JOURNAL OF CLINICAL PERIODONTOLOGY, 1988, 15 (07) :411-414