Undesired variance due to examiner stringency/leniency effect in communication skill scores assessed in OSCEs

被引:86
作者
Harasym, Peter H. [2 ]
Woloschuk, Wayne [1 ]
Cunning, Leslie [3 ]
机构
[1] Univ Calgary, Fac Med, Calgary, AB T2N 4N1, Canada
[2] Univ Calgary, Dept Community Hlth Sci, Calgary, AB T2N 4N1, Canada
[3] Univ Calgary, Dept Family Med, Calgary, AB T2N 4N1, Canada
关键词
Communication skills; Error variance; Item response theory; Leniency; OSCE; Rasch; Stringency; True score;
D O I
10.1007/s10459-007-9068-0
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Physician-patient communication is a clinical skill that can be learned and has a positive impact on patient satisfaction and health outcomes. A concerted effort at all medical schools is now directed at teaching and evaluating this core skill. Student communication skills are often assessed by an Objective Structure Clinical Examination (OSCE). However, it is unknown what sources of error variance are introduced into examinee communication scores by various OSCE components. This study primarily examined the effect different examiners had on the evaluation of students' communication skills assessed at the end of a family medicine clerkship rotation. The communication performance of clinical clerks from Classes 2005 and 2006 were assessed using six OSCE stations. Performance was rated at each station using the 28-item Calgary-Cambridge guide. Item Response Theory analysis using a Multifaceted Rasch model was used to partition the various sources of error variance and generate a "true" communication score where the effects of examiner, case, and items are removed. Variance and reliability of scores were as follows: communication scores (.20 and .87), examiner stringency/leniency (.86 and .91), case (.03 and .96), and item (.86 and .99), respectively. All facet scores were reliable (.87-.99). Examiner variance (.86) was more than four times the examinee variance (.20). About 11% of the clerks' outcome status shifted using "true" rather than observed/raw scores. There was large variability in examinee scores due to variation in examiner stringency/leniency behaviors that may impact pass-fail decisions. Exploring the benefits of examiner training and employing "true" scores generated using Item Response Theory analyses prior to making pass/fail decisions are recommended.
引用
收藏
页码:617 / 632
页数:16
相关论文
共 43 条
[21]   RELIABILITY AND VALIDITY OF THE OBJECTIVE STRUCTURED CLINICAL EXAMINATION IN PEDIATRICS [J].
MATSELL, DG ;
WOLFISH, NM ;
HSU, E .
MEDICAL EDUCATION, 1991, 25 (04) :293-299
[22]  
McManus IC, 2006, BMC MED ED, V6, P1272
[23]  
Myford Carol M, 2004, J Appl Meas, V5, P189
[24]  
Myford Carol M, 2003, J Appl Meas, V4, P386
[25]  
Neuwirth Z E, 1999, J Med Pract Manage, V15, P68
[26]  
Neuwirth Z E, 1999, J Med Pract Manage, V15, P14
[27]  
Osler W, 1913, LANCET, V2, P1047
[28]  
PESKIN T, 1995, JAMA-J AM MED ASSOC, V274, P22, DOI 10.1001/jama.1995.03530010036017
[29]   AN OBJECTIVE-MEASURE OF CLINICAL-PERFORMANCE [J].
PETRUSA, ER ;
BLACKWELL, TA ;
ROGERS, LP ;
SAYDJARI, C ;
PARCEL, S ;
GUCKIAN, JC .
AMERICAN JOURNAL OF MEDICINE, 1987, 83 (01) :34-42
[30]   RELIABILITY AND VALIDITY OF AN OBJECTIVE STRUCTURED CLINICAL EXAMINATION FOR ASSESSING THE CLINICAL-PERFORMANCE OF RESIDENTS [J].
PETRUSA, ER ;
BLACKWELL, TA ;
AINSWORTH, MA .
ARCHIVES OF INTERNAL MEDICINE, 1990, 150 (03) :573-577