Are specialist certification examinations a reliable measure of physician competence?

Cited by: 11
Authors
Burch, V. C. [1]
Norman, G. R. [2]
Schmidt, H. G. [3]
van der Vleuten, C. P. M. [4]
Affiliations
[1] Univ Cape Town, Groote Schuur Hosp, Dept Med, ZA-7925 Cape Town, South Africa
[2] McMaster Univ, Programme Educ Res & Dev, Hamilton, ON, Canada
[3] Erasmus Univ, Dept Psychol, Rotterdam, Netherlands
[4] Maastricht Univ, Dept Educ Dev, Maastricht, Netherlands
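Keywords
Postgraduate certification; Licensure; Assessment; Multivariate generalizability theory; Medical education;
DOI
10.1007/s10459-007-9063-5
Chinese Library Classification (CLC) number
G40 [Education];
Discipline classification code
040101 [Principles of Education]; 120403 [Educational Economics and Management];
Abstract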
High-stakes postgraduate specialist certification examinations have considerable implications for the future careers of examinees. Medical colleges and professional boards have a social and professional responsibility to ensure that these examinations are fit for purpose. To date, there is a paucity of published data on the reliability of specialist certification examinations and on objective methods for improving them. Such data are needed to improve current assessment practices and sustain the international credibility of specialist certification processes. To determine the component and composite reliability of the Fellowship examination of the College of Physicians of South Africa, and to identify strategies for further improvement, generalizability theory and multivariate generalizability theory were used to estimate the reliability of the examination subcomponents and the overall reliability of the composite examination. Decision studies were used to identify strategies for improving the composition of the examination. Reliability coefficients of the component subtests ranged from 0.58 to 0.64. The composite reliability of the examination was 0.72. This could be increased to 0.8 by weighting all test components equally or by increasing the number of patient encounters in the clinical component of the examination. Correlations between examination components were high, suggesting that similar parameters of competence were being assessed. This composite certification examination, if equally weighted, achieved an overall reliability sufficient for high-stakes examination purposes. Increasing the weighting of the clinical component decreased the reliability. This could be rectified by increasing the number of patient encounters in the examination. Practical ways of achieving this are suggested.
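The abstract's point that more patient encounters raise reliability can be illustrated with the Spearman-Brown projection, which is the single-facet special case of a decision study. The sketch below is not the authors' multivariate analysis. The function names are hypothetical, and the starting value of 0.58 is only the lower end of the reported subtest range, used here as an assumed example rather than the actual coefficient of the clinical component.

```python
def project_reliability(rho: float, k: float) -> float:
    """Spearman-Brown projection: reliability of a test lengthened by a factor k.

    rho is the reliability of the current test; k is the ratio of new to
    current test length (e.g. k = 2 doubles the number of encounters).
    """
    return k * rho / (1 + (k - 1) * rho)


def lengthening_factor(rho: float, target: float) -> float:
    """Factor by which the test must be lengthened to reach a target reliability."""
    return target * (1 - rho) / (rho * (1 - target))


if __name__ == "__main__":
    rho = 0.58      # assumed starting reliability (illustrative only)
    target = 0.80   # threshold commonly cited for high-stakes decisions
    k = lengthening_factor(rho, target)
    print(f"Lengthening factor needed: {k:.2f}")
    print(f"Projected reliability at that length: {project_reliability(rho, k):.2f}")
```

Under these assumptions, the projection treats all encounters as interchangeable and equally informative. A full decision study would instead use the estimated variance components for each facet.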
Pages: 521-533
Page count: 13