Assessing observer agreement when describing and classifying functioning with the international

被引:39
作者
Grill, Eva
Mansmann, Ulrich
Cieza, Alarcos
Stucki, Gerold
机构
[1] Univ Munich, Dept Phys Med & Rehabil, D-81377 Munich, Germany
[2] Univ Munich, Dept Med Informat Iometry & Epidemiol, D-81377 Munich, Germany
[3] Univ Munich, Inst Hlth & Rehabil, WHO FIC Collaborating Ctr DIMDI, ICF Res Branch, D-81377 Munich, Germany
关键词
reproducibility of results; rehabilitation; rater agreement; ICF; log linear models;
D O I
10.2340/16501977-0016
中图分类号
R49 [康复医学];
学科分类号
100215 ;
摘要
Objective: The International Classification of Functioning, Disability and Health (ICF) is used increasingly to describe and classify functioning in medicine without being a psychometrically sound measure. All categories of the ICF are quantified using the same generic 0-4 scale. The objective of this study was to assess observer agreement when describing and classifying functioning with the ICF. Design: A second-level category of the ICF, d430 lifting and carrying objects, was used as an example. To the qualifiers of this category, clinically meaningful definitions were assigned. Data were collected in a cross-sectional survey with repeated measurement. We report raw, specific and chance-corrected measures or agreement, a graphical method and the results of log-linear models for ordinal agreement. Subjects/patients: A convenience sample of patients requiring physical therapy in an acute hospital. Results: Twenty-five patients were assessed twice by 2 observers. Raw agreement was 0.52. Kappa was 0.36, indicating fair agreement. Different hierarchical log-linear models indicated that the strength of agreement was not homogeneous over all categories. Conclusion: Observer agreement has to be evaluated when describing and classifying functioning using the ICF Qualifiers' scale. When assessing inter-observer reliability, the first step is to calculate a summary statistic. Modelling agreement yields valuable insight into the structure of a contingency table, which can lead to further improvement of the scale.
引用
收藏
页码:71 / 76
页数:6
相关论文
共 36 条
[31]   SEPARATION OF SYSTEMATIC AND RANDOM DIFFERENCES IN ORDINAL RATING-SCALES [J].
SVENSSON, E ;
HOLM, S .
STATISTICS IN MEDICINE, 1994, 13 (23-24) :2437-2453
[32]   MODELING ORDINAL SCALE DISAGREEMENT [J].
TANNER, MA ;
YOUNG, MA .
PSYCHOLOGICAL BULLETIN, 1985, 98 (02) :408-415
[33]   LATENT CLASS ANALYSIS OF DIAGNOSTIC AGREEMENT [J].
UEBERSAX, JS ;
GROVE, WM .
STATISTICS IN MEDICINE, 1990, 9 (05) :559-572
[34]   Comments from WHO for the Journal of Rehabilitation Medicine special supplement on ICF core sets [J].
Üstün, B ;
Chatterji, S ;
Kostanjsek, N .
JOURNAL OF REHABILITATION MEDICINE, 2004, 36 :7-8
[35]   ESTIMATION OF TEST ERROR RATES, DISEASE PREVALENCE AND RELATIVE RISK FROM MISCLASSIFIED DATA - A REVIEW [J].
WALTER, SD ;
IRWIG, LM .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 1988, 41 (09) :923-937
[36]  
World Health Organisation, 2001, INT CLASS FUNCT DIS