Statistical methods for assessing measurement error (reliability) in variables relevant to sports medicine

被引:2789
作者
Atkinson, G [1 ]
Nevill, AM [1 ]
机构
[1] Liverpool John Moores Univ, Res Inst Sport & Exercise Sci, Liverpool L3 2ET, Merseyside, England
关键词
D O I
10.2165/00007256-199826040-00002
中图分类号
G8 [体育];
学科分类号
04 ; 0403 ;
摘要
Minimal measurement error (reliability) during the collection of interval- and ratio-type data is critically important to sports medicine research. The main components of measurement error are systematic bias (e.g. general learning or fatigue effects on the tests) and random error due to biological or mechanical variation. Both error components should be meaningfully quantified for the sports physician to relate the described error to judgements regarding 'analytical goals' (the requirements of the measurement tool for effective practical use) rather than the statistical significance of any reliability indicators. Methods based on correlation coefficients and regression provide an indication of 'relative reliability'. Since these methods are highly influenced by the range of measured values, researchers should be cautious in: (i) concluding accepting able relative reliability even if a correlation is above 0.9; (ii) extrapolating the results of a test-retest correlation to a new sample of individuals involved in an experiment; and (iii) comparing test-retest correlations between different reliability studies. Methods used to describe 'absolute reliability' include the standard error of measurements (SEM), coefficient of variation (CV) and limits of agreement (LOA). These statistics are more appropriate for comparing reliability between different measurement tools in different studies. They can be used in multiple retest studies from ANOVA procedures, help predict the magnitude of a 'real' change in individual athletes and be employed to estimate statistical power for a repeated-measures experiment. These methods vary considerably in the way they are calculated and their use also assumes the presence (CV) or absence (SEM) of heteroscedasticity. Most methods of calculating SEM and CV represent approximately 68% of the error that is actually present in the repeated measurements for the 'average' individual in the sample. LOA represent the test-retest differences for 95% of a population. The associated Bland-Altman plot shows the measurement error schematically and helps to identify the presence of heteroscedasticity. If there is evidence of heteroscedasticity or non-normality, one should logarithmically transform the data and quote the bias and random error as ratios. This allows simple comparisons of reliability across different measurement tools. It is recommended that sports clinicians and researchers should cite and interpret a number of statistical methods for assessing reliability. We encourage the inclusion of the LOA method, especially the exploration of heteroscedasticity that is inherent in this analysis. We also stress the importance of relating the results of any reliability statistic to 'analytical goals' in sports medicine.
引用
收藏
页码:217 / 238
页数:22
相关论文
共 92 条
  • [51] ASSESSING AGREEMENT BETWEEN CLINICAL MEASUREMENT METHODS
    LIEHR, P
    DEDO, YL
    TORRES, S
    MEININGER, JC
    [J]. HEART & LUNG, 1995, 24 (03): : 240 - 245
  • [52] A CONCORDANCE CORRELATION-COEFFICIENT TO EVALUATE REPRODUCIBILITY
    LIN, LI
    [J]. BIOMETRICS, 1989, 45 (01) : 255 - 268
  • [53] ANALYSIS OF SERIAL MEASUREMENTS IN MEDICAL-RESEARCH
    MATTHEWS, JNS
    ALTMAN, DG
    CAMPBELL, MJ
    ROYSTON, P
    [J]. BRITISH MEDICAL JOURNAL, 1990, 300 (6719) : 230 - 235
  • [54] Matthews JNS, 1997, STAT MED, V16, P705
  • [55] Morrow J.R., 1989, Measurement concepts in physical education and exercise science, P73
  • [56] HOW SIGNIFICANT IS YOUR RELIABILITY
    MORROW, JR
    JACKSON, AW
    [J]. RESEARCH QUARTERLY FOR EXERCISE AND SPORT, 1993, 64 (03) : 352 - 355
  • [57] MORROW JR, 1995, MEASUREMENT EVALUATI
  • [58] A CRITICAL DISCUSSION OF INTRACLASS CORRELATION-COEFFICIENTS
    MULLER, R
    BUTTNER, P
    [J]. STATISTICS IN MEDICINE, 1994, 13 (23-24) : 2465 - 2476
  • [59] Relative and absolute reliability of the KT-2000 arthrometer for uninjured knees - Testing at 67, 89, 134, and 178 N and manual maximum forces
    Myrer, JW
    Schulthies, SS
    Fellingham, GW
    [J]. AMERICAN JOURNAL OF SPORTS MEDICINE, 1996, 24 (01) : 104 - 108
  • [60] Nevill A, 1997, J SPORT SCI, V15, P457