Assessing intrarater, interrater and test-retest reliability of continuous measurements

被引:295
作者
Rousson, V [1 ]
Gasser, T [1 ]
Seifert, B [1 ]
机构
[1] Univ Zurich, Dept Biostat, Inst Social & Prevent Med, CH-8006 Zurich, Switzerland
关键词
reliability; intrarater data; interrater data; test-retest data; learning effect; intraclass correlation; limits of agreement;
D O I
10.1002/sim.1253
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In this paper we review the problem of defining and estimating intrarater, interrater and test-retest reliability of continuous measurements. We argue that the usual notion of product-moment correlation is well adapted in a test-retest situation, whereas the concept of intraclass correlation should be used for intrarater and interrater reliability. The key difference between these two approaches is the treatment of systematic error, which is often due to a learning effect for test-retest data. We also consider the reliability of a sum and a difference of variables and illustrate the effects on components. Further, we compare these approaches of reliability with the concept of limits of agreement proposed by Bland and Altman (for evaluating the agreement between two methods of clinical measurements) and show how product-moment correlation is related to it. We then propose new kinds of limits of agreement which are related to intraclass correlation. A test battery to study the development of neuro-motor functions in children and adolescents illustrates our purpose throughout the paper. Copyright (C) 2002 John Wiley Sons, Ltd.
引用
收藏
页码:3431 / 3446
页数:16
相关论文
共 14 条
[1]   INTRACLASS CORRELATION COEFFICIENT AS A MEASURE OF RELIABILITY [J].
BARTKO, JJ .
PSYCHOLOGICAL REPORTS, 1966, 19 (01) :3-&
[2]   A NOTE ON THE USE OF THE INTRACLASS CORRELATION-COEFFICIENT IN THE EVALUATION OF AGREEMENT BETWEEN 2 METHODS OF MEASUREMENT [J].
BLAND, JM ;
ALTMAN, DG .
COMPUTERS IN BIOLOGY AND MEDICINE, 1990, 20 (05) :337-340
[3]   STATISTICAL METHODS FOR ASSESSING AGREEMENT BETWEEN TWO METHODS OF CLINICAL MEASUREMENT [J].
BLAND, JM ;
ALTMAN, DG .
LANCET, 1986, 1 (8476) :307-310
[4]   APPROXIMATE INTERVAL ESTIMATION FOR A CERTAIN INTRACLASS CORRELATION-COEFFICIENT [J].
FLEISS, JL ;
SHROUT, PE .
PSYCHOMETRIKA, 1978, 43 (02) :259-262
[5]  
KOTZ L, 1988, ENCY STAT SCI, V8, P584
[6]   Neuromotor development from 5 to 18 years. Part 1: timed performance [J].
Largo, RH ;
Caflisch, JA ;
Hug, F ;
Muggli, K ;
Molnar, AA ;
Molinari, L ;
Sheehy, A ;
Gasser, T .
DEVELOPMENTAL MEDICINE AND CHILD NEUROLOGY, 2001, 43 (07) :436-443
[7]   STATISTICAL EVALUATION OF AGREEMENT BETWEEN 2 METHODS FOR MEASURING A QUANTITATIVE VARIABLE [J].
LEE, J ;
KOH, D ;
ONG, CN .
COMPUTERS IN BIOLOGY AND MEDICINE, 1989, 19 (01) :61-70
[8]  
Pearson K., 1899, PHILOS T R SOC A, V192, P257, DOI [DOI 10.1098/RSTA.1899.0006, 10.1098/rsta.1899.0006]
[9]   RELIABILITY FORMULAS FOR INDEPENDENT DECISION DATA WHEN RELIABILITY DATA ARE MATCHED [J].
RAJARATNAM, N .
PSYCHOMETRIKA, 1960, 25 (03) :261-271
[10]   AN APPROXIMATE DISTRIBUTION OF ESTIMATES OF VARIANCE COMPONENTS [J].
SATTERTHWAITE, FE .
BIOMETRICS BULLETIN, 1946, 2 (06) :110-114