What is an intracluster correlation coefficient? Crucial concepts for primary care researchers

被引:438
作者
Killip, S
Mahfoud, Z
Pearce, K
机构
[1] Univ Kentucky, Dept Family Practice & Community Med, Lexington, KY USA
[2] Univ Kentucky, Dept Stat, Lexington, KY 40506 USA
关键词
statistics; cluster analysis; data interpretation; research design; primary care; practice-based research; methods/quantitative; theory;
D O I
10.1370/afm.141
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
BACKGROUND Primary care research often involves clustered samples in which subjects are randomized at a group level but analyzed at an individual level. Analyses that do not take this clustering into account may report significance where none exists. This article explores the causes, consequences, and implications of cluster data. METHODS Using a case study with accompanying equations, we show that clustered samples are not as statistically efficient as simple random samples. RESULTS Similarity among subjects within preexisting groups or clusters reduces the variability of responses in a clustered sample, which erodes the power to detect true differences between study arms. This similarity is expressed by the intracluster correlation coefficient, or p (rho), which compares the within-group variance with the between-group variance. Rho is used in equations along with the cluster size and the number of clusters to calculate the effective sample size (ESS) in a clustered design. The ESS should be used to calculate power in the design phase of a clustered study. Appropriate accounting for similarities among subjects in a cluster almost always results in a net loss of power, requiring increased total subject recruitment. Increasing the number of clusters enhances power more efficiently than does increasing the number of subjects within a cluster. CONCLUSIONS Primary care research frequently uses clustered designs, whether consciously or unconsciously. Researchers must recognize and understand the implications of clusters to avoid costly sample size errors.
引用
收藏
页码:204 / 208
页数:5
相关论文
共 4 条
[1]  
DONNER A, 2000, DESIGN ANAL CLUSTER, V9, P112
[2]   Intraclass correlation among measures related to tobacco use by adolescents: Estimates, correlates, and applications in intervention studies [J].
Murray, DM ;
Short, BJ .
ADDICTIVE BEHAVIORS, 1997, 22 (01) :1-12
[3]   INTRACLASS CORRELATION AMONG COMMON MEASURES OF ADOLESCENT SMOKING - ESTIMATES, CORRELATES, AND APPLICATIONS IN SMOKING PREVENTION STUDIES [J].
MURRAY, DM ;
ROONEY, BL ;
HANNAN, PJ ;
PETERSON, AV ;
ARY, DV ;
BIGLAN, A ;
BOTVIN, GJ ;
EVANS, RI ;
FLAY, BR ;
FUTTERMAN, R ;
GETZ, JG ;
MAREK, PM ;
ORLANDI, M ;
PENTZ, MA ;
PERRY, CL ;
SCHINKE, SP .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 1994, 140 (11) :1038-1050
[4]   INTRACLASS CORRELATION AMONG MEASURES RELATED TO ALCOHOL-USE BY YOUNG-ADULTS - ESTIMATES, CORRELATES AND APPLICATIONS IN INTERVENTION STUDIES [J].
MURRAY, DM ;
SHORT, B .
JOURNAL OF STUDIES ON ALCOHOL, 1995, 56 (06) :681-694