Confidence Bounds and Power for the Reliability of Observational Measures on the Quality of a Social Setting

被引：8

作者：

Shin, Yongyun ^{[1
]}

Raudenbush, Stephen W. ^{[2
]}

机构：

[1] Virginia Commonwealth Univ, Dept Biostat, Richmond, VA 23298 USA

[2] Univ Chicago, Dept Sociol, Chicago, IL 60637 USA

来源：

PSYCHOMETRIKA | 2012年 / 77卷 / 03期

关键词：

confidence interval; D study; G study; power; reliability; teaching quality;

D O I：

10.1007/s11336-012-9266-4

中图分类号：

O1 [数学];

学科分类号：

0701 ; 070101 ;

摘要：

Social scientists are frequently interested in assessing the qualities of social settings such as classrooms, schools, neighborhoods, or day care centers. The most common procedure requires observers to rate social interactions within these settings on multiple items and then to combine the item responses to obtain a summary measure of setting quality. A key aspect of the quality of such a summary measure is its reliability. In this paper we derive a confidence interval for reliability, a test for the hypothesis that the reliability meets a minimum standard, and the power of this test against alternative hypotheses. Next, we consider the problem of using data from a preliminary field study of the measurement procedure to inform the design of a later study that will test substantive hypotheses about the correlates of setting quality. The preliminary study is typically called the "generalizability study" or "G study" while the later, substantive study is called the "decision study" or "D study." We show how to use data from the G study to estimate reliability, a confidence interval for the reliability, and the power of tests for the reliability of measurement produced under alternative designs for the D study. We conclude with a discussion of sample size requirements for G studies.

引用

页码：543 / 560

页数：18

共 11 条

[1] The national randomized field trial of success for all: Second-year outcomes [J].

Borman, Geoffrey D. ;

Slavin, Robert E. ;

Cheung, Alan C. K. ;

Chamberlain, Anne M. ;

Madden, Nancy A. ;

Chambers, Bette .

AMERICAN EDUCATIONAL RESEARCH JOURNAL, 2005, 42 (04) :673-696

[2]

Brennan R.L., 2001, GENERALIZABILITY THE

[3]

Burdick R.K., 1992, Confidence intervals on variance components

[4]

Hirsch B.J., 2005, HDB YOUTH MENTORING, P364

[5]

Kinzie M., 2005, P WORLD C E LEARN CO, P814

[6] The classroom assessment scoring system: Findings from the prekindergarten year [J].

La Paro, KM ;

Pianta, RC ;

Stuhlman, M .

ELEMENTARY SCHOOL JOURNAL, 2004, 104 (05) :409-426

[7]

PIANTA R, 2005, APPL DEV SCI, V0009

[8]

Raudenbush S.W., 2010, STUDYING RELIABILITY

[9]

Raudenbush S.W., 2008, Journal of Research on Educational Effectiveness, V1, P138, DOI [https://doi.org/10.1080/19345740801982104, DOI 10.1080/19345740801982104]

[10] A Latent Cluster-Mean Approach to the Contextual Effects Model With Missing Data [J].

Shin, Yongyun ;

Raudenbush, Stephen W. .

JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICS, 2010, 35 (01) :26-53

← 1 2 →