The meaning of kappa: Probabilistic concepts of reliability and validity revisited

Cited by: 90
Author
Guggenmoos-Holzmann, I
Affiliations
[1] Inst. of Med. Stat. and Info. Sci., Freie Universität Berlin
[2] Inst. f. Med. Stat. und I., Universitätsklinikum Benjamin F., Freie Universität Berlin, D-12200 Berlin
Keywords
diagnostic test; reliability; validity; kappa; chance-corrected agreement; chance-corrected validity;
DOI
10.1016/0895-4356(96)00011-X
Chinese Library Classification
R19 [Health care organization and services (health services management)];
Abstract
A framework, the "agreement concept," is developed to study the use of Cohen's kappa as well as alternative measures of chance-corrected agreement in a unified manner. Focusing on intrarater consistency, it is demonstrated that for 2 × 2 tables an adequate choice among different measures of chance-corrected agreement can be made only if the characteristics of the observational setting are taken into account. In particular, naive use of Cohen's kappa may lead to strikingly overoptimistic estimates of chance-corrected agreement. Such bias can be overcome by more elaborate study designs that allow for unrestricted estimation of the probabilities at issue. When Cohen's kappa is appropriately applied as a measure of chance-corrected agreement, its values prove to be a linear, not a parabolic, function of true prevalence. It is further shown how the validity of ratings is influenced by lack of consistency. Depending on the design of a validity study, this may lead, on purely formal grounds, to prevalence-dependent estimates of sensitivity and specificity. Proposed formulas for "chance-corrected" validity indexes fail to adjust for this phenomenon.
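For orientation, the coefficient at issue is Cohen's kappa; a minimal sketch of its standard textbook definition for a 2 × 2 agreement table (Cohen, 1960) is given below. The notation p_o, p_e, and p_{ij} is generic and is not taken from the paper's "agreement concept" framework or its specific estimators.

\[
\kappa \;=\; \frac{p_o - p_e}{1 - p_e},
\qquad
p_o = p_{11} + p_{00},
\qquad
p_e = p_{1\cdot}\,p_{\cdot 1} + p_{0\cdot}\,p_{\cdot 0},
\]

where p_{ij} is the probability of cell (i, j) in the 2 × 2 table of two ratings of the same subject, p_{i\cdot} and p_{\cdot j} are its row and column marginals, p_o is the observed agreement, and p_e is the agreement expected by chance under independent marginals.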
Pages: 775-782
Number of pages: 8