Reliability of two instruments for critical assessment of economic evaluations

被引：27

作者：

Au, Flora ^{[2
]}

Prahardhi, Shirlina ^{[2
]}

Shiell, Alan ^{[1
]}

机构：

[1] Univ Calgary, Populat Hlth Intervent Res Ctr, Calgary, AB T2N 4N1, Canada

[2] Univ Calgary, Dept Community Hlth Sci, Ctr Hlth & Policy Studies, Calgary, AB, Canada

来源：

VALUE IN HEALTH | 2008年 / 11卷 / 03期

关键词：

critical appraisal; economic evaluation; generalizability theory; reliability;

D O I：

10.1111/j.1524-4733.2007.00255.x

中图分类号：

F [经济];

学科分类号：

02 [经济学];

摘要：

Objective: To assess the reliability of two instruments designed for critical appraisal of economic evaluations: the Quality of Health Economic Studies (QHES) scale and the Pediatric Quality Appraisal Questionnaire (PQAQ). Methods: Thirty published articles were chosen at random from a recent bibliography of economic evaluations in health promotion. The quality of each of these studies was assessed independently by two raters using each of the two instruments. Inter-rater reliability and the agreement between the instruments were measured using an intraclass correlation coefficient (ICC). Cronbach's generalizability theory was also used to assess the sources of variation in quality scores of the studies and to indicate where improvements in reliability could best be made. Results: Inter-rater reliability was excellent for both instruments (ICC = 0.81 for the QHES and 0.80 for the PQAQ).Agreement between the instruments varied (ICC = 0.77 for rater 1 and 0.56 for rater 2). The biggest source of variation in the scores assigned to the articles was the quality of the study (56% of total variance). Conventional measurement error explained 31% of the total variance. Variation due to rater (< 0.1%) and measurement instrument (1.8%) was very low. Conclusions: The results suggest that the two instruments perform equally well. Choice of instrument can therefore be based on other criteria-simplicity and speed of application in the case of one, and detail in the information provided in the case of the other. There is little improvement in reliability to be gained from using more than one rater or more than one assessment of quality.

引用

页码：435 / 439

页数：5

共 29 条

[1]

Development and validation of a grading system for the quality of cost-effectiveness studies [J].