Reliability of two instruments for critical assessment of economic evaluations

被引:27
作者
Au, Flora [2 ]
Prahardhi, Shirlina [2 ]
Shiell, Alan [1 ]
机构
[1] Univ Calgary, Populat Hlth Intervent Res Ctr, Calgary, AB T2N 4N1, Canada
[2] Univ Calgary, Dept Community Hlth Sci, Ctr Hlth & Policy Studies, Calgary, AB, Canada
关键词
critical appraisal; economic evaluation; generalizability theory; reliability;
D O I
10.1111/j.1524-4733.2007.00255.x
中图分类号
F [经济];
学科分类号
02 [经济学];
摘要
Objective: To assess the reliability of two instruments designed for critical appraisal of economic evaluations: the Quality of Health Economic Studies (QHES) scale and the Pediatric Quality Appraisal Questionnaire (PQAQ). Methods: Thirty published articles were chosen at random from a recent bibliography of economic evaluations in health promotion. The quality of each of these studies was assessed independently by two raters using each of the two instruments. Inter-rater reliability and the agreement between the instruments were measured using an intraclass correlation coefficient (ICC). Cronbach's generalizability theory was also used to assess the sources of variation in quality scores of the studies and to indicate where improvements in reliability could best be made. Results: Inter-rater reliability was excellent for both instruments (ICC = 0.81 for the QHES and 0.80 for the PQAQ).Agreement between the instruments varied (ICC = 0.77 for rater 1 and 0.56 for rater 2). The biggest source of variation in the scores assigned to the articles was the quality of the study (56% of total variance). Conventional measurement error explained 31% of the total variance. Variation due to rater (< 0.1%) and measurement instrument (1.8%) was very low. Conclusions: The results suggest that the two instruments perform equally well. Choice of instrument can therefore be based on other criteria-simplicity and speed of application in the case of one, and detail in the information provided in the case of the other. There is little improvement in reliability to be gained from using more than one rater or more than one assessment of quality.
引用
收藏
页码:435 / 439
页数:5
相关论文
共 29 条
[1]
Development and validation of a grading system for the quality of cost-effectiveness studies [J].
Chiou, CF ;
Hay, JW ;
Wallace, JF ;
Bloom, BS ;
Neumann, PJ ;
Sullivan, SD ;
Yu, HT ;
Keeler, EB ;
Henning, JM ;
Ofman, JJ .
MEDICAL CARE, 2003, 41 (01) :32-44
[2]
THEORY OF GENERALIZABILITY - A LIBERALIZATION OF RELIABILITY THEORY [J].
CRONBACH, LJ ;
RAJARATNAM, N ;
GLESER, GC .
BRITISH JOURNAL OF STATISTICAL PSYCHOLOGY, 1963, 16 (02) :137-163
[3]
CURTISS FR, 2003, J MANAGE CARE PHARM, V9, P93
[4]
Guidelines for authors and peer reviewers of economic submissions to the BMJ [J].
Drummond, MF ;
Jefferson, TO .
BRITISH MEDICAL JOURNAL, 1996, 313 (7052) :275-283
[5]
Criteria list for assessment of methodological quality of economic evaluations: Consensus on Health Economic Criteria [J].
Evers, S ;
Goossens, M ;
de Vet, H ;
van Tulder, M ;
Ament, A .
INTERNATIONAL JOURNAL OF TECHNOLOGY ASSESSMENT IN HEALTH CARE, 2005, 21 (02) :240-245
[6]
A tool to improve qualify of reporting published economic analyses [J].
Gerard, K ;
Seymour, J ;
Smoker, I .
INTERNATIONAL JOURNAL OF TECHNOLOGY ASSESSMENT IN HEALTH CARE, 2000, 16 (01) :100-110
[7]
GENERALIZABILITY OF SCORES INFLUENCED BY MULTIPLE SOURCES OF VARIANCE [J].
GLESER, GC ;
CRONBACH, LJ ;
RAJARATNAM, N .
PSYCHOMETRIKA, 1965, 30 (04) :395-418
[8]
Developing a scoring system to guality assess economic evaluations [J].
Gonzalez-Perez J.G. .
The European Journal of Health Economics, 2002, 3 (2) :131-136
[9]
MEASUREMENT OF OBSERVER AGREEMENT FOR CATEGORICAL DATA [J].
LANDIS, JR ;
KOCH, GG .
BIOMETRICS, 1977, 33 (01) :159-174
[10]
STATISTICAL EVALUATION OF AGREEMENT BETWEEN 2 METHODS FOR MEASURING A QUANTITATIVE VARIABLE [J].
LEE, J ;
KOH, D ;
ONG, CN .
COMPUTERS IN BIOLOGY AND MEDICINE, 1989, 19 (01) :61-70