Testing the Newcastle Ottawa Scale showed low reliability between individual reviewers

被引:319
作者
Hartling, Lisa [1 ,2 ]
Milne, Andrea [1 ,2 ]
Hamm, Michele P. [1 ,2 ]
Vandermeer, Ben [1 ,2 ]
Ansari, Mohammed [3 ]
Tsertsvadze, Alexander [4 ,5 ]
Dryden, Donna M. [1 ,2 ]
机构
[1] Univ Alberta, Alberta Res Ctr Hlth Evidence, Dept Pediat, Edmonton, AB T5G 1C9, Canada
[2] Univ Alberta, Evidence Based Practice Ctr, Edmonton, AB T5G 1C9, Canada
[3] Univ Ottawa, Clin Epidemiol Program, Evidence Based Practice Ctr, Ottawa Methods Ctr,Ottawa Hosp Res Inst, Ottawa, ON, Canada
[4] Univ Ottawa, Evidence Based Practice Ctr, Ottawa, ON, Canada
[5] Ottawa Hosp Res Inst, Ctr Practice Changing Res, Ottawa, ON, Canada
关键词
Methodological quality; Internal validity; Reliability; Validity; Systematic reviews; Cohort studies; RANDOMIZED CONTROLLED-TRIALS; LOW-BIRTH-WEIGHT; PRETERM BIRTH; QUALITY; RISK; METAANALYSIS;
D O I
10.1016/j.jclinepi.2013.03.003
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Objectives: To assess inter-rater reliability and validity of the Newcastle Ottawa Scale (NOS) used for methodological quality assessment of cohort studies included in systematic reviews. Study Design and Setting: Two reviewers independently applied the NOS to 131 cohort studies included in eight meta-analyses. Inter-rater reliability was calculated using kappa (kappa) statistics. To assess validity, within each meta-analysis, we generated a ratio of pooled estimates for each quality domain. Using a random-effects model, the ratios of odds ratios for each meta-analysis were combined to give an overall estimate of differences in effect estimates. Results: Inter-rater reliability varied from substantial for length of follow-up (kappa = 0.68, 95% confidence interval [CI] = 0.47, 0.89) to poor for selection of the nonexposed cohort and demonstration that the outcome was not present at the outset of the study (kappa = -0.03, 95% CI = -0.06, 0.00; kappa = -0.06, 95% CI = -0.20, 0.07). Reliability for overall score was fair (kappa = 0.29, 95% CI = 0.10, 0.47). In general, reviewers found the tool difficult to use and the decision rules vague even with additional information provided as part of this study. We found no association between individual items or overall score and effect estimates. Conclusion: Variable agreement and lack of evidence that the NOS can identify studies with biased results underscore the need for revisions and more detailed guidance for systematic reviewers using the NOS. (C) 2013 Elsevier Inc. All rights reserved.
引用
收藏
页码:982 / 993
页数:12
相关论文
共 31 条
[1]  
Agresti A, 2013, Categorical data analysis, V3rd
[2]   A review and meta-analysis of prospective studies of red and processed meat intake and prostate cancer [J].
Alexander, Dominik D. ;
Mink, Pamela J. ;
Cushing, Colleen A. ;
Sceurman, Bonnie .
NUTRITION JOURNAL, 2010, 9
[3]  
[Anonymous], J CRIT CARE
[4]  
[Anonymous], DEV TOOL EVALUATE QU
[5]  
[Anonymous], 2017, The Newcastle-Ottawa Scale (NOS) for assessing the quality of nonrandomised studies in meta-analyses
[6]   Statins, incident Alzheimer disease, change in cognitive function, and neuropathology [J].
Arvanitakis, Z. ;
Schneider, J. A. ;
Wilson, R. S. ;
Bienias, J. L. ;
Kelly, J. F. ;
Evans, D. A. ;
Bennett, D. A. .
NEUROLOGY, 2008, 70 (19) :1795-1802
[7]   Seventy-Five Trials and Eleven Systematic Reviews a Day: How Will We Ever Keep Up? [J].
Bastian, Hilda ;
Glasziou, Paul ;
Chalmers, Iain .
PLOS MEDICINE, 2010, 7 (09)
[8]   The feasibility of creating a checklist for the assessment of the methodological quality both of randomised and non-randomised studies of health care interventions [J].
Downs, SH ;
Black, N .
JOURNAL OF EPIDEMIOLOGY AND COMMUNITY HEALTH, 1998, 52 (06) :377-384
[9]  
Egger M, 2003, Health Technol Assess, V7, P1
[10]   Association between unreported outcomes and effect size estimates in Cochrane meta-analyses [J].
Furukawa, Toshi A. ;
Watanabe, Norio ;
Omori, Ichiro M. ;
Montori, Victor M. ;
Guyatt, Gordon H. .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2007, 297 (05) :468-470