Identification of randomized controlled trials in systematic reviews: accuracy and reliability of screening records

被引:222
作者
Edwards, P
Clarke, M
DiGuiseppi, C
Pratap, S
Roberts, I
Wentz, R
机构
[1] London Sch Hyg & Trop Med, Dept Epidemiol & Populat Hlth, Publ Hlth Intervent Res Unit, London WC1B 3DP, England
[2] UK Cochrane Ctr, NHS R&d Programme, Oxford OX2 7LG, England
[3] Univ Colorado, Hlth Sci Ctr, Dept Prevent Med & Biometr, Denver, CO 80262 USA
[4] London Sch Hyg & Trop Med, Dept Epidemiol & Populat Hlth, Publ Hlth Intervent Res Unit, Cochrane Injuries Grp, London WC1B 3DP, England
关键词
systematic reviews; screening; inter-observer reliability; ascertainment intersection;
D O I
10.1002/sim.1190
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
A Study was conducted to estimate the accuracy and reliability of reviewers when screening records for relevant trials for a systematic review. A sensitive search of ten electronic bibliographic databases yielded 22571 records of potentially relevant trials. Records were allocated to four reviewers such that two reviewers examined each record and so that identification of trials by each reviewer could be compared with those identified by each of the other reviewers. Agreement between reviewers was assessed using Cohen's kappa statistic, Ascertainment intersection methods were used to estimate the likely number of trials missed by reviewers. Full copies of reports were obtained and assessed independently by two researchers for eligibility for the review. Eligible reports formed the 'gold standard' against which an assessment was made about the accuracy of screening by reviewers. After screening, 301 of 22571 records were identified by at least one reviewer as potentially relevant. Agreement was 'almost perfect' (kappa > 0.8) within two pairs, 'substantial' (kappa > 0.6) within three pairs and 'moderate' (kappa > 0.4) within one pair. Of the 301 records selected, 273 complete reports were available. When pairs of reviewers agreed on the potential relevance of records, 81 per cent were eligible (range 69 to 91 per cent). If reviewers disagreed, 22 per cent were eligible (range 12 to 45 per cent). Single reviewers missed on average 8 per cent of eligible reports (range 0 to 24 per cent), whereas pairs of reviewers did not miss any (range 0 to I per cent), The use of two reviewers to screen records increased the number of randomized trials identified by an average of 9 per cent (range 0 to 32 per cent). Reviewers can reliably identify potentially relevant records when screening thousands of records for eligibility. Two reviewers should screen records for eligibility, whenever possible, in order to maximize ascertainment of relevant trials. Copyright (C) 2002 John Wiley Sons, Ltd.
引用
收藏
页码:1635 / 1640
页数:6
相关论文
共 12 条
[1]  
CASTRO AA, 1998, 6 ANN COCHR C ABSTR
[2]   SYSTEMATIC REVIEWS - OBTAINING DATA FROM RANDOMIZED CONTROLLED TRIALS - HOW MUCH DO WE NEED FOR RELIABLE AND INFORMATIVE METAANALYSES [J].
CLARKE, MJ ;
STEWART, LA .
BRITISH MEDICAL JOURNAL, 1994, 309 (6960) :1007-1010
[3]   A COEFFICIENT OF AGREEMENT FOR NOMINAL SCALES [J].
COHEN, J .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1960, 20 (01) :37-46
[4]  
EDWARDS P, 2000, COCHRANE LIB
[5]  
FLEISS JL, 1981, STAT METHODS RATES P, P220
[6]   EFFECT OF VARIATION IN PROBABILITY OF ASCERTAINMENT BY SOURCES (VARIABLE CATCHABILITY) UPON CAPTURE-RECAPTURE ESTIMATES OF PREVALENCE [J].
HOOK, EB ;
REGAL, RR .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 1993, 137 (10) :1148-1166
[7]   THE VALUE OF CAPTURE-RECAPTURE METHODS EVEN FOR APPARENT EXHAUSTIVE SURVEYS - THE NEED FOR ADJUSTMENT FOR SOURCE OF ASCERTAINMENT INTERSECTION IN ATTEMPTED COMPLETE PREVALENCE STUDIES [J].
HOOK, EB ;
REGAL, RR .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 1992, 135 (09) :1060-1067
[8]   MEASUREMENT OF OBSERVER AGREEMENT FOR CATEGORICAL DATA [J].
LANDIS, JR ;
KOCH, GG .
BIOMETRICS, 1977, 33 (01) :159-174
[9]   MISINTERPRETATION AND MISUSE OF THE KAPPA-STATISTIC [J].
MACLURE, M ;
WILLETT, WC .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 1987, 126 (02) :161-169
[10]  
REGAL RR, 1984, STAT MED, V3, P288