Discussion between reviewers does not improve reliability of peer review of hospital quality

Cited by: 71
Authors
Hofer, TP
Bernstein, SJ
DeMonner, S
Hayward, RA
Affiliations
[1] Vet Affairs Ctr Practice Management & Outcomes Re, Ann Arbor, MI USA
[2] Univ Michigan, Sch Med, Dept Internal Med, Ann Arbor, MI USA
Keywords
peer review; hospital quality; quality assessment;
DOI
10.1097/00005650-200002000-00005
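Chinese Library Classification number
R19 [Health care organizations and services (health services administration)];
Discipline classification code
Abstract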
OBJECTIVES. Peer review is used to make final judgments about quality of care in many quality assurance activities. To overcome the low reliability of peer review, discussion between several reviewers is often recommended to point out overlooked information or allow for reconsideration of opinions and thus improve reliability. The authors assessed the impact of discussion between 2 reviewers on the reliability of peer review.
METHODS. A group of 13 board-certified physicians completed a total of 741 structured implicit record reviews of 95 records for patients who experienced severe adverse events related to laboratory abnormalities while in the hospital (hypokalemia, hyperkalemia, renal failure, hyponatremia, and digoxin toxicity). They independently assessed the degree to which each adverse event was caused by medical care and the quality of the care leading up to the adverse event. Working in pairs, they then discussed differences of opinion, clarified factual discrepancies, and rerated the record. The authors compared the reliability of each measure before and after discussion, and between and within pairs of reviewers, using the intraclass correlation coefficient for continuous ratings and the kappa statistic for a dichotomized rating.
RESULTS. The assessment of whether the laboratory abnormality was iatrogenic had a reliability of 0.46 before discussion and 0.71 after discussion between paired reviewers, indicating considerably improved agreement between the members of a pair. However, across reviewer pairs, the reviewer reliability was 0.36 before discussion and 0.40 after discussion. Similarly, for the rating of overall quality of care, reliability of physician review went from 0.35 before discussion to 0.58 after discussion as assessed by pair. However, across pairs the reliability increased only from 0.14 to 0.17. Even for prediscussion ratings, reliability was substantially higher between 2 members of a pair than across pairs, suggesting that reviewers who work in pairs learn to be more consistent with each other even before discussion, but this consistency also did not improve overall reliability across pairs.
CONCLUSIONS. When 2 physicians discuss a record that they are reviewing, it substantially improves the agreement between those 2 physicians. However, this improvement is illusory, as discussion does not improve the overall reliability as assessed by examining the reliability between physicians who were part of different discussions. This finding may also have implications with regard to how disagreements are resolved on consensus panels, guideline committees, and reviews of literature quality for meta-analyses.
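The abstract names two reliability statistics: the kappa statistic for a dichotomized rating and the intraclass correlation coefficient (ICC) for continuous ratings. The sketch below shows one common way to compute each. It is an illustration under stated assumptions, not the authors' analysis: the abstract does not say which ICC form was used, so the one-way random-effects form ICC(1,1) is assumed here, and the function names and sample ratings are hypothetical.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters' categorical ratings of the same records."""
    n = len(rater_a)
    # Proportion of records on which the two raters agree.
    observed = sum(x == y for x, y in zip(rater_a, rater_b)) / n
    # Agreement expected by chance from each rater's marginal frequencies.
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)
    return (observed - expected) / (1 - expected)


def icc_oneway(ratings):
    """One-way random-effects ICC(1,1) (an assumed form, see lead-in).

    `ratings` is a list of rows; each row holds the k ratings given to one record.
    """
    n, k = len(ratings), len(ratings[0])
    grand_mean = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    # Between-record and within-record mean squares from one-way ANOVA.
    ms_between = k * sum((m - grand_mean) ** 2 for m in row_means) / (n - 1)
    ms_within = sum(
        (x - m) ** 2 for row, m in zip(ratings, row_means) for x in row
    ) / (n * (k - 1))
    return (ms_between - ms_within) / (ms_between + (k - 1) * ms_within)


# Hypothetical data: 1 = judged iatrogenic, 0 = not, for four records.
reviewer_1 = [1, 1, 0, 0]
reviewer_2 = [1, 0, 0, 0]
print(cohen_kappa(reviewer_1, reviewer_2))

# Hypothetical continuous quality ratings: three records, two reviewers each.
print(icc_oneway([[1, 2], [3, 4], [5, 6]]))
```

To mirror the within-pair versus across-pair comparison described in the abstract, the same functions would be applied once to ratings from two members of one pair and once to ratings from reviewers belonging to different pairs.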
Pages: 152-161
Page count: 10