Effects of incorrect computer-aided detection (CAD) output on human decision-making in mammography

Cited by: 81
Authors
Alberdi, E
Povyakalo, A
Strigini, L
Ayton, P
Affiliations
[1] City Univ London, Ctr Software Reliabil, London EC1V 0HB, England
[2] City Univ London, Dept Psychol, London EC1V 0HB, England
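Funding
Engineering and Physical Sciences Research Council (EPSRC), UK
Keywords
computer-aided detection (CAD); mammography; evaluation; reliability
DOI
10.1016/j.acra.2004.05.012
Chinese Library Classification (CLC) number
R8 [Special medicine]; R445 [Diagnostic imaging]
Discipline classification code
1002; 100207; 1009
Abstract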
Rationale and Objectives. To investigate the effects of incorrect computer output on the reliability of the decisions of human users. This work followed an independent UK clinical trial that evaluated the impact of computer-aided detection (CAD) in breast screening. The aim was to feed data from this trial into probabilistic models (similar to those used in "reliability engineering") that would detect and assess possible ways of improving the human-CAD interaction. Some analyses required extra data; therefore, two supplementary studies were conducted. Study 1 was designed to elucidate the effects of computer failure on human performance. Study 2 was conducted to clarify unexpected findings from Study 1. Materials and Methods. In Study 1, 20 film readers viewed 60 sets of mammograms (30 of which contained cancer) and provided "recall/no recall" decisions for each case. Computer output for each case was available to the participants. The test set was designed to contain an unusually large proportion (50%) of cancers for which CAD had generated incorrect output. In Study 2, 19 different readers viewed the same set of cases in similar conditions, except that computer output was not available. Results. The average sensitivity of readers in Study 1 (with CAD) was significantly lower than the average sensitivity of readers in Study 2 (without CAD). The difference was most marked for cancers for which CAD failed to provide correct prompting. Conclusion. Possible automation bias effects in CAD use deserve further study because they may degrade human decision-making for some categories of cases under certain conditions. This possibility should be taken into account in the assessment and design of CAD tools.
Pages: 909-918
Page count: 10