Variations in measured performance of CAD schemes due to database composition and scoring protocol

Cited by: 29
Authors
Nishikawa, RM [1]
Yarusso, LM [1]
Affiliations
[1] Univ Chicago, Dept Radiol, Kurt Rossmann Labs Radiol Image Res, Chicago, IL 60637 USA
Source
MEDICAL IMAGING 1998: IMAGE PROCESSING, PTS 1 AND 2 | 1998 / Vol. 3338
Keywords
computer-aided diagnosis; performance; database; scoring; digital mammography;
DOI
10.1117/12.310894
Chinese Library Classification number
R318 [Biomedical Engineering];
Discipline classification code
0831;
Abstract
There is now a large effort towards developing computer-aided diagnosis (CAD) techniques. It is important to be able to compare the performance of different approaches in order to determine which are the most efficacious. There are currently a number of barriers preventing meaningful (statistical) comparisons, two of which are discussed in this paper: database composition and scoring protocol. We examined how the choice of cases used to test a CAD scheme can affect its measured performance. We found that the sensitivity of our computer scheme ranged from 100% to 77%, at a false-positive rate of 1.0 per image, with only a 10% change in the composition of the database. To evaluate the performance of a CAD scheme, the output of the computer must be graded, and different investigators use a number of different criteria. We found that, for the same set of detection results, the measured sensitivity can range from 40% to 90% depending on the scoring methodology. Clearly, consensus must be reached on these two issues for the field to make rapid progress. As it stands now, it is not possible to make meaningful comparisons of different techniques.
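The dependence of measured sensitivity on the scoring protocol can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the abstract does not name the scoring criteria that were compared, so the three rules below (any overlap, detection center inside the true region, and an overlap-ratio threshold) are commonly used stand-ins, and the box coordinates are invented toy data, not results from the paper.

```python
# Hypothetical toy data: boxes are (x_min, y_min, x_max, y_max) in pixels.
truths = [(10, 10, 50, 50), (100, 100, 140, 140), (200, 200, 260, 260)]
detections = [(12, 12, 48, 48), (110, 110, 160, 160), (255, 255, 300, 300)]


def area(box):
    return (box[2] - box[0]) * (box[3] - box[1])


def overlap_ratio(a, b):
    """Intersection area divided by union area of two boxes."""
    w = min(a[2], b[2]) - max(a[0], b[0])
    h = min(a[3], b[3]) - max(a[1], b[1])
    if w <= 0 or h <= 0:
        return 0.0
    inter = w * h
    return inter / (area(a) + area(b) - inter)


# Three illustrative (assumed) scoring rules, from lenient to strict.
def hit_any_overlap(truth, det):
    return overlap_ratio(truth, det) > 0.0


def hit_center_inside(truth, det):
    cx, cy = (det[0] + det[2]) / 2, (det[1] + det[3]) / 2
    return truth[0] <= cx <= truth[2] and truth[1] <= cy <= truth[3]


def hit_overlap_half(truth, det):
    return overlap_ratio(truth, det) >= 0.5


def sensitivity(truths, detections, is_hit):
    """Fraction of true lesions matched by at least one detection."""
    found = sum(any(is_hit(t, d) for d in detections) for t in truths)
    return found / len(truths)


for rule in (hit_any_overlap, hit_center_inside, hit_overlap_half):
    print(f"{rule.__name__}: {sensitivity(truths, detections, rule):.2f}")
```

On this toy data the same three detections score a sensitivity of 1.00, 0.67, or 0.33 depending only on which rule counts a detection as a true positive, which is the kind of spread the abstract reports.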
Pages: 840-844
Number of pages: 5