Variations in measured performance of CAD schemes due to database composition and scoring protocol

被引：29

作者：

Nishikawa, RM ^{[1
]}

Yarusso, LM ^{[1
]}

机构：

[1] Univ Chicago, Dept Radiol, Kurt Rossmann Labs Radiol Image Res, Chicago, IL 60637 USA

来源：

MEDICAL IMAGING 1998: IMAGE PROCESSING, PTS 1 AND 2 | 1998年 / 3338卷

关键词：

computer-aided diagnosis; performance; database; scoring; digital mammography;

D O I：

10.1117/12.310894

中图分类号：

R318 [生物医学工程];

学科分类号：

0831 ;

摘要：

There is now a large effort towards developing computer-aided diagnosis (CAD) techniques. It is important to be able to compare performance of different approaches to be able to determine which ones are the most efficacious. There are currently a number of barriers preventing meaningful (statistical) comparisons, two of which are discussed in this paper: database composition and scoring protocol. We have examined how the choice of cases used to test a CAD scheme can affect its performance. We found that our computer scheme varied between a sensitivity of 100% to 77%, at a false-positive rate of 1.0 per image, with only 10% change in the composition of the database. To evaluate the performance of a CAD scheme the output of the computer must be graded. There are a number of different criteria that are being used by different investigators. We have found that for the same set of detection results, the measured sensitivity can be betwen 40-90% depending on the scoring methodology. Clearly consensus must be reached on these two issues in order for the field to make rapid progress. As it stands now, it is not possible to make meaningful comparisons of different techniques.

引用

页码：840 / 844

页数：5