Computer-aided diagnosis with potential application to rapid detection of disease outbreaks

被引:4
作者
Burr, Tom
Koster, Frederick
Picard, Rick
Forslund, Dave
Wokoun, Doug
Joyce, Ed
Brillman, Judith
Froman, Phil
Lee, Jack
机构
[1] Los Alamos Natl Lab, Los Alamos, NM 87545 USA
[2] Univ New Mexico, Dept Emergency Med, Albuquerque, NM 87131 USA
[3] Albuquerque Ambulance Serv, Albuquerque, NM USA
关键词
Bayes classifier; iterative proportional fitting; biosurveillance; misdiagnosis rates;
D O I
10.1002/sim.2798
中图分类号
Q [生物科学];
学科分类号
07 [理学]; 0710 [生物学]; 09 [农学];
摘要
Our objectives are to quickly interpret symptoms of emergency patients to identify likely syndromes and to improve population-wide disease outbreak detection. We constructed a database A 248 syndromes, each syndrome having an estimated probability of producing any of 85 symptoms, with some two-way, three-way, and five-way probabilities reflecting correlations among symptoms. Using these multiway probabilities in conjunction with an iterative proportional fitting algorithm allows; estimation of full conditional probabilities. Combining these conditional probabilities with misdiagnosis error rates and incidence rates via Bayes theorem, the probability of each syndrome is estimated. We tested a prototype of computer-aided differential diagnosis (CADDY) on simulated data and on more than 100 real cases, including West Nile Virus, Q fever, SARS, anthrax, plague, tularaemia and toxic shock cases. We conclude that: (1) it is important to determine whether the unrecorded positive status of a symptom means that the status is negative or that the status is unknown; (2) inclusion of misdiagnosis error rates produces more realistic results; (3) the naive Bayes classifier, which assumes all symptoms behave independently, is slightly outperformed by CADDY, which includes available multi-symptom information on correlations; as more information regarding symptom correlations becomes available, the advantage of CADDY over the naive Bayes classifier should increase; (4) overlooking low-probability, high-con sequence events is less likely if the standard output summary is augmented with a list of rare syndromes that are consistent with observed symptoms, and (5) accumulating patient-level probabilities across a larger population can aid in biosurveillance for disease outbreaks. Copyright (c) 2007 John Wiley & Sons, Ltd.
引用
收藏
页码:1857 / 1874
页数:18
相关论文
共 37 条
[1]
Agresti A., 1993, CATEGORICAL DATA ANA
[2]
Bayes T., 1763, Philosophical Transactions, V53, P370, DOI [10.1098/rstl.1763.0053, DOI 10.1098/RSTL.1763.0053]
[3]
GIDEON: A computer program for diagnosis, simulation, and informatics in geographic medicine [J].
Berger, SA .
INFECTIOUS DISEASES IN CLINICAL PRACTICE, 1998, 7 (08) :383-386
[4]
PERFORMANCE OF 4 COMPUTER-BASED DIAGNOSTIC SYSTEMS [J].
BERNER, ES ;
WEBSTER, GD ;
SHUGERMAN, AA ;
JACKSON, JR ;
ALGINA, J ;
BAKER, AL ;
BALL, EV ;
COBBS, CG ;
DENNIS, VW ;
FRENKEL, EP ;
HUDSON, LD ;
MANCALL, EL ;
RACKLEY, CE ;
TAUNTON, D .
NEW ENGLAND JOURNAL OF MEDICINE, 1994, 330 (25) :1792-1796
[5]
Bishop M.M., 1975, DISCRETE MULTIVARIAT
[6]
Primary care in the United States - Innovations in primary care in the United States [J].
Bodenheimer, T .
BMJ-BRITISH MEDICAL JOURNAL, 2003, 326 (7393) :796-798
[7]
2 CASES OF FATAL INHALATION ANTHRAX, ONE ASSOCIATED WITH SARCOIDOSIS [J].
BRACHMAN, PS ;
ALBRINK, WS ;
PAGANO, JS .
NEW ENGLAND JOURNAL OF MEDICINE, 1961, 265 (05) :203-&
[8]
Systematic review: Surveillance systems for early detection of bioterrorism-related diseases [J].
Bravata, DM ;
McDonald, KM ;
Smith, WM ;
Rydzak, C ;
Szeto, H ;
Buckeridge, DL ;
Haberland, C ;
Owens, DK .
ANNALS OF INTERNAL MEDICINE, 2004, 140 (11) :910-922
[9]
BRILLMAN JC, 2005, MED INFORM DECISION, V5, P4, DOI DOI 10.1186/1472-6947-5-4
[10]
COWDERY JS, 1947, ARCH PATHOL, V43, P396