Intelligible Support Vector Machines for Diagnosis of Diabetes Mellitus

被引:181
作者
Barakat, Nahla H. [1 ]
Bradley, Andrew P. [2 ]
Barakat, Mohamed Nabil H. [3 ]
机构
[1] German Univ Technol Oman, Dept Appl Informat Technol, Muscat 130, Oman
[2] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld 4072, Australia
[3] Minist Hlth, Dept Noncommunicable Dis Surveillance & Control, Muscat 113, Oman
来源
IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE | 2010年 / 14卷 / 04期
关键词
Data mining; diabetes; machine learning; medical diagnosis; IMPAIRED GLUCOSE-TOLERANCE; METABOLIC SYNDROME; FEATURE-SELECTION; RULE EXTRACTION; RISK SCORE; CLASSIFICATION; ASSOCIATION; POPULATION; PREVALENCE; OMAN;
D O I
10.1109/TITB.2009.2039485
中图分类号
TP [自动化技术、计算机技术];
学科分类号
080201 [机械制造及其自动化];
摘要
Diabetes mellitus is a chronic disease and a major public health challenge worldwide. According to the International Diabetes Federation, there are currently 246 million diabetic people worldwide, and this number is expected to rise to 380 million by 2025. Furthermore, 3.8 million deaths are attributable to diabetes complications each year. It has been shown that 80% of type 2 diabetes complications can be prevented or delayed by early identification of people at risk. In this context, several data mining and machine learning methods have been used for the diagnosis, prognosis, and management of diabetes. In this paper, we propose utilizing support vector machines (SVMs) for the diagnosis of diabetes. In particular, we use an additional explanation module, which turns the "black box" model of an SVM into an intelligible representation of the SVM's diagnostic (classification) decision. Results on a real-life diabetes dataset show that intelligible SVMs provide a promising tool for the prediction of diabetes, where a comprehensible ruleset have been generated, with prediction accuracy of 94%, sensitivity of 93%, and specificity of 94%. Furthermore, the extracted rules are medically sound and agree with the outcome of relevant medical studies.
引用
收藏
页码:1114 / 1120
页数:7
相关论文
共 35 条
[1]
Aggoun Y, 2007, PEDIATR RES, V61, P653, DOI [10.1203/pdr.0b013e31805d8a8c, 10.1210/jc.2004-0372]
[2]
Diabetes risk score in Oman: A tool to identify prevalent type 2 diabetes among Arabs of the Middle East [J].
Al-Lawati, J. A. ;
Tuomilehto, J. .
DIABETES RESEARCH AND CLINICAL PRACTICE, 2007, 77 (03) :438-444
[3]
Fasting cut-points in determining prevalence of diabetes in an Arab population of the Middle East [J].
Al-Lawati, Jawad A. ;
Barakat, Mohammed N. .
DIABETES RESEARCH AND CLINICAL PRACTICE, 2007, 75 (02) :241-245
[4]
Metabolic syndrome - a new world-wide definition. A consensus statement from the international diabetes federation [J].
Alberti, KGMM ;
Zimmet, P ;
Shaw, J .
DIABETIC MEDICINE, 2006, 23 (05) :469-480
[5]
High prevalence of diabetes mellitus and impaired glucose tolerance in the Sultanate of Oman: Results of the 1991 national survey [J].
Asfour, MG ;
Lambourne, A ;
Soliman, A ;
AlBehlani, S ;
AlAsfoor, D ;
Bold, A ;
Mahtab, H ;
King, H .
DIABETIC MEDICINE, 1995, 12 (12) :1122-1125
[6]
BARAKAT MN, 2005, INT J DIABETES METAB, V13, P42
[7]
Barakat N., 2007, THESIS U QUEENSLAND
[8]
BARAKAT N, 18 INT C PATT REC IC
[9]
Barakat NH, 2007, IEEE T KNOWL DATA EN, V19, P729, DOI [10.1109/TKDE.2007.1023, 10.1109/TKDE.2007.1023.]
[10]
Intelligent analysis of clinical time series: an application in the diabetes mellitus domain [J].
Bellazzi, R ;
Larizza, C ;
Magni, P ;
Montani, S ;
Stefanelli, M .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2000, 20 (01) :37-57