A data mining approach for diagnosis of coronary artery disease

被引:172
作者
Alizadehsani, Roohallah [1 ]
Habibi, Jafar [1 ]
Hosseini, Mohammad Javad [1 ]
Mashayekhi, Hoda [1 ]
Boghrati, Reihane [1 ]
Ghandeharioun, Asma [1 ]
Bahadorian, Behdad [2 ]
Sani, Zahra Alizadeh [2 ]
机构
[1] Sharif Univ Technol, Dept Comp Engn, Tehran, Iran
[2] Univ Tehran Med Sci, Rajaie Cardiovasc Med & Res Ctr, Tehran, Iran
关键词
Classification; Data mining; Coronary artery disease; SMO; Bagging; Neural Networks;
D O I
10.1016/j.cmpb.2013.03.004
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Cardiovascular diseases are very common and are one of the main reasons of death. Being among the major types of these diseases, correct and in-time diagnosis of coronary artery disease (CAD) is very important. Angiography is the most accurate CAD diagnosis method; however, it has many side effects and is costly. Existing studies have used several features in collecting data from patients, while applying different data mining algorithms to achieve methods with high accuracy and less side effects and costs. In this paper, a dataset called Z-Alizadeh Sani with 303 patients and 54 features, is introduced which utilizes several effective features. Also, a feature creation method is proposed to enrich the dataset. Then Information Gain and confidence were used to determine the effectiveness of features on CAD. Typical Chest Pain, Region RWMA2, and age were the most effective ones besides the created features by means of Information Gain. Moreover Q Wave and ST Elevation had the highest confidence. Using data mining methods and the feature creation algorithm, 94.08% accuracy is achieved, which is higher than the known approaches in the literature. (C) 2013 Elsevier Ireland Ltd. All rights reserved.
引用
收藏
页码:52 / 61
页数:10
相关论文
共 20 条
[1]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[2]  
[Anonymous], 2006, Introduction to Data Mining
[3]  
Ben-Hur A, 2010, METHODS MOL BIOL, V609, P223, DOI 10.1007/978-1-60327-241-4_13
[4]  
Bonow R., 2012, Braunwald's heart disease- a textbook of cardiovascular medicine
[5]  
Breiman L, 1996, MACH LEARN, V24, P123, DOI 10.1023/A:1018054314350
[6]  
Caruana R., 2006, ACM INT C P SER, P161, DOI [10.1145/1143844.1143865, DOI 10.1145/1143844.1143865]
[7]  
Chu ChiMing Chu ChiMing, 2009, Journal of Medical Sciences, V29, P187
[8]  
Itchhaporia D., 1995, J AM COLL CARDIOL, V25, P23
[9]   Assessment of the Risk Factors of Coronary Heart Events Based on Data Mining With Decision Trees [J].
Karaolis, Minas A. ;
Moutiris, Joseph A. ;
Hadjipanayi, Demetra ;
Pattichis, Constantinos S. .
IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE, 2010, 14 (03) :559-566
[10]   Recommendations for the standardization and interpretation of the electrocardiogram - Part I: The electrocardiogram and its technology - A scientific statement from the American Heart Association Electrocardiography and Arrhythmias Committee, Council on Clinical Cardiology; the American College of Cardiology Foundation; and the Heart Rhythm Society [J].
Kligfield, Paul ;
Gettes, Leonard S. ;
Bailey, James J. ;
Childers, Rory ;
Deal, Barbara J. ;
Hancock, E. William ;
van Herpen, Gerard ;
Kors, Jan A. ;
Macfarlane, Peter ;
Mirvis, David M. ;
Pahlm, Olle ;
Rautaharju, Pentti ;
Wagner, Galen S. .
JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2007, 49 (10) :1109-1127