Mix-ratio sampling: Classifying multiclass imbalanced mouse brain images using support vector machine

被引:15
作者
Bae, Min Hyeok [1 ]
Wu, Teresa [1 ]
Pan, Rong [1 ]
机构
[1] Arizona State Univ, Dept Ind Syst & Operat Engn, Tempe, AZ 85287 USA
关键词
Sampling procedure; Imbalanced dataset; Multiclass classification; Support vector machine; Data mining; Brain image segmentation; AUTOMATED SEGMENTATION; NEUROANATOMICAL STRUCTURES; CLASSIFICATION; MRI;
D O I
10.1016/j.eswa.2009.12.018
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Support Vector Machine (SVM) is a classifier designed to achieve optimized classification accuracy. It has been applied to numerous applications associated with images. Yet challenges remain when applying SVM on segmenting mouse brain images. This is due to the fact that each high-resolution mouse brain image is a very large data set and it is a multiclass classification problem with extremely imbalanced data size for different classes. To address these issues, a mix-ratio sampling approach for SVM is proposed which determines various over-sampling ratios for different minority classes. In addition, to improve the imaging classification accuracy, spatial information is incorporated into the classification problem. Five mouse Magnetic Resonance Microscopy (MRM) images are collected to test the accuracy of classifying 21 brain structures. The first comparison experiment demonstrates the SVM with mix-ratio sampling method relieves the imbalance problem for multiclass more effectively and efficiently than the SVM with simple over-sampling method. In the second comparison experiment, another classifier, Artificial Neural Network (ANN) is used to compare against SVM based on the same mix-ratio sampled data and the results indicate that SVM shows better classification performance than ANN. Thirdly, the cross validation is conducted to demonstrate SVM with mix-ration sampling can classify multiclass imbalanced data with high accuracy. Published by Elsevier Ltd.
引用
收藏
页码:4955 / 4965
页数:11
相关论文
共 27 条
[1]  
Abe S., 2005, Support vector machines for pattern classification, V2
[2]   Applying support vector machines to imbalanced datasets [J].
Akbani, R ;
Kwek, S ;
Japkowicz, N .
MACHINE LEARNING: ECML 2004, PROCEEDINGS, 2004, 3201 :39-50
[3]   Automated segmentation of neuroanatomical structures in multispectral MR microscopy of the mouse brain [J].
Ali, AA ;
Dale, AM ;
Badea, A ;
Johnson, GA .
NEUROIMAGE, 2005, 27 (02) :425-435
[4]  
[Anonymous], 1999, Proceedings of the International Joint Conference on Artificial Intelligence
[5]   A voxel-based morphometry study of temporal lobe gray matter reductions in Alzheimer's disease [J].
Busatto, GF ;
Garrido, GEJ ;
Almeida, OP ;
Castro, CC ;
Camargo, CHP ;
Cid, CG ;
Buchpiguel, CA ;
Furuie, S ;
Bottino, CM .
NEUROBIOLOGY OF AGING, 2003, 24 (02) :221-231
[6]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[7]   Classification of magnetic resonance brain images using wavelets as input to support vector machine and neural network [J].
Chaplot, Sandeep ;
Patnaik, L. M. ;
Jagannathan, N. R. .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2006, 1 (01) :86-92
[8]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[9]  
EVANS AC, 1994, NATO ADV SCI INST SE, V264, P263
[10]   Whole brain segmentation: Automated labeling of neuroanatomical structures in the human brain [J].
Fischl, B ;
Salat, DH ;
Busa, E ;
Albert, M ;
Dieterich, M ;
Haselgrove, C ;
van der Kouwe, A ;
Killiany, R ;
Kennedy, D ;
Klaveness, S ;
Montillo, A ;
Makris, N ;
Rosen, B ;
Dale, AM .
NEURON, 2002, 33 (03) :341-355