A MapReduce-based distributed SVM algorithm for automatic image annotation

被引:57
作者
Alham, Nasullah Khalid [1 ]
Li, Maozhen [1 ]
Liu, Yang [1 ]
Hammoud, Suhel [1 ]
机构
[1] Brunel Univ, Sch Engn & Design, Uxbridge UB8 3PH, Middx, England
关键词
Automatic image annotation; MapReduce framework; Sequential minimal optimization; Support vector machine;
D O I
10.1016/j.camwa.2011.07.046
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Machine learning techniques have facilitated image retrieval by automatically classifying and annotating images with keywords. Among them Support Vector Machines (SVMs) have been used extensively due to their generalization properties. However, SVM training is notably a computationally intensive process especially when the training dataset is large. This paper presents MRSMO, a MapReduce based distributed SVM algorithm for automatic image annotation. The performance of the MRSMO algorithm is evaluated in an experimental environment. By partitioning the training dataset into smaller subsets and optimizing the partitioned subsets across a cluster of computers, the MRSMO algorithm reduces the training time significantly while maintaining a high level of accuracy in both binary and multiclass classifications. (C) 2011 Elsevier Ltd. All rights reserved.
引用
收藏
页码:2801 / 2811
页数:11
相关论文
共 39 条
[1]  
Abe S., 2005, Support vector machines for pattern classification, V2
[2]  
[Anonymous], 3 DAT MIN OP SOURC M
[3]  
[Anonymous], 2008, P 25 INT C MACHINE L, DOI DOI 10.1145/1390156.1390170
[4]  
[Anonymous], PARALLEL SMO TRAININ
[5]  
[Anonymous], P ACM MULT WORKSH MU
[6]  
[Anonymous], OP SOURC JAV CONT BA
[7]  
[Anonymous], PROGRAMMING HADOOP
[8]  
[Anonymous], BMC BIOINFORMA UNPUB
[9]  
[Anonymous], PARALLEL MIXTURE SVM
[10]  
[Anonymous], COR IM DAT