A Classification Algorithm for Network Traffic based on Improved Support Vector Machine

被引:14
作者
Ding, Lei [1 ]
Yu, Fei [2 ]
Peng, Sheng [1 ]
Xu, Chen [2 ,3 ]
机构
[1] Jishou Univ, Sch Informat Sci & Engn, Jishou 416000, Peoples R China
[2] Soochow Univ, Jiangsu Prov Key Lab Comp Informat Proc Technol, Suzhou 215000, Peoples R China
[3] Hunan Univ, Sch Informat Sci & Engn, Changsha 416000, Hunan, Peoples R China
关键词
improved SVM; probabilistic distributing area of a feature; contribution degree; Gustafson-Kessel clustering algorithm;
D O I
10.4304/jcp.8.4.1090-1096
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
An algorithm to classify the network traffic based on improved support vector machine (SVM) is presented in this paper. Each feature of the traditional support vector machine (SVM) algorithm has the same effect on classification rather than considering its practical effect. To improve the classification accuracy of SVM, the probabilistic distributing area of a feature in a kind of network traffic is obtained from the real network traffic. Then the overlapped degree of the feature's probabilistic distributing area between two different kinds of network traffic is calculated to obtain the feature's contribution degree, and the corresponding weight value of the feature is derived from its contribution degree. Thus each feature has different effect on the classification according to its weight value. Considering the feature's probabilistic distributing area is affected by the outliers or noises intensively, the data space is mapped to high dimension feature space, and the Gustafson-Kessel clustering algorithm is employed to deal with the outliers or noises existing in the input samples. The experimental results show that the method presented in this paper has a higher classification accuracy.
引用
收藏
页码:1090 / 1096
页数:7
相关论文
共 20 条
[1]  
Alshammari R, 2009, IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN CYBER SECURITY, P167
[2]  
Dusi M, 2009, IEEE ICC, P702
[3]   Support Vector Machines for TCP traffic classification [J].
Este, Alice ;
Gringoli, Francesco ;
Salgarelli, Luca .
COMPUTER NETWORKS, 2009, 53 (14) :2476-2490
[4]  
Gu Chengjie, 2011, Chinese Journal of Scientific Instrument, V32, P1507
[5]  
Guang Cheng, 2011, 2011 International Conference on Computer Science and Service System (CSSS), P914
[6]  
Internet assigned numbers authority (IANA), PORT NUMB
[7]  
Kumar Santosh, 2012, Data Engineering and Management. Second International Conference, ICDEM 2010. Revised Selected Papers, P80, DOI 10.1007/978-3-642-27872-3_12
[8]  
Moore A. W., 2005, Performance Evaluation Review, V33, P50, DOI 10.1145/1071690.1064220
[9]   A Survey of Techniques for Internet Traffic Classification using Machine Learning [J].
Nguyen, Thuy T. T. ;
Armitage, Grenville .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2008, 10 (04) :56-76
[10]  
Ohm P, 2007, IMC'07: PROCEEDINGS OF THE 2007 ACM SIGCOMM INTERNET MEASUREMENT CONFERENCE, P141