基于大数据分析的移动互联网服务提供商流量挖掘与建模(英文)

被引:16
作者
刘军 [1 ]
李婷婷 [1 ]
CHENG Gang [2 ]
于华 [1 ]
雷振明 [1 ]
机构
[1] Beijing Key Laboratory of Network System Architecture and Convergence,Beijing University of Posts and Telecommunications
[2] Microsoft Corporation
关键词
mobile Internet; service provider; traffic measurement; MapReduce; time series clustering;
D O I
暂无
中图分类号
TP393.06 [];
学科分类号
081201 ; 1201 ;
摘要
Understanding the dynamic traffic and usage characteristics of data services in cellular networks is important for optimising network resources and improving user experience.Recent studies have illustrated traffic characteristics from specific perspectives,such as user behaviour,device type,and applications.In this paper,we present the results of our study from a different perspective,namely service providers,to reveal the traffic characteristics of cellular data networks.Our study is based on traffic data collected over a five-day period from a leading mobile operator's core network in China.We propose a Zipf-like model to characterise the distributions of the traffic volume,subscribers,and requests among service providers.Nine distinct diurnal traffic patterns of service providers are identified by formulating and solving a time series clustering problem.Our work differs from previous related works in that we perform measurements on a large quantity of data covering 2.2 billion traffic records,and we first explore the traffic patterns of thousands of service providers.Results of our study present mobile Internet participants with a better understanding of the traffic and usage characteristics of service providers,which play a critical role in the mobile Internet era.
引用
收藏
页码:25 / 36
页数:12
相关论文
共 5 条
[1]   基于Jaccard的移动终端自动识别并行算法及其MapReduce实现(英文) [J].
刘军 ;
李银周 ;
Felix Cuadrado ;
Steve Uhlig ;
雷振明 .
中国通信, 2013, 10 (07) :71-84
[2]  
MapReduce[J] . Jeffrey Dean,Sanjay Ghemawat.Communications of the ACM . 2008 (1)
[3]   Clustering of time series data - a survey [J].
Liao, TW .
PATTERN RECOGNITION, 2005, 38 (11) :1857-1874
[4]  
Estimating the number of clusters in a data set via the gap statistic[J] . RobertTibshirani,GuentherWalther,TrevorHastie.Journal of the Royal Statistical Society: Series B (Statistical Methodology) . 2002 (2)
[5]   AN EXAMINATION OF PROCEDURES FOR DETERMINING THE NUMBER OF CLUSTERS IN A DATA SET [J].
MILLIGAN, GW ;
COOPER, MC .
PSYCHOMETRIKA, 1985, 50 (02) :159-179