ProFID: Practical frequent items discovery in peer-to-peer networks

被引:7
作者
Cem, Emrah [1 ]
Ozkasap, Oznur [2 ]
机构
[1] Univ Texas Dallas, Dept Comp Sci, Dallas, TX 75230 USA
[2] Koc Univ, Dept Comp Engn, Istanbul, Turkey
来源
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2013年 / 29卷 / 06期
关键词
Peer-to-peer; Distributed computing; Gossip-based (epidemic) protocols; Frequent items; AGGREGATION; COMPUTATION; ALGORITHMS; ELEMENTS; STREAMS;
D O I
10.1016/j.future.2012.10.002
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We address the problem of discovering frequent items in unstructured P2P networks which is relevant for several distributed services such as cache management, data replication, query refinement, topology optimization and security. This study makes the following contributions to the current state of the art. First, we propose and develop a fully distributed Protocol for Frequent Items Discovery (ProFID) where the result is produced at every peer. ProFID uses gossip-based (epidemic) communication, a novel pairwise averaging function and system size estimation together to discover frequent items in an unstructured P2P network. We also propose a practical rule for convergence of the algorithm. In contrast to the previous works, each peer gives a local decision for convergence based on the change of updated local state. We developed a model of ProFID in PeerSim and performed various experiments to compare and evaluate its efficiency, scalability, and applicability. The protocol's resilience under realistic churn models was studied. For evaluating the effect of network dynamics, we deployed our protocol on the Internet-scale real network PlanetLab. We also compared the accuracy and scalability of ProFID with the adaptive Push-Sum algorithm. Our results confirm the practical nature, ease of deployment and efficiency of our approach, and also show that it outperforms adaptive Push-Sum in terms of accuracy, convergence speed and message overhead. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:1544 / 1560
页数:17
相关论文
共 42 条
[1]   Network sampling and classification: An investigation of network model representations [J].
Airoldi, Edoardo M. ;
Bai, Xue ;
Carley, Kathleen M. .
DECISION SUPPORT SYSTEMS, 2011, 51 (03) :506-518
[2]  
[Anonymous], 2006, Proc. of ACM Symposium on Principles of Distributed Computing
[3]  
[Anonymous], 2006, KDD
[4]   Emergence of scaling in random networks [J].
Barabási, AL ;
Albert, R .
SCIENCE, 1999, 286 (5439) :509-512
[5]   Handling dynamics in diffusive aggregation schemes: An evaporative approach [J].
Bicocchi, Nicola ;
Mamei, Marco ;
Zambonelli, Franco .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2010, 26 (06) :877-889
[6]  
Birman Ken, 2007, Operating Systems Review, V41, P8, DOI 10.1145/1317379.1317382
[7]  
Boyd S, 2005, IEEE INFOCOM SER, P1653
[8]  
Cem E., 2010, P ISCIS 25 INT S COM, P199
[9]   Robust computation of aggregates in wireless sensor networks: Distributed randomized algorithms and analysis [J].
School of Electrical and Computer Engineering, Purdue University, Box 165, West Lafayette, IN 47907, United States ;
不详 .
IEEE Trans Parallel Distrib Syst, 2006, 9 (987-1000) :987-1000
[10]   Aggregation methods for large-scale sensor networks [J].
Chitnis, Laukik ;
Dobra, Alin ;
Ranka, Sanjay .
ACM TRANSACTIONS ON SENSOR NETWORKS, 2008, 4 (02)