Exploring networks with traceroute-like probes:: Theory and simulations

被引:66
作者
Dall'Asta, L
Alvarez-Hamelin, I
Barrata, A
Vázquez, A
Vespignani, A
机构
[1] Univ Paris 11, CNRS, UMR 8627, Phys Theor Lab, F-91405 Orsay, France
[2] Univ Notre Dame, Notre Dame, IN 46556 USA
[3] Indiana Univ, Sch Informat, Bloomington, IN 47408 USA
[4] Indiana Univ, Dept Phys, Bloomington, IN 47408 USA
关键词
traceroute; Internet exploration; topology inference;
D O I
10.1016/j.tcs.2005.12.009
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Mapping the Internet generally consists in sampling the network from a limited set of sources by using traceroute-like probes. This methodology, akin to the merging of different spanning trees to a set of destination, has been argued to introduce uncontrolled sampling biases that might produce statistical properties of the sampled graph which sharply differ from the original ones. In this paper, we explore these biases and provide a statistical analysis of their origin. We derive an analytical approximation for the probability of edge and vertex detection that exploits the role of the number of sources and targets and allows us to relate the global topological properties of the underlying network with the statistical accuracy of the sampled graph. In particular, we find that the edge and vertex detection probability depends on the betweenness centrality of each element. This allows us to show that shortest path routed sampling provides a better characterization of underlying graphs with broad distributions of connectivity. We complement the analytical discussion with a throughout numerical investigation of simulated mapping strategies in network models with different topologies. We show that sampled graphs provide a fair qualitative characterization of the statistical properties of the original networks in a fair range of different strategies and exploration parameters. Moreover, we characterize the level of redundancy and completeness of the exploration process as a function of the topological properties of the network. Finally, we study numerically how the fraction of vertices and edges discovered in the sampled graph depends on the particular deployements of probing sources. The results might hint the steps toward more efficient mapping strategies. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:6 / 24
页数:19
相关论文
共 27 条
[1]  
[Anonymous], 2005, STOC 05: Proceedings of the thirty-seventh annual ACM symposium on Theory of computing, DOI DOI 10.1145/1060590
[2]  
[Anonymous], CSETR43300 U MICH EE
[3]  
BALDI P, 2003, PROBABILISTIC METHOD
[4]   Emergence of scaling in random networks [J].
Barabási, AL ;
Albert, R .
SCIENCE, 1999, 286 (5439) :509-512
[5]   Betweenness centrality in large complex networks [J].
Barthélemy, M .
EUROPEAN PHYSICAL JOURNAL B, 2004, 38 (02) :163-168
[6]   A faster algorithm for betweenness centrality [J].
Brandes, U .
JOURNAL OF MATHEMATICAL SOCIOLOGY, 2001, 25 (02) :163-177
[7]  
BROIDO A, 2001, SAN DIEG P SPIE INT
[8]   Mapping the Internet [J].
Burch, H ;
Cheswick, B .
COMPUTER, 1999, 32 (04) :97-+
[9]   The fractal properties of Internet [J].
Caldarelli, G ;
Marchetti, R ;
Pietronero, L .
EUROPHYSICS LETTERS, 2000, 52 (04) :386-391
[10]  
CHEN Q, 2002, P IEEE INFOCOM 2002