On the Bias of Traceroute Sampling: or, Power-Law Degree Distributions in Regular Graphs

被引:41
作者
Achlioptas, Dimitris [1 ]
Clauset, Aaron [4 ,5 ]
Kempe, David [2 ]
Moore, Cristopher [3 ,4 ,5 ]
机构
[1] Univ Calif Santa Cruz, Dept Comp Sci, Santa Cruz, CA 95064 USA
[2] Univ So Calif, Dept Comp Sci, Los Angeles, CA 90089 USA
[3] 1 Univ New Mexico, Dept Comp Sci, Albuquerque, NM 87131 USA
[4] Univ New Mexico, Albuquerque, NM 87131 USA
[5] Santa Fe Inst, Santa Fe, NM USA
基金
欧洲研究理事会;
关键词
Measurement; Reliability; Theory; Internet topology; traceroute; sampling bias; INTERNET TOPOLOGY; NETWORKS; EXPLORATION; PRINCIPLES;
D O I
10.1145/1538902.1538905
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Understanding the graph structure of the Internet is a crucial step for building accurate network models and designing efficient algorithms for Internet applications. Yet, obtaining this graph structure can be a surprisingly difficult task, as edges cannot be explicitly queried. For instance, empirical studies of the network of Internet Protocol (IP) addresses typically rely on indirect methods like traceroute to build what are approximately single-source, all-destinations, shortest-path trees. These trees only sample a fraction of the network's edges, and a paper by Lakhina et al. [2003] found empirically that the resulting sample is intrinsically biased. Further, in simulations, they observed that the degree distribution under traceroute sampling exhibits a power law even when the underlying degree distribution is Poisson. In this article, we study the bias of traceroute sampling mathematically and, for a very general class of underlying degree distributions, explicitly calculate the distribution that will be observed. As example applications of our machinery, we prove that traceroute sampling finds power-law degree distributions in both delta-regular and Poisson-distributed random graphs. Thus, our work puts the observations of Lakhina et al. on a rigorous footing, and extends them to nearly arbitrary degree distributions.
引用
收藏
页数:28
相关论文
共 36 条
[1]   Understanding Internet topology: Principles, models, and validation [J].
Alderson, D ;
Li, L ;
Willinger, W ;
Doyle, JC .
IEEE-ACM TRANSACTIONS ON NETWORKING, 2005, 13 (06) :1205-1218
[2]  
AMINI L, 2002, SPIE IT COM
[3]  
[Anonymous], 1988, SIAM J. Discrete Math, DOI DOI 10.1137/0401033
[4]  
BARFORD P, 2001, SIGCOMM INT MEAS WOR
[5]   Distance distribution in random graphs and application to network exploration [J].
Blondel, Vincent D. ;
Guillaume, Jean-Loup ;
Hendrickx, Julien M. ;
Jungers, Raphael M. .
PHYSICAL REVIEW E, 2007, 76 (06)
[6]  
Bollobas, 2001, RANDOM GRAPHS, DOI 10.1017/CBO9780511814068
[7]  
CHEN Q, 2002, P 21 ACM SIGCOMM C
[8]   Accuracy and scaling phenomena in Internet mapping [J].
Clauset, A ;
Moore, C .
PHYSICAL REVIEW LETTERS, 2005, 94 (01)
[9]  
COHEN R, 2007, P EUR C COMPL SYST E
[10]   Exploring networks with traceroute-like probes:: Theory and simulations [J].
Dall'Asta, L ;
Alvarez-Hamelin, I ;
Barrata, A ;
Vázquez, A ;
Vespignani, A .
THEORETICAL COMPUTER SCIENCE, 2006, 355 (01) :6-24