Stochastic models for the web graph

被引:296
作者
Kumar, R [1 ]
Raghavan, P [1 ]
Rajagopalan, S [1 ]
Sivakumar, D [1 ]
Tomkins, A [1 ]
Upfal, E [1 ]
机构
[1] IBM Corp, Almaden Res Ctr, San Jose, CA 95120 USA
来源
41ST ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, PROCEEDINGS | 2000年
关键词
D O I
10.1109/SFCS.2000.892065
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The web may be viewed as a directed graph each of whose vertices is a static HTML web page, and each of whose edges corresponds to a hyperlink from one web page to another In this paper we propose and analyze random graph models inspired by a series of empirical observations on the web. Our graph models differ from the traditional G(n,p) models in two ways: 1. Independently chosen edges do not result in the statistics (degree distributions, clique multitudes) observed on the web. Thus, edges in our model are statistically dependent on each other 2. Our model introduces new vertices in the graph as time evolves. This captures the fact that the web is changing with time Our results are two fold: we show that graphs generated using our model exhibit the statistics observed on the web graph, and additionally, that natural graph models proposed earlier do not exhibit them. This remains true even when these earlier models are generalized to account for the arrival of vertices over time. In particular; the sparse random graphs in our models exhibit properties that do not arise in far denser random graphs generated by Erdos-Renyi models.
引用
收藏
页码:57 / 65
页数:9
相关论文
共 16 条
[1]  
Aiello W., 2000, Proceedings of the Thirty Second Annual ACM Symposium on Theory of Computing, P171, DOI 10.1145/335305.335326
[2]   Internet -: Diameter of the World-Wide Web [J].
Albert, R ;
Jeong, H ;
Barabási, AL .
NATURE, 1999, 401 (6749) :130-131
[3]  
ALDOUS D, 2000, RANDOM GRAPH IMMIGRA
[4]  
[Anonymous], 1949, Human behaviour and the principle of least-effort
[5]  
[Anonymous], SOCIOLOGICAL RES ONL
[6]  
Bollobas B, 1985, RANDOM GRAPHS
[7]   Graph structure in the Web [J].
Broder, A ;
Kumar, R ;
Maghoul, F ;
Raghavan, P ;
Rajagopalan, S ;
Stata, R ;
Tomkins, A ;
Wiener, J .
COMPUTER NETWORKS-THE INTERNATIONAL JOURNAL OF COMPUTER AND TELECOMMUNICATIONS NETWORKING, 2000, 33 (1-6) :309-320
[8]  
Egghe L., 1990, INTRO INFORMETRICS
[9]  
Feller W, 1950, An Introduction to Probability Theory and Its Applications, VI