How dynamic is the Web?

被引:76
作者
Brewington, BE [1 ]
Cybenko, G [1 ]
机构
[1] Dartmouth Coll, Thayer Sch Engn, Hanover, NH 03755 USA
基金
美国国家科学基金会;
关键词
Web dynamics; monitoring; document management;
D O I
10.1016/S1389-1286(00)00045-1
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Recent experiments and analysis suggest that there are about 800 million publicly-indexable Web pages. However, unlike books in a traditional Library, Web pages continue to change even after they are initially published by their authors and indexed by search engines. This paper describes preliminary data on and statistical analysis of the frequency and nature of Web page modifications. Using empirical models and a novel analytic metric of 'up-to-dateness', we estimate the rate at which Web search engines must re-index the Web to remain current. (C) 2000 Published by Elsevier Science B.V. All rights reserved.
引用
收藏
页码:257 / 276
页数:20
相关论文
共 9 条
[1]  
COFFMAN EG, 1997, J SCHEDULING
[2]  
DOUGLIS F, 1997, P USENIX S INT TECHN
[3]  
Feller W., 1971, An introduction to probability theory and its applications, V2
[4]  
GRAY M, 1997, INTERNET GROWTH SUMM
[5]   Searching the World Wide Web [J].
Lawrence, S ;
Giles, CL .
SCIENCE, 1998, 280 (5360) :98-100
[6]  
LAWRENCE S, 1999, NATURE
[7]  
Montgomery D.C., 2010, Applied Statistics and Probability for Engineers, V5th ed.
[8]  
Papoulis A., 1984, Probability, Random Variables and Stochastic Processes, V2nd
[9]  
1995, INFORMANT