Monte Carlo methods in pagerank computation: When one iteration is sufficient

被引:125
作者
Avrachenkov, K.
Litvak, N.
Nemirovsky, D.
Osipova, N.
机构
[1] INRIA Sophia Antipolis, MAESTRO Team, F-06902 Sophia Antipolis, France
[2] Univ Twente, Dept Appl Math, Fac EEMCS, NL-7500 AE Enschede, Netherlands
[3] St Petersburg State Univ, Dept Informat Technol, St Petersburg 198504, Russia
关键词
Google; PageRank; Monte Carlo methods; absorbing Markov chains;
D O I
10.1137/050643799
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
PageRank is one of the principle criteria according to which Google ranks Web pages. PageRank can be interpreted as a frequency of visiting a Web page by a random surfer, and thus it reflects the popularity of a Web page. Google computes the PageRank using the power iteration method, which requires about one week of intensive computations. In the present work we propose and analyze Monte Carlo-type methods for the PageRank computation. There are several advantages of the probabilistic Monte Carlo methods over the deterministic power iteration method: Monte Carlo methods already provide good estimation of the PageRank for relatively important pages after one iteration; Monte Carlo methods have natural parallel implementation; and finally, Monte Carlo methods allow one to perform continuous update of the PageRank as the structure of the Web changes.
引用
收藏
页码:890 / 904
页数:15
相关论文
共 16 条
[1]  
ABITEBOUL S, 2003, P 12 INT WORLD WID W
[2]  
[Anonymous], 2003, P 12 INT WORLD WID W
[3]  
Bianchini M., 2005, ACM Transactions on Internet Technology, V5, P92, DOI 10.1145/1052934.1052938
[4]   Catalytic Perfect Simulation [J].
Breyer, L. A. ;
Roberts, G. O. .
METHODOLOGY AND COMPUTING IN APPLIED PROBABILITY, 2001, 3 (02) :161-177
[5]  
Breyer LA, 2002, MARKOVIAN PAGE RANKI
[6]  
Brin S., 1998, The pagerank citation ranking: Bringing order to the web
[7]   Graph structure in the Web [J].
Broder, A ;
Kumar, R ;
Maghoul, F ;
Raghavan, P ;
Rajagopalan, S ;
Stata, R ;
Tomkins, A ;
Wiener, J .
COMPUTER NETWORKS-THE INTERNATIONAL JOURNAL OF COMPUTER AND TELECOMMUNICATIONS NETWORKING, 2000, 33 (1-6) :309-320
[8]  
Dill S., 2002, ACM Trans. Internet Technol., V2, P205, DOI DOI 10.1145/572326.572328
[9]  
FOGARAS D, 2004, P 3 INT WORKSH ALG M
[10]  
Henzinger Monika R, 2004, Internet Mathematics, V1, P115