Web search engine:characteristics of user behaviors and their implication

被引:3
作者
王建勇
单松巍
雷鸣
谢正茂
李晓明
机构
关键词
world wide web; search engine; distribution characteristic; web information; user behavior;
D O I
暂无
中图分类号
TP393.03 [];
学科分类号
081201 ; 1201 ;
摘要
<正>In this paper, first studied are the distribution characteristics of user behaviors based on log data from a massive web search engine. Analysis shows that stochastic distribution of user queries accords with the characteristics of power-law function and exhibits strong similarity, and the user' s queries and clicked URLs present dramatic locality, which implies that query cache and 'hot click' cache can be employed to improve system performance. Then three typical cache replacement policies are compared, including LRU, FIFO, and LFU with attenuation. In addition, the distribution character-istics of web information are also analyzed, which demonstrates that the link popularity and replica pop-ularity of a URL have positive influence on its importance. Finally, variance between the link popularity and user popularity, and variance between replica popularity and user popularity are analyzed, which give us some important insight that helps us improve the ranking algorithms in a search engine.
引用
收藏
页码:351 / 365
页数:15
相关论文
共 1 条
[1]  
On the self-similar nature of ethernet traffic (extended version). Leland,W. E. et al. IEEE ACM Transactions on Networking . 1994