A rapid egocentric search scheme using authority estimation in blog space

被引:1
作者
Jeong, Yoonjae [2 ]
Lee, Dongman [1 ]
机构
[1] Informat & Commun Univ, Sch Engn, Taejon, South Korea
[2] Informat & Commun Univ, Digital Media Lab, Seoul, South Korea
关键词
information retrieval; search engines; social networks;
D O I
10.1108/14684520810879854
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Purpose - The purpose of this study is to improve the egocentric search speed for important documents in neighbouring blogs. Design/methodology/approach - This paper presents a rapid egocentric search scheme that narrows down the search space to more important blogs. To determine which blogs are more valuable among a user's neighbouring blogs, a heuristic function is developed that predicts the authority scores on the basis of the local information of the blog. The proposed approach improves the speed of the egocentric search process and the quality of retrieved documents. Findings - A blog is a new medium that is receiving considerable attention. Its links enable one to acquire information about social relations between bloggers in a blog space, and these relations reflect bloggers' interests. Therefore, the ability to search documents in linked blogs is significant for bloggers. An egocentric search method is proposed to search for documents in such neighbouring blogs. However, it takes considerable time to find the most valuable documents in a user's neighbouring blogs when many blogs are linked to that user's blog. Originality/value - This study shows that the number of neighbouring blogs, which are linked to a blog with trackbacks and comments, is important for estimating the authority of a blog. In the experimental results this method performs about five times faster than the egocentric search using a breadth-first search strategy in searching for the top 5 per cent of the most important documents in the neighbouring blogs.
引用
收藏
页码:236 / 253
页数:18
相关论文
共 25 条
  • [1] *6 AP, 2002, TRACKB TECHN SPEC
  • [2] [Anonymous], MODERN INFORM RETRIE
  • [3] [Anonymous], 2003, P 12 INT C WORLD WID
  • [4] [Anonymous], 2004, P 37 HAW INT C SYST
  • [5] BAEZAYATES R, 2005, 14 INT C WORLD WID W, P864
  • [6] Blood Rebecca., 2002, WEBLOG HDB PRACTICAL
  • [7] The anatomy of a large-scale hypertextual Web search engine
    Brin, S
    Page, L
    [J]. COMPUTER NETWORKS AND ISDN SYSTEMS, 1998, 30 (1-7): : 107 - 117
  • [8] Efficient crawling through URL ordering
    Cho, J
    Garcia-Molina, H
    Page, L
    [J]. COMPUTER NETWORKS AND ISDN SYSTEMS, 1998, 30 (1-7): : 161 - 172
  • [9] CHO J, 2004, P 13 INT WORLD WID W
  • [10] DEBRA PME, 1994, P 12 INT WORLD WID W