Automatic resource compilation by analyzing hyperlink structure and associated text

Cited by: 175
Authors
Chakrabarti, S
Dom, B
Raghavan, P
Rajagopalan, S
Gibson, D
Kleinberg, J
Affiliations
[1] IBM Corp, Almaden Res Ctr K53, San Jose, CA 95120 USA
[2] Univ Calif Berkeley, Div Comp Sci, Berkeley, CA 94720 USA
[3] Cornell Univ, Dept Comp Sci, Ithaca, NY 14853 USA
Source
COMPUTER NETWORKS AND ISDN SYSTEMS | 1998 / Vol. 30 / Issue 1-7
Keywords
search; taxonomies; link analysis; anchor text; information retrieval;
DOI
10.1016/S0169-7552(98)00087-7
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812 ;
Abstract
We describe the design, prototyping and evaluation of ARC, a system for automatically compiling a list of authoritative Web resources on any (sufficiently broad) topic. The goal of ARC is to compile resource lists similar to those provided by Yahoo! or Infoseek. The fundamental difference is that these services construct lists either manually or through a combination of human and automated effort, while ARC operates fully automatically. We describe the evaluation of ARC, Yahoo!, and Infoseek resource lists by a panel of human users. This evaluation suggests that the resources found by ARC frequently fare almost as well as, and sometimes better than, lists of resources that are manually compiled or classified into a topic. We also provide examples of ARC resource lists for the reader to examine. (C) 1998 Published by Elsevier Science B.V. All rights reserved.
Pages: 65-74
Page count: 10