Indexing and retrieval of scientific literature

Cited by: 31
Authors
Lawrence, S [1]
Bollacker, K [1]
Giles, CL [1]
Affiliations
[1] NEC Res Inst, Princeton, NJ 08540 USA
Source
PROCEEDINGS OF THE EIGHTH INTERNATIONAL CONFERENCE ON INFORMATION KNOWLEDGE MANAGEMENT, CIKM'99 | 1999
DOI
10.1145/319950.319970
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The web has greatly improved access to scientific literature. However, scientific articles on the web are largely disorganized, with research articles being spread across archive sites, institution sites, journal sites, and researcher home-pages. No index covers all of the available literature, and the major web search engines typically do not index the content of Postscript/PDF documents at all. This paper discusses the creation of digital libraries of scientific literature on the web, including the efficient location of articles, full-text indexing of the articles, autonomous citation indexing, information extraction, display of query-sensitive summaries and citation context, hubs and authorities computation, similar document detection, user profiling, distributed error correction, graph analysis, and detection of overlapping documents. The software for the system is available at no cost for non-commercial use.
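Among the techniques listed in the abstract, the hubs and authorities computation is easy to illustrate on a small citation graph. The sketch below assumes the standard HITS-style power iteration; it is not taken from the paper's software, and the function name `hits`, the `iterations` parameter, and the paper identifiers are hypothetical.

```python
from math import sqrt


def hits(citations, iterations=50):
    """Illustrative hubs-and-authorities iteration (an assumption, not the paper's code).

    `citations` maps each paper identifier to the list of papers it cites.
    Returns two dicts: hub scores and authority scores.
    """
    # Collect every paper that appears as a citing or cited node.
    nodes = set(citations)
    for cited in citations.values():
        nodes.update(cited)

    hub = {n: 1.0 for n in nodes}
    auth = {n: 1.0 for n in nodes}

    for _ in range(iterations):
        # Authority score: sum of the hub scores of the papers citing this one.
        auth = {n: 0.0 for n in nodes}
        for src, cited in citations.items():
            for dst in cited:
                auth[dst] += hub[src]
        norm = sqrt(sum(v * v for v in auth.values())) or 1.0
        auth = {n: v / norm for n, v in auth.items()}

        # Hub score: sum of the authority scores of the papers this one cites.
        hub = {n: sum(auth[d] for d in citations.get(n, ())) for n in nodes}
        norm = sqrt(sum(v * v for v in hub.values())) or 1.0
        hub = {n: v / norm for n, v in hub.items()}

    return hub, auth


# Hypothetical three-paper graph: a survey cites two articles, one of which cites the other.
example = {"survey": ["article_a", "article_b"], "article_a": ["article_b"], "article_b": []}
hub_scores, auth_scores = hits(example)
```

In this toy graph the most-cited paper ends up with the highest authority score and the survey with the highest hub score, which is the intuition behind using the two scores to rank papers in a citation index.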
Pages: 139-146
Page count: 8