AUTOMATIC-ANALYSIS, THEME GENERATION, AND SUMMARY OF MACHINE-READABLE TEXTS

被引:86
作者
SALTON, G
ALLAN, J
BUCKLEY, C
SINGHAL, A
机构
[1] Department of Computer Science, Cornell University, Ithaca
关键词
D O I
10.1126/science.264.5164.1421
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Vast amounts of text material are now available in machine-readable form for automatic processing. Here, approaches are outlined for manipulating and accessing texts in arbitrary subject areas in accordance with user needs. In particular, methods are given for determining text themes, traversing texts selectively, and extracting summary statements that reflect text content.
引用
收藏
页码:1421 / 1426
页数:6
相关论文
共 36 条
[1]  
Al-hawamdeh S., 1989, Electronic Publishing: Origination, Dissemination and Design, V2, P179
[2]  
Andersen M. H., 1989, Hypermedia, V1, P255
[3]  
Bernstein M., 1990, Hypertext: Concepts, Systems and Applications. Proceedings of the First European Conference on Hypertext, P212
[4]  
BERNSTEIN M, 1991, P HYPERTEXT 91, P246
[5]  
BOLTER J., 1991, WRITING SPACE COMPUT
[6]   STRUCTURAL-ANALYSIS OF HYPERTEXTS - IDENTIFYING HIERARCHIES AND USEFUL METRICS [J].
BOTAFOGO, RA ;
RIVLIN, E ;
SHNEIDERMAN, B .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 1992, 10 (02) :142-180
[7]  
Buckley C., 1993, NIST SPECIAL PUBLICA, P59
[8]  
BUCKLEY C, IN PRESS NIST SPECIA
[9]  
Chignell M. H., 1991, Hypermedia, V3, P187
[10]   CLUSTERING LARGE FILES OF DOCUMENTS USING SINGLE-LINK METHOD [J].
CROFT, WB .
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1977, 28 (06) :341-344