Automatic text decomposition and structuring

被引:48
作者
Salton, G
Allan, J
Singhal, A
机构
[1] Department of Computer Science, Cornell University, Ithaca
关键词
D O I
10.1016/S0306-4573(96)85001-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sophisticated text similarity measurements are used to determine relationships between natural-language texts and text excerpts. The resulting linked hypertext maps can be decomposed into text segments and text themes, and these decompositions are usable to identify different text types and text structures, leading to improved text access and utilization. Examples of text decomposition are given for expository and non-expository texts.
引用
收藏
页码:127 / 138
页数:12
相关论文
共 12 条
[1]  
HARMAN DK, 1995, NIST500215 SPEC PUBL
[2]  
Rocchio J. J., 1971, SMART SYSTEM EXPT AU
[3]   GLOBAL TEXT MATCHING FOR INFORMATION-RETRIEVAL [J].
SALTON, G ;
BUCKLEY, C .
SCIENCE, 1991, 253 (5023) :1012-1015
[4]   TERM-WEIGHTING APPROACHES IN AUTOMATIC TEXT RETRIEVAL [J].
SALTON, G ;
BUCKLEY, C .
INFORMATION PROCESSING & MANAGEMENT, 1988, 24 (05) :513-523
[5]   DEVELOPMENTS IN AUTOMATIC TEXT RETRIEVAL [J].
SALTON, G .
SCIENCE, 1991, 253 (5023) :974-980
[6]   AUTOMATIC-ANALYSIS, THEME GENERATION, AND SUMMARY OF MACHINE-READABLE TEXTS [J].
SALTON, G ;
ALLAN, J ;
BUCKLEY, C ;
SINGHAL, A .
SCIENCE, 1994, 264 (5164) :1421-1426
[7]   AUTOMATIC STRUCTURING AND RETRIEVAL OF LARGE TEXT FILES [J].
SALTON, G ;
ALLAN, J ;
BUCKLEY, C .
COMMUNICATIONS OF THE ACM, 1994, 37 (02) :97-108
[8]  
Salton G., 1971, SMART RETRIEVAL SYST
[9]  
SALTON G, 1991, 14TH P ANN INT ACM S, P21
[10]  
SALTON G, 1994, TEXT RETRIEVAL USING