Database tomography for information retrieval

被引:63
作者
Kostoff, RN
Eberhart, HJ
Toothman, DR
机构
[1] USN, CTR AIR WARFARE, CHINA LAKE, CA USA
[2] DSTI INC, ROCKVILLE, MD USA
关键词
D O I
10.1177/016555159702300404
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Database tomography is an information extraction and analysis system which operates on textual databases. Its primary use to date has been tb identify pervasive technical thrusts and themes, and the interrelationships among these themes and sub-themes, which are intrinsic to large textual databases. Its two main algorithmic components are multiword phrase frequency analysis and phrase proximity analysis. This paper shows how database tomography can be used to enhance information retrieval from large textual databases through the newly developed process of simulated nucleation. The principles of simulated nucleation are presented, and the advantages for information retrieval are delineated. An application is described of developing, from Science Citation Index and Engineering Compendex, a database of journal articles focused on near-Earth space science and technology.
引用
收藏
页码:301 / 311
页数:11
相关论文
共 31 条
[21]  
LESK M, 1969, AM DOCUMENTATION, V20
[22]  
MACROBERTS M, 1996, SCIENTOMETRICS, V36
[23]  
MARON M, 1960, J ACM, V7
[24]  
ROBERTSON SE, 1976, J AM SOC INFORMATION, V27
[25]  
Rocchio J. J., 1971, SMART SYSTEM EXPT AU
[26]  
SALTON G, 1990, J AM SOC INFORMATION, V41
[27]  
SALTON G, 1985, J AM SOC INFORMATION, V36
[28]  
SMEATON A, 1983, COMPUTER J, V26
[29]  
SPINK A, 1995, INFORMATION PROCESSI, V31
[30]  
STILES H, 1961, J ACM, V8