Fractals text mining using bibliometrics and database tomography

被引:20
作者
Kostoff, RN
Shlesinger, MF
Malpohl, G
机构
[1] Off Naval Res, Arlington, VA 22217 USA
[2] Univ Karlsruhe, D-76128 Karlsruhe, Germany
关键词
fractals; self-similarity; self-organized criticality; multi-fractal; text-mining; bibliometrics; computational linguistics;
D O I
10.1142/S0218348X04002343
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Database Tomography (DT) is a textual database analysis system consisting of two major components: (1) algorithms for extracting multi-word phrase frequencies and phrase proximities (physical closeness of the multi-word technical phrases) from any type of large textual database, to augment (2) interpretative capabilities of the expert human analyst. DT was used to obtain technical intelligence from a Fractals database derived from the Science Citation Index/Social Science Citation Index (SCI). Phrase frequency analysis by the technical domain experts provided the pervasive technical themes of the Fractals database, and the phrase proximity analysis provided the relationships among the pervasive technical themes. Bibliometric analysis of the Fractals literature supplemented the DT results with author/journal/institution publication and citation data.
引用
收藏
页码:1 / 16
页数:16
相关论文
共 29 条
[1]  
[Anonymous], COMPETITIVE INTELLIG
[2]  
CUTTING DR, 1992, SIGIR 92 : PROCEEDINGS OF THE FIFTEENTH ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, P318
[3]   HISTORY OF CITATION INDEXES FOR CHEMISTRY - A BRIEF REVIEW [J].
GARFIELD, E .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1985, 25 (03) :170-174
[4]  
Guha S., 1998, SIGMOD Record, V27, P73, DOI 10.1145/276305.276312
[5]  
HEARST MA, 1998, NATURAL LANGUAGE INF
[6]  
*ISI, 2002, SCI SCI CIT IND
[7]   Chameleon: Hierarchical clustering using dynamic modeling [J].
Karypis, G ;
Han, EH ;
Kumar, V .
COMPUTER, 1999, 32 (08) :68-+
[8]  
Kostoff R., 1993, COMPET INTELL REV, V4, DOI [10.1002/cir.3880040109, DOI 10.1002/CIR.3880040109]
[9]   Electrochemical power text mining using bibliometrics and database tomography [J].
Kostoff, RN ;
Tshiteya, R ;
Pfeil, KM ;
Humenik, JA .
JOURNAL OF POWER SOURCES, 2002, 110 (01) :163-176
[10]   Text mining using database tomography and bibliometrics: A review [J].
Kostoff, RN ;
Toothman, DR ;
Eberhart, HJ ;
Humenik, JA .
TECHNOLOGICAL FORECASTING AND SOCIAL CHANGE, 2001, 68 (03) :223-253