A hybrid mapping of information science

被引:97
作者
Janssens, Frizo [1 ,2 ]
Glanzel, Wolfgang [2 ,3 ]
De Moor, Bart [1 ]
机构
[1] Katholieke Univ Leuven, Dept Elect Engn, ESAT SCD, B-3001 Leuven, Belgium
[2] Katholieke Univ Leuven, Steunpunt O&O Indicatoren, B-3001 Leuven, Belgium
[3] Hungarian Acad Sci, ISPR, Budapest, Hungary
关键词
D O I
10.1007/s11192-007-2002-7
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Previous studies have shown that hybrid clustering methods that incorporate textual content and bibliometric information can outperform clustering methods that use only one of these components. In this paper we apply a hybrid clustering method based on Fisher's inverse chi-square to integrate full-text with citations and to provide a mapping of the field of information science. We quantitatively and qualitatively asses the added value of such an integrated analysis and we investigate whether the clustering outcome is a better representation of the field by comparing with a text-only clustering and with another hybrid method based on linear combination of distance matrices. Our data set consists of almost 1000 articles and notes published in the period 2002-2004 in 5 representative journals. The optimal number of clusters for the field is 5, determined by using a combination of distance-based and stability-based methods. Term networks present the cognitive structure of the field and are complemented by the most representative publications. Three large traditional sub-disciplines, particularly, information retrieval, bibliometrics/scientometrics, and more social aspects, and two smaller clusters about patent analysis and webometrics, can be distinguished.
引用
收藏
页码:607 / 631
页数:25
相关论文
共 34 条
  • [1] [Anonymous], 2000, FDN STAT NATURAL LAN
  • [2] Baeza-Yates R.A., 1999, Modern Information Retrieval
  • [3] Batagelj V, 2002, LECT NOTES COMPUT SC, V2265, P477
  • [4] Ben-Hur Asa, 2002, Pac Symp Biocomput, P6
  • [5] Using linear algebra for intelligent information retrieval
    Berry, MW
    Dumais, ST
    OBrien, GW
    [J]. SIAM REVIEW, 1995, 37 (04) : 573 - 595
  • [6] BRAAM RR, 1991, J AM SOC INFORM SCI, V42, P252, DOI 10.1002/(SICI)1097-4571(199105)42:4<252::AID-ASI2>3.0.CO
  • [7] 2-G
  • [8] Link-based similarity measures for the classification of Web documents
    Calado, P
    Cristo, M
    Gonçalves, MA
    de Moura, ES
    Ribeiro-Neto, B
    Ziviani, N
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2006, 57 (02): : 208 - 221
  • [9] Local versus global link information in the Web
    Calado, P
    Ribeiro-Neto, B
    Ziviani, N
    Moura, E
    Silva, I
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2003, 21 (01) : 42 - 63
  • [10] COHN D, 2000, NEURAL INFORM PROCES, P13