Author cocitation analysis and Pearson's r

Cited by: 110
Authors
White, HD [1]
Affiliations
[1] Drexel Univ, Coll Informat Sci & Technol, Philadelphia, PA 19104 USA
Source
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY | 2003, Vol. 54, No. 13
DOI
10.1002/asi.10325
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
In their article "Requirements for a cocitation similarity measure, with special reference to Pearson's correlation coefficient," Ahlgren, Jarneving, and Rousseau (AJ&R) fault traditional author cocitation analysis (ACA) for using Pearson's r as a measure of similarity between authors because it fails two tests of stability of measurement. The instabilities arise when values of r are recalculated after a first coherent group of authors has been augmented by a second coherent group with whom the first has little or no cocitation. However, AJ&R neither cluster nor map their data to demonstrate how fluctuations in r will mislead the analyst, and the problem they pose is remote from both theory and practice in traditional ACA. By entering their own values of r into multidimensional scaling and clustering routines, I show that, despite r's fluctuations, clusters based on it are much the same for the combined groups as for the separate groups. The combined groups when mapped appear as polarized clumps of points in two-dimensional space, confirming that differences between the groups have become much more important than differences within the groups, an accurate portrayal of what has happened to the data. Moreover, r produces clusters and maps very like those based on other coefficients that AJ&R mention as possible replacements, such as a cosine similarity measure or a chi-square dissimilarity measure. Thus, r performs well enough for the purposes of ACA. Accordingly, I argue that qualitative information revealing why authors are cocited is more important than the cautions proposed in the AJ&R critique. I include notes on topics such as handling the diagonal in author cocitation matrices, lognormalizing data, and testing r for significance.
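The instability at issue can be illustrated with a minimal sketch. The cocitation counts below are invented for illustration and are not taken from AJ&R or from this article; the function names `pearson_r` and `cosine` are likewise hypothetical. The sketch compares two authors' cocitation profiles before and after the author set is augmented by a second group with whom neither has any cocitation, which appends zeros to both profiles.

```python
import math

def pearson_r(x, y):
    """Pearson's correlation coefficient between two equal-length profiles."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

def cosine(x, y):
    """Cosine similarity between two equal-length profiles."""
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    return dot / (norm_x * norm_y)

# Invented cocitation counts of two authors with four others in their own group.
author_a = [5, 1, 4, 2]
author_b = [2, 4, 1, 5]

# Augment with a second group of four authors; assume zero cocitation with them.
author_a_aug = author_a + [0, 0, 0, 0]
author_b_aug = author_b + [0, 0, 0, 0]

r_before = pearson_r(author_a, author_b)
r_after = pearson_r(author_a_aug, author_b_aug)
cos_before = cosine(author_a, author_b)
cos_after = cosine(author_a_aug, author_b_aug)

print("Pearson r before/after:", r_before, r_after)
print("Cosine    before/after:", cos_before, cos_after)
```

With these invented counts, r changes sign once the shared zeros are appended, while the cosine is unaffected because zeros add nothing to the dot product or the norms. The sketch shows only the fluctuation in the coefficient; it does not test the abstract's claim that clusters and maps built from r remain much the same.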
Pages: 1250-1259
Page count: 10
Related papers
24 in total
  • [1] Requirements for a cocitation similarity measure, with special reference to Pearson's correlation coefficient
    Ahlgren, P
    Jarneving, B
    Rousseau, R
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2003, 54 (06): 550-560
  • [2] BAYER AE, 1990, J AM SOC INFORM SCI, V41, P444, DOI 10.1002/(SICI)1097-4571(199009)41:6<444::AID-ASI12>3.0.CO;2-J
  • [3] Borgatti S.P., 2002, Harv MA: analytic Technol, V6, P12
  • [4] BORGATTI SP, 2000, WORKSH SUNB 20 INT S
  • [5] Davison M.L., 1983, Multidimensional scaling
  • [6] Eom SB, 1996, J AM SOC INFORM SCI, V47, P941, DOI 10.1002/(SICI)1097-4571(199612)47:12<941::AID-ASI7>3.0.CO;2-2
  • [7] Everitt B, 1974, CLUSTER ANAL
  • [8] Griffith B., 1980, KEY PAPERS INFORMATI, pvi