On the normalization and visualization of author co-citation data: Salton's cosine versus the Jaccard index

Times cited: 158
Author
Leydesdorff, Loet [1]
Affiliation
[1] Amsterdam Sch Commun Res, NL-1012 CX Amsterdam, Netherlands
Source
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY | 2008 / Vol. 59 / No. 01
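DOI
10.1002/asi.20732
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 [Computer science and technology];
Abstract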
The debate about which similarity measure one should use for the normalization in the case of Author Co-citation Analysis (ACA) is further complicated when one distinguishes between the symmetrical co-citation (or, more generally, co-occurrence) matrix and the underlying asymmetrical citation (occurrence) matrix. In the Web environment, the approach of retrieving original citation data is often not feasible. In that case, one should use the Jaccard index, but preferentially after adding the number of total citations (i.e., occurrences) on the main diagonal. Unlike Salton's cosine and the Pearson correlation, the Jaccard index abstracts from the shape of the distributions and focuses only on the intersection and the sum of the two sets. Since the correlations in the co-occurrence matrix may be spurious, this property of the Jaccard index can be considered as an advantage in this case.
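The distinction drawn in the abstract can be illustrated with a minimal sketch. It assumes a small, made-up binary occurrence matrix (citing documents by cited authors) and the standard set-based form of the Jaccard index, intersection divided by the sum of the two sets minus the intersection. The data and the helper names `cooccurrence`, `jaccard`, and `cosine` are hypothetical and are not taken from the paper.

```python
from math import sqrt

# Hypothetical occurrence matrix: rows are citing documents,
# columns are cited authors A, B, C (1 = the document cites the author).
occurrence = [
    [1, 1, 0],
    [1, 1, 0],
    [1, 0, 1],
    [0, 1, 1],
    [1, 0, 0],
]

def cooccurrence(occ):
    """Symmetrical co-occurrence matrix. For binary data the main diagonal
    holds each author's total number of occurrences."""
    n = len(occ[0])
    return [[sum(row[i] * row[j] for row in occ) for j in range(n)]
            for i in range(n)]

def jaccard(cooc, i, j):
    """Jaccard index from the co-occurrence matrix alone:
    intersection / (sum of the two sets - intersection)."""
    union = cooc[i][i] + cooc[j][j] - cooc[i][j]
    return cooc[i][j] / union if union else 0.0

def cosine(occ, i, j):
    """Salton's cosine between two columns of the occurrence matrix."""
    dot = sum(row[i] * row[j] for row in occ)
    norm_i = sqrt(sum(row[i] ** 2 for row in occ))
    norm_j = sqrt(sum(row[j] ** 2 for row in occ))
    return dot / (norm_i * norm_j) if norm_i and norm_j else 0.0

cooc = cooccurrence(occurrence)
for i, j, label in [(0, 1, "A-B"), (0, 2, "A-C"), (1, 2, "B-C")]:
    print(label, round(jaccard(cooc, i, j), 3), round(cosine(occurrence, i, j), 3))
```

In this sketch the Jaccard values need only the co-occurrence counts and the diagonal totals, whereas the cosine is computed from the underlying occurrence data, which is the situation the abstract describes as often not feasible in the Web environment.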
Pages: 77-85
Number of pages: 9