Improving the accuracy of co-citation clustering using full text

Cited by: 105
Authors
Boyack, Kevin W. [1 ]
Small, Henry [2 ]
Klavans, Richard [2 ]
Affiliations
[1] SciTech Strategies Inc, Albuquerque, NM 87122 USA
[2] SciTech Strategies Inc, Berwyn, PA 19312 USA
Source
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY | 2013 / Vol. 64 / No. 9
Keywords
citation analysis; citation networks; full text databases; CITATION; ARTICLES;
DOI
10.1002/asi.22896
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Historically, co-citation models have been based only on bibliographic information. Full-text analysis offers the opportunity to significantly improve the quality of the signals upon which these co-citation models are based. In this work we study the effect of reference proximity on the accuracy of co-citation clusters. Using a corpus of 270,521 full text documents from 2007, we compare the results of traditional co-citation clustering using only the bibliographic information to results from co-citation clustering where proximity between reference pairs is factored into the pairwise relationships. We find that accounting for reference proximity from full text can increase the textual coherence (a measure of accuracy) of a co-citation cluster solution by up to 30% over the traditional approach based on bibliographic information.
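As a rough illustration of the idea in the abstract, the sketch below contrasts traditional co-citation counting (one count per citing document for each reference pair) with a proximity-weighted variant. It is not the authors' implementation: the function names, the use of character offsets as the distance measure, and the thresholds in `step_weight` are all assumptions made for this example.

```python
from collections import defaultdict
from itertools import combinations


def cocitation_weights(docs, proximity_weight=None):
    """Sum pairwise co-citation weights over all citing documents.

    docs: iterable of lists of (reference_id, char_offset) in-text citations.
    proximity_weight: maps a character distance to a weight; None gives the
    traditional count of 1 per co-citing document.
    """
    totals = defaultdict(float)
    for citations in docs:
        # Collect every offset at which each reference is cited.
        offsets = defaultdict(list)
        for ref, pos in citations:
            offsets[ref].append(pos)
        for a, b in combinations(sorted(offsets), 2):
            if proximity_weight is None:
                w = 1.0
            else:
                # Use the closest pair of mentions of a and b.
                d = min(abs(p - q) for p in offsets[a] for q in offsets[b])
                w = proximity_weight(d)
            totals[(a, b)] += w
    return dict(totals)


def step_weight(distance):
    # Hypothetical thresholds, chosen only for illustration.
    if distance <= 100:
        return 4.0
    if distance <= 1000:
        return 2.0
    return 1.0


# Two toy citing documents with in-text citation offsets (made-up data).
docs = [
    [("A", 10), ("B", 40), ("C", 5000)],
    [("A", 200), ("C", 900)],
]
traditional = cocitation_weights(docs)
weighted = cocitation_weights(docs, step_weight)
```

In this toy data, references A and B are cited close together in the first document, so the weighted variant ranks the A-B pair above A-C, whereas the traditional count ranks A-C higher because two documents co-cite it. Either set of pair weights could then be passed to a clustering step.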
Pages: 1759-1767
Page count: 9