The differences between latent topics in abstracts and citation contexts of citing papers

Cited by: 47
Authors
Liu, Shengbo [1]
Chen, Chaomei [2]
Affiliations
[1] Dalian Univ Technol, Wiselab, Dalian 116023, Peoples R China
[2] Drexel Univ, Coll Informat Sci & Technol, Philadelphia, PA 19104 USA
Source
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY | 2013 / Vol. 64 / Issue 03
Keywords
citation analysis; COMPUTER RECOGNITION; TEXT; BIBLIOMETRICS; STATEMENTS; INFERENCE; TOOL;
DOI
10.1002/asi.22771
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Although it is commonly expected that the citation context of a reference is likely to provide more detailed and direct information about the nature of a citation, few studies in the literature have specifically addressed the extent to which the information in different parts of a scientific publication differs. Do abstracts tend to use conceptually broader terms than sentences in a citation context in the body of a publication? In this article, we propose a method to analyze and compare latent topics in scientific publications, in particular, from abstracts of papers that cited a target reference and from sentences that cited the target reference. We conducted an experiment and applied topical modeling techniques to full-text papers in eight biomedicine journals. Topics derived from the two sources are compared in terms of their similarities and broad-narrow relationships defined based on information entropy. The results show that abstracts and citation contexts are characterized by distinct sets of topics with moderate overlaps. Furthermore, the results confirm that topics from abstracts of citing papers have broader terms than topics from citation contexts formed by citing sentences. The method and the findings could be used to enhance and extend the current methodologies for research evaluation and citation evaluation.
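The entropy-based broad-narrow comparison mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact definition used in the paper, so the sketch assumes that a topic whose term distribution has higher Shannon entropy is the broader one. The two term distributions and the names `shannon_entropy`, `abstract_topic`, and `context_topic` are hypothetical and are not taken from the paper's data.

```python
import math


def shannon_entropy(probs):
    """Return the Shannon entropy (in bits) of a discrete distribution.

    The weights are normalised first, so they need not sum exactly to 1.
    """
    weights = [p for p in probs if p > 0]
    total = sum(weights)
    return -sum((w / total) * math.log2(w / total) for w in weights)


# Hypothetical topic-term distributions (illustrative values only).
# A topic from abstracts: probability spread evenly over general terms.
abstract_topic = {"gene": 0.25, "protein": 0.25, "cell": 0.25, "disease": 0.25}
# A topic from citation contexts: probability concentrated on one specific term.
context_topic = {"p53": 0.7, "mutation": 0.1, "apoptosis": 0.1, "binding": 0.1}

h_abstract = shannon_entropy(abstract_topic.values())
h_context = shannon_entropy(context_topic.values())

# Assumption: higher entropy is read as "broader" in this sketch.
broader = "abstract" if h_abstract > h_context else "citation context"
```

In this toy case the evenly spread abstract topic has the higher entropy, which matches the direction of the finding reported in the abstract, but the numbers themselves carry no empirical weight.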
Pages: 627-639
Page count: 13