Unsupervised topic discovery in micro-blogging networks

被引:33
作者
Vicient, Carlos [1 ]
Moreno, Antonio [1 ]
机构
[1] Univ Rovira & Virgili, ITAKA, Dept Engn Informat & Matemat, Ave Paisos Catalans 26, E-43007 Tarragona, Catalonia, Spain
关键词
Knowledge-based systems; Semantic Web; Micro-blogging; Twitter; TWITTER;
D O I
10.1016/j.eswa.2015.04.014
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
Unsupervised automatic topic discovery in micro-blogging social networks is a very challenging task, as it involves the analysis of very short, noisy, ungrammatical and uncontextual messages. Most of the current approaches to this problem are basically syntactic, as they focus either on the use of statistical techniques or on the analysis of the co-occurrences between the terms. This paper presents a novel topic discovery methodology, based on the mapping of hashtags to WordNet terms and their posterior clustering, in which semantics plays a centre role. The paper also presents a detailed case study in the field of Oncology, in which the discovered topics are thoroughly compared to a golden standard, showing promising results. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:6472 / 6485
页数:14
相关论文
共 62 条
[1]
Alfayez Reem, 2013, Third International Conference on Innovative Computing Technology (INTECH 2013), P545, DOI 10.1109/INTECH.2013.6653644
[2]
[Anonymous], P 10 INT WORKSHOP MU, DOI [DOI 10.1145/1814245.1814249, 10.1145/1814245.1814249]
[3]
[Anonymous], 545 EECS
[4]
[Anonymous], 2011, P ACM SIGIR SWSM
[5]
[Anonymous], 2014, OPENACCESS SERIES IN
[6]
[Anonymous], 2014, CEUR Workshop Proceed
[7]
[Anonymous], 2011, P 20 ACM INT C INF K, DOI 10.1145/2063576.2063726
[8]
[Anonymous], 1967, P 5 BERK S MATH STAT
[9]
[Anonymous], 2009, P 17 ACM SIGSP INT C
[10]
Archambault D., 2013, ABS13063 CORR