A novel sentiment analysis of social networks using supervised learning

被引:43
作者
Anjaria, Malhar [1 ]
Guddeti, Ram Mohana Reddy [1 ]
机构
[1] Natl Inst Technol Karnataka, Dept Informat Technol, Mangalore 575025, India
关键词
Electoral prediction; Microblogs; Opinion mining; Sentiment analysis; Social intelligence; Social network analysis; Supervised machine learning; Twitter; Twitter analytics;
D O I
10.1007/s13278-014-0181-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Online microblog-based social networks have been used for expressing public opinions through short messages. Among popular microblogs, Twitter has attracted the attention of several researchers in areas like predicting the consumer brands, democratic electoral events, movie box office, popularity of celebrities, the stock market, etc. Sentiment analysis over a Twitter-based social network offers a fast and efficient way of monitoring the public sentiment. This paper studies the sentiment prediction task over Twitter using machine-learning techniques, with the consideration of Twitter-specific social network structure such as retweet. We also concentrate on finding both direct and extended terms related to the event and thereby understanding its effect. We employed supervised machine-learning techniques such as support vector machines (SVM), Naive Bayes, maximum entropy and artificial neural networks to classify the Twitter data using unigram, bigram and unigram + bigram (hybrid) feature extraction model for the case study of US Presidential Elections 2012 and Karnataka State Assembly Elections (India) 2013. Further, we combined the results of sentiment analysis with the influence factor generated from the retweet count to improve the prediction accuracy of the task. Experimental results demonstrate that SVM outperforms all other classifiers with maximum accuracy of 88% in predicting the outcome of US Elections 2012, and 68% for Indian State Assembly Elections 2013.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 41 条
[1]   Communication Dynamics in Twitter During Political Campaigns: The Case of the 2011 Spanish National Election [J].
Aragon, Pablo ;
Kappler, Karolin Eva ;
Kaltenbrunner, Andreas ;
Laniado, David ;
Volkovich, Yana .
POLICY AND INTERNET, 2013, 5 (02) :183-206
[2]  
Asur S., 2010, Proceedings 2010 IEEE/ACM International Conference on Web Intelligence-Intelligent Agent Technology (WI-IAT), P492, DOI 10.1109/WI-IAT.2010.63
[3]  
Bakliwal A., 2013, P WORKSH LANG SOC ME, P49
[4]  
Barbosa Luciano, 2010, P COLING
[5]  
Berger AL, 1996, COMPUT LINGUIST, V22, P39
[6]  
Bermingham Adam, 2011, P WORKSHOP SENTIMENT, P2, DOI 11-3700/.
[7]  
Bollen J, 2009, P 5 INT AAAI C WEBL
[8]  
Boutet A, 2013, ASS ADVANCEMENT ARTI
[9]   A neural network based approach for sentiment classification in the blogosphere [J].
Chen, Long-Sheng ;
Liu, Cheng-Hsiang ;
Chiu, Hui-Ju .
JOURNAL OF INFORMETRICS, 2011, 5 (02) :313-322
[10]  
Cozma R., 2011, 61 ANN C INT COMM AS