Twitter推文与情感词典SentiWordNet匹配算法研究

被引:2
作者
易顺明 [1 ]
周洪斌 [1 ]
周国栋 [2 ]
机构
[1] 沙洲职业工学院电子信息工程系
[2] 苏州大学计算机科学与技术学院
关键词
推文; 情感分类; SentiWordNet; 匹配算法;
D O I
暂无
中图分类号
TP391.1 [文字信息处理];
学科分类号
摘要
在Twitter情感分类研究中,经常会采用将推文中的单词匹配情感词典中的同义词条查找相应情感值的方法 .但推文书写比较随意,包含许多俚语、缩写和特殊符号,导致许多词汇与情感词典中的词条无法匹配,匹配率不高直接影响推文的情感分类性能.针对Twitter的语言特征,提出了一套Twitter推文与情感词典SentiWordNet的匹配算法.该算法首先通过对推文内容进行数据清洗、替代处理、词性标注和词形还原等预处理,增加了命名实体识别、对hashtags内容的断词处理、基于Word Clusters的否定句处理和词组匹配等方法 .实验结果表明,采用此方法的匹配率可达90%以上.
引用
收藏
页码:41 / 47+53 +53
页数:8
相关论文
共 20 条
  • [1] Thumbs up? sentiment classification using machine learning techniques. Bo Pang,Lillian Lee,Shivakumar Vaithyanathan. Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing (EMNLP) . 2002
  • [2] Towards Answering Opinion Questions:Separating Facts from Opinions and Identifying the Polarity of Opinion Sentences. YU H,HATZIVASSILOGLOU V. Proceedings of the EMNLP-03 . 2003
  • [3] Employing Personal/Impersonal Views in Supervised and Semi-supervised Sentiment Classification. Li Shoushan,Huang Chu-Ren,Zhou Guodong,et al. Proceedings of ACL-10 . 2010
  • [4] Mining and Summary Customer Reviews. Hu M,Liu B. KDD . 2004
  • [5] ECNUCS:A surface information based system description of sentiment analysis in Twitter in the Sem Eval-2013 (Task 2). ZHU T,ZHANG F,LAN M. Proceedings of Sem Eval 2013 . 2013
  • [6] Development and Use of a Gold Standard Data Set for Subjectivity Classifications. J. Wiebe,R. Bruce,T. O’’Hara. Proceedings of the ACL -99 . 1999
  • [7] Improved part-of-speech tagging for online conversational text with word clusters. Olutobi Owoputi,Brendan O’’Connor,Chris Dyer,et al. Proceedings of NAACLHLT 2013 . 2013
  • [8] Thumbs up or thumbs down?Semantic orientation applied to unsupervised classification of reviews. P.Turney. Proceedings of the ACL . 2002
  • [9] Determining term subjectivity and term orientation for opinion mining. A. Esuli,F. Sebastiani. Proceedings of EACL . 2006
  • [10] Mining the peanut gallery:Opinion extraction and semantic classification of product reviews. Kushal Dave,Steve Lawrence,David M.Pennock. Proceedings of WWW 2003 . 2003