Sentiment visualization and classification via semi-supervised nonlinear dimensionality reduction

被引:57
作者
Kim, Kyoungok [1 ]
Lee, Jaewook [2 ]
机构
[1] POSTECH, Dept Ind & Management Engn, Pohang 790784, Kyungbuk, South Korea
[2] Seoul Natl Univ, Dept Ind Engn, Seoul 151744, South Korea
基金
新加坡国家研究基金会;
关键词
Text visualization; Semi-supervised dimensionality reduction; Laplacian eigenmaps; Sentiment classification; MANIFOLD; EIGENMAPS; FRAMEWORK;
D O I
10.1016/j.patcog.2013.07.022
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment analysis, which detects the subjectivity or polarity of documents, is one of the fundamental tasks in text data analytics. Recently, the number of documents available online and offline is increasing dramatically, and preprocessed text data have more features. This development makes analysis more complex to be analyzed effectively. This paper proposes a novel semi-supervised Laplacian eigenmap (SS-LE). The SS-LE removes redundant features effectively by decreasing detection errors of sentiments. Moreover, it enables visualization of documents in perceptible low dimensional embedded space to provide a useful tool for text analytics. The proposed method is evaluated using multi-domain review data set in sentiment visualization and classification by comparing other dimensionality reduction methods. SS-LE provides a better similarity measure in the visualization result by separating positive and negative documents properly. Sentiment classification models trained over reduced data by SS-LE show higher accuracy. Overall, experimental results suggest that SS-LE has the potential to be used to visualize documents for the ease of analysis and to train a predictive model in sentiment analysis. SS-LE can also be applied to any other partially annotated text data sets. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:758 / 768
页数:11
相关论文
共 62 条
[1]  
[Anonymous], 2010, P INT C LANG RES EV
[2]  
[Anonymous], 1978, Multidimensional scaling
[3]  
[Anonymous], 1973, Pattern Classification and Scene Analysis
[4]   Predicting consumer sentiments from online text [J].
Bai, Xue .
DECISION SUPPORT SYSTEMS, 2011, 50 (04) :732-742
[5]  
Belkin M, 2002, ADV NEUR IN, V14, P585
[6]  
Belkin M, 2006, J MACH LEARN RES, V7, P2399
[7]  
Blitzer J., 2007, BIOGRAPHIES BOLLYWOO, P187
[8]  
Blitzer John., 2008, NEURAL INFORM PROCES
[9]  
Chen J, 2009, J MACH LEARN RES, V10, P1989
[10]   A neural network based approach for sentiment classification in the blogosphere [J].
Chen, Long-Sheng ;
Liu, Cheng-Hsiang ;
Chiu, Hui-Ju .
JOURNAL OF INFORMETRICS, 2011, 5 (02) :313-322