ABCDM: An Attention-based Bidirectional CNN-RNN Deep Model for sentiment analysis

Cited by: 476

Authors:

Basiri, Mohammad Ehsan [1]

Nemati, Shahla [1]

Abdar, Moloud [2]

Cambria, Erik [3]

Acharya, U. Rajendra [4]

Affiliations:

[1] Shahrekord Univ, Dept Comp Engn, Shahrekord, Iran

[2] Deakin Univ, Inst Intelligent Syst Res & Innovat, Geelong, Vic, Australia

[3] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore

[4] Ngee Ann Polytech, Dept Elect & Comp Engn, Singapore, Singapore

Source:

FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2021 / Vol. 115

Keywords:

Sentiment analysis; Deep learning; Convolutional neural network; Long short-term memory; Attention mechanism; Classification; Text; Classifiers; Embeddings; Network; LSTM

DOI:

10.1016/j.future.2020.08.005

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods]

Discipline classification code:

081202

Abstract:

Sentiment analysis has been an active research topic in natural language processing and data mining over the last decade. Recently, deep neural network (DNN) models have been applied to sentiment analysis tasks with promising results. Among the neural architectures applied to sentiment analysis, long short-term memory (LSTM) models and their variants, such as the gated recurrent unit (GRU), have attracted increasing attention. Although these models can process sequences of arbitrary length, using them in the feature extraction layer of a DNN makes the feature space high-dimensional. Another drawback of such models is that they treat all features as equally important. To address these problems, we propose an Attention-based Bidirectional CNN-RNN Deep Model (ABCDM). By using two independent bidirectional LSTM and GRU layers, ABCDM extracts both past and future contexts by considering temporal information flow in both directions. An attention mechanism is then applied to the outputs of ABCDM's bidirectional layers to put more or less emphasis on different words. To reduce the dimensionality of the features and extract position-invariant local features, ABCDM uses convolution and pooling mechanisms. The effectiveness of ABCDM is evaluated on sentiment polarity detection, which is the most common and essential task of sentiment analysis. Experiments were conducted on five review datasets and three Twitter datasets. Comparing ABCDM with six recently proposed DNNs for sentiment analysis shows that ABCDM achieves state-of-the-art results on both long-review and short-tweet polarity classification. © 2020 Elsevier B.V. All rights reserved.
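The attention and convolution/pooling steps described in the abstract can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the dot-product scoring against a `context` vector, the function names, the toy hidden states, and the single hand-written filter are all assumptions made for illustration, and the sketch assumes the attention step rescales each time step rather than summing them, which the abstract does not specify. In the actual model, the hidden states would come from the bidirectional LSTM and GRU layers.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention_weights(hidden_states, context):
    """Score each time step by its dot product with a context vector, then normalize."""
    scores = [sum(h_i * c_i for h_i, c_i in zip(h, context)) for h in hidden_states]
    return softmax(scores)

def reweight(hidden_states, weights):
    """Scale each hidden state by its attention weight (sequence length is kept)."""
    return [[w * x for x in h] for h, w in zip(hidden_states, weights)]

def conv1d_max_pool(sequence, kernel):
    """Valid 1D convolution over time with one filter, then global max pooling."""
    k = len(kernel)
    outputs = []
    for t in range(len(sequence) - k + 1):
        window = sequence[t:t + k]
        outputs.append(sum(
            sum(w_j * x_j for w_j, x_j in zip(kernel[i], window[i]))
            for i in range(k)
        ))
    return max(outputs)

# Toy stand-in for recurrent-layer outputs: 3 time steps, 2 features each (hypothetical values).
hidden = [[0.1, 0.0], [2.0, 1.0], [0.2, 0.1]]
context = [1.0, 1.0]                 # hypothetical context vector (learned in practice)

weights = attention_weights(hidden, context)
weighted = reweight(hidden, weights)

# One filter spanning 2 time steps (hypothetical values; learned in practice).
feature = conv1d_max_pool(weighted, kernel=[[1.0, 0.0], [0.0, 1.0]])
```

Here the second time step receives the largest attention weight because its score is highest, and the max pooling collapses the convolution outputs into a single feature regardless of where in the sequence the strongest response occurs.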

Citation

Pages: 279-294

Page count: 16

