Text Classification Research with Attention-based Recurrent Neural Networks

被引:96
作者
Du, C. [1 ]
Huang, L. [1 ]
机构
[1] Beijing Jiaotong Univ, Sch Econ & Management, Beijing 100044, Peoples R China
关键词
machine learning; text classification; attention mechanism; bidirectional RNN; word vector;
D O I
10.15837/ijccc.2018.1.3142
中图分类号
TP [自动化技术、计算机技术];
学科分类号
080201 [机械制造及其自动化];
摘要
Text classification is one of the principal tasks of machine learning. It aims to design proper algorithms to enable computers to extract features and classify texts automatically. In the past, this has been mainly based on the classification of keywords and neural network semantic synthesis classification. The former emphasizes the role of keywords, while the latter focuses on the combination of words between roles. The method proposed in this paper considers the advantages of both methods. It uses an attention mechanism to learn weighting for each word. Under the setting, key words will have a higher weight, and common words will have lower weight. Therefore, the representation of texts not only considers all words, but also pays more attention to key words. Then we feed the feature vector to a softmax classifier. At last, we conduct experiments on two news classification datasets published by NLPCC2014 and Reuters, respectively. The proposed model achieves F-values by 88.5% and 51.8% on the two datasets. The experimental results show that our method outperforms all the traditional baseline systems.
引用
收藏
页码:50 / 61
页数:12
相关论文
共 18 条
[1]
[Anonymous], 2013, P 17 C COMPUTATIONAL, DOI DOI 10.1007/BF02579642
[2]
[Anonymous], 2012, Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
[3]
[Anonymous], 2013, UAI 2013
[4]
Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
[5]
Chung JY, 2015, PR MACH LEARN RES, V37, P2067
[6]
Framewise phoneme classification with bidirectional LSTM and other neural network architectures [J].
Graves, A ;
Schmidhuber, J .
NEURAL NETWORKS, 2005, 18 (5-6) :602-610
[7]
Hua L., 2007, J CHINESE INFORM PRO, V21, P34
[8]
Hyperspectral Image Classification Using Deep Pixel-Pair Features [J].
Li, Wei ;
Wu, Guodong ;
Zhang, Fan ;
Du, Qian .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (02) :844-853
[9]
Mikolov T., 2013, P 2013 C N AM CHAPT
[10]
Mikolov T., 2013, Adv Neural Inf Process Syst, P26, DOI DOI 10.48550/ARXIV.1310.4546