Deep Pyramid Convolutional Neural Networks for Text Categorization

被引:500
作者
Johnson, Rie [1 ]
Zhang, Tong [2 ]
机构
[1] RJ Res Consulting, Tarrytown, NY 10591 USA
[2] Tencent AI Lab, Shenzhen, Peoples R China
来源
PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1 | 2017年
关键词
D O I
10.18653/v1/P17-1052
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper proposes a low-complexity word-level deep convolutional neural network (CNN) architecture for text categorization that can efficiently represent long-range associations in text. In the literature, several deep and complex neural networks have been proposed for this task, assuming availability of relatively large amounts of training data. However, the associated computational complexity increases as the networks go deeper, which poses serious challenges in practical applications. Moreover, it was shown recently that shallow word-level CNNs are more accurate and much faster than the state-of-the-art very deep nets such as character-level CNNs even in the setting of large training data. Motivated by these findings, we carefully studied deepening of word-level CNNs to capture global representations of text, and found a simple network architecture with which the best accuracy can be obtained by increasing the network depth without increasing computational cost by much. We call it deep pyramid CNN. The proposed model with 15 weight layers outperforms the previous best models on six benchmark datasets for sentiment classification and topic categorization.
引用
收藏
页码:562 / 570
页数:9
相关论文
共 15 条
  • [11] Kim Y, 2014, IEEE ASME INT C ADV, P1747, DOI 10.1109/AIM.2014.6878336
  • [12] Mikolov T, 2013, Advances in Neural Information Processing Systems, P3111, DOI DOI 10.48550/ARXIV.1310.4546
  • [13] Simonyan K, 2015, Arxiv, DOI arXiv:1409.1556
  • [14] Yang Zichao, 2016, P N AM ASS COMP LING
  • [15] Zhang X, 2015, ADV NEURAL INF PROCE, DOI DOI 10.5555/2969239.2969312