A survey on the application of recurrent neural networks to statistical language modeling

被引:166
作者
De Mulder, Wim [1 ]
Bethard, Steven [2 ]
Moens, Marie-Francine [1 ]
机构
[1] Katholieke Univ Leuven, B-3001 Heverlee, Belgium
[2] Univ Alabama Birmingham, Birmingham, AL 35294 USA
关键词
Recurrent neural networks; Natural language processing; Language modeling; Speech recognition; Machine translation; N-GRAM;
D O I
10.1016/j.csl.2014.09.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a survey on the application of recurrent neural networks to the task of statistical language modeling. Although it has been shown that these models obtain good performance on this task, often superior to other state-of-the-art techniques, they suffer from some important drawbacks, including a very long training time and limitations on the number of context words that can be taken into account in practice. Recent extensions to recurrent neural network models have been developed in an attempt to address these drawbacks. This paper gives an overview of the most important extensions. Each technique is described and its performance on statistical language modeling, as described in the existing literature, is discussed. Our structured overview makes it possible to detect the most promising techniques in the field of recurrent neural networks, applied to language modeling, but it also highlights the techniques for which further research is required. (C) 2014 The Authors. Published by Elsevier Ltd.
引用
收藏
页码:61 / 98
页数:38
相关论文
共 131 条
  • [1] Ahmed Farag, 2009, Polibits, V0, P39
  • [2] Almeida L. B., 1998, On-Line Learning in Neural Networks, P111
  • [3] [Anonymous], 1994, THESIS CARNEGIE MELL
  • [4] [Anonymous], 2013, P IEEE INT C ACOUSTI, DOI 10.1109/ICASSP.2013.6638947
  • [5] [Anonymous], 2013, P 2013 C N AM CHAPT
  • [6] [Anonymous], 2012, P 9 INT WORKSHOP SPO
  • [7] [Anonymous], 2012, PROC 4 NAMED ENTITY
  • [8] [Anonymous], 1998, WordNet
  • [9] [Anonymous], THESIS U MARYLAND
  • [10] [Anonymous], P ACL IJCNLP 2009 C