The alignment template approach to statistical machine translation

被引:268
作者
Och, FJ
Ney, H
机构
[1] Google Inc, Mountain View, CA 94043 USA
[2] RWTH Aachen Univ Technol, Dept Comp Sci, Lehrstuhl Informat 6, D-52056 Aachen, Germany
关键词
D O I
10.1162/0891201042544884
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A phrase-based statistical machine translation approach-the alignment template approach-is described. This translation approach allows for general many-to-many relations between words. Thereby, the context of words is taken into account in the translation model, and local changes in word order from source to target language can be learned explicitly. The model is described using a log-linear modeling approach, which is a generalization of the often used source-channel approach. Thereby, the model is easier to extend than classical statistical machine translation systems. We describe in detail the process for learning phrasal translations, the feature functions used, and the search algorithm. The evaluation of this approach is performed on three different tasks. For the German-English speech Verbmobil task, we analyze the effect of various system components. On the French-English Canadian Hansards task, the alignment template system obtains significantly better results than a single-word-based translation model. In the Chinese-English 2002 National Institute of Standards and Technology (NIST) machine translation evaluation it yields statistically significantly better NIST scores than all competing research and commercial translation systems.
引用
收藏
页码:417 / 449
页数:33
相关论文
共 43 条
  • [1] Learning dependency translation models as collections of finite-state head transducers
    Alshawi, H
    Bangalore, S
    Douglas, S
    [J]. COMPUTATIONAL LINGUISTICS, 2000, 26 (01) : 45 - 60
  • [2] Bellman R., 1957, DYNAMIC PROGRAMMING
  • [3] BERGER AL, 1994, P ARPA WORKSH HUM LA, P157
  • [4] The psychology of reactions to environmental agents
    Berglund, B
    Job, RFS
    [J]. ENVIRONMENT INTERNATIONAL, 1996, 22 (01) : 1 - 1
  • [5] Brown P. F., 1993, Computational Linguistics, V19, P263
  • [6] Brown P. F., 1990, Computational Linguistics, V16, P79
  • [7] Charniak E., 2003, P MT SUMM 9 NEW ORL, P40
  • [8] GENERALIZED ITERATIVE SCALING FOR LOG-LINEAR MODELS
    DARROCH, JN
    RATCLIFF, D
    [J]. ANNALS OF MATHEMATICAL STATISTICS, 1972, 43 (05): : 1470 - &
  • [9] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM
    DEMPSTER, AP
    LAIRD, NM
    RUBIN, DB
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01): : 1 - 38
  • [10] GARCIAVAREA I, 2001, P 39 ANN M ASS COMP, P204