Word reordering and a dynamic programming beam search algorithm for statistical machine translation

被引:90
作者
Tillmann, C [1 ]
Ney, H
机构
[1] IBM Corp, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA
[2] Rhein Westfal TH Aachen, Dept Comp Sci, Lehrstuhl Informat 6, D-52056 Aachen, Germany
关键词
D O I
10.1162/089120103321337458
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article, we describe an efficient beam search algorithm for statistical machine translation based on dynamic programming (DP). The search algorithm uses the translation model presented in Brown et al. (1993). Starting from a DP-based solution to the traveling-salesman problem, we present a novel technique to restrict the possible word reorderings between source and target language in order to achieve an efficient search algorithm. Word reordering restrictions especially useful for the translation direction German to English are presented. The restrictions are generalized, and a set of four parameters to control the word reordering is introduced, which then can easily be adopted to new translation directions. The beam search procedure has been successfully tested on the Verbmobil task (German to English, 8,000-word vocabulary) and on the Canadian Hansards task (French to English, 100,000-word vocabulary). For the medium-sized Verbmobil task, a sentence can be translated in a few seconds, only a small number of search errors occur, and there is no performance degradation as measured by the word error criterion used in this article.
引用
收藏
页码:97 / 133
页数:37
相关论文
共 27 条
[1]  
ALONAIZAN Y, 1999, STAT MACH TRANSLATIO
[2]  
[Anonymous], 1997, EUROPEAN C SPEECH CO
[3]  
BERGER A, 1996, Patent No. 5510981
[4]  
BERGER AL, 1994, P ARPA HUM LANG TECH, P152
[5]  
Brown P. F., 1992, Computational Linguistics, V18, P467
[6]  
Brown P. F., 1993, Computational Linguistics, V19, P263
[7]  
Brown P. F., 1990, Computational Linguistics, V16, P79
[8]  
GARCIAVAREA I, 1998, P 5 INT C SPOK LANG, P1135
[9]  
Germann U, 2001, 39TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, P228
[10]   A DYNAMIC PROGRAMMING APPROACH TO SEQUENCING PROBLEMS [J].
HELD, M ;
KARP, RM .
JOURNAL OF THE SOCIETY FOR INDUSTRIAL AND APPLIED MATHEMATICS, 1962, 10 (01) :196-210