Hierarchical phrase-based translation

被引:286
作者
Chiang, David [1 ]
机构
[1] Univ So Calif, Inst Informat Sci, Marina Del Rey, CA 90292 USA
关键词
D O I
10.1162/coli.2007.33.2.201
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a statistical machine translation model that uses hierarchical phrases-phrases that contain subphrases. The model is formally a synchronous context-free grammar but is learned from a parallel text without any syntactic annotations. Thus it can be seen as combining fundamental ideas from both syntax-based translation and phrase-based translation. We describe our system's training and decoding methods in detail, and evaluate it for translation speed and translation accuracy. Using BLEU as a metric of translation accuracy, we find that our system performs significantly better than the Alignment Template System, a state-of-the-art phrase-based system.
引用
收藏
页码:201 / 228
页数:28
相关论文
共 41 条
[1]   Learning dependency translation models as collections of finite-state head transducers [J].
Alshawi, H ;
Bangalore, S ;
Douglas, S .
COMPUTATIONAL LINGUISTICS, 2000, 26 (01) :45-60
[2]  
[Anonymous], TR1098 HARV U CTR RE
[3]  
[Anonymous], 2005, Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
[4]  
Bar-Hillel Yehoshua, 1964, LANGUAGE INFORMATION, P116
[5]  
Block HU, 2000, ART INTEL, P411
[6]  
BOD R, 1992, P COLING 92 NANT, P855
[7]  
Brown P. F., 1993, Computational Linguistics, V19, P263
[8]  
Chiang D, 2005, P HUM LANG TECHN C C, P779
[9]  
CHIANG D, 2005, P 43 ANN M ASS COMP, P263, DOI DOI 10.3115/1219840.1219873
[10]  
COLLINS M, 2005, P 43 ANN M ASS COMP, P531, DOI DOI 10.3115/1219840.1219906