Stochastic Finite-State Models for Spoken Language Machine Translation

被引：7

作者：

Bangalore, Srinivas ^{[1
]}

Riccardi, Giuseppe ^{[1
]}

机构：

[1] AT and T Labs Research, 180 Park Avenue, Florham Park, NJ 07932, United States

来源：

Machine Translation | 2002年 / 17卷 / 03期

关键词：

Algorithms - Mathematical models - Problem solving - Speech recognition - Statistical methods - Stochastic control systems - Trees (mathematics);

D O I：

10.1023/B:COAT.0000010804.12581.96

中图分类号：

学科分类号：

摘要：

The problem of machine translation can be viewed as consisting of two subproblems (a) lexical selection and (b) lexical reordering. In this paper, we propose stochastic finite-state models for these two subproblems. Stochastic finite-state models are efficiently learnable from data, effective for decoding and are associated with a calculus for composing models which allows for tight integration of constraints from various levels of language processing. We present a method for learning stochastic finite-state models for lexical selection and lexical reordering that are trained automatically from pairs of source and target utterances. We use this method to develop models for English-Japanese and English-Spanish translation and present the performance of these models for translation on speech and text. We also evaluate the efficacy of such a translation model in the context of a call routing task of unconstrained speech utterances.

引用

页码：165 / 184