Stochastic Finite-State Models for Spoken Language Machine Translation
被引:7
作者:
Bangalore, Srinivas
论文数: 0引用数: 0
h-index: 0
机构:
AT and T Labs Research, 180 Park Avenue, Florham Park, NJ 07932, United StatesAT and T Labs Research, 180 Park Avenue, Florham Park, NJ 07932, United States
Bangalore, Srinivas
[1
]
Riccardi, Giuseppe
论文数: 0引用数: 0
h-index: 0
机构:
AT and T Labs Research, 180 Park Avenue, Florham Park, NJ 07932, United StatesAT and T Labs Research, 180 Park Avenue, Florham Park, NJ 07932, United States
Riccardi, Giuseppe
[1
]
机构:
[1] AT and T Labs Research, 180 Park Avenue, Florham Park, NJ 07932, United States
Algorithms - Mathematical models - Problem solving - Speech recognition - Statistical methods - Stochastic control systems - Trees (mathematics);
D O I:
10.1023/B:COAT.0000010804.12581.96
中图分类号:
学科分类号:
摘要:
The problem of machine translation can be viewed as consisting of two subproblems (a) lexical selection and (b) lexical reordering. In this paper, we propose stochastic finite-state models for these two subproblems. Stochastic finite-state models are efficiently learnable from data, effective for decoding and are associated with a calculus for composing models which allows for tight integration of constraints from various levels of language processing. We present a method for learning stochastic finite-state models for lexical selection and lexical reordering that are trained automatically from pairs of source and target utterances. We use this method to develop models for English-Japanese and English-Spanish translation and present the performance of these models for translation on speech and text. We also evaluate the efficacy of such a translation model in the context of a call routing task of unconstrained speech utterances.