Increasing Translation Speed in Phrase-based Models via Suboptimal Segmentation

Germán Sanchis-Trilles, Francisco Casacuberta

Abstract

Phrase-Based Models constitute nowadays the core of the state of the art in the statistical pattern recognition approach to machine translation. Being able to introduce context information into the translation model, they usually produce translations whose quality is often difficult to improve. However, these models have usually an important drawback: the translation speed they are able to deliver is mostly not sufficient for real-time tasks, and translating a single sentence can sometimes take some minutes. In this paper, we describe a novel technique for reducing significantly the size of the translation table, by performing a Viterbi-style selection of the phrases that constitute the final phrase-table. Even in cases where the pruned phrase table contains only 6% of the segments of the original one, translation quality is not worsened. Furthermore, translation quality remains the same in the worst case, achieving an increase of 0.3 BLEU in the best case.

Download


Paper Citation


in Harvard Style

Sanchis-Trilles G. and Casacuberta F. (2008). Increasing Translation Speed in Phrase-based Models via Suboptimal Segmentation . In Proceedings of the 8th International Workshop on Pattern Recognition in Information Systems - Volume 1: PRIS, (ICEIS 2008) ISBN 978-989-8111-42-5, pages 135-143. DOI: 10.5220/0001741701350143


in Bibtex Style

@conference{pris08,
author={Germán Sanchis-Trilles and Francisco Casacuberta},
title={Increasing Translation Speed in Phrase-based Models via Suboptimal Segmentation},
booktitle={Proceedings of the 8th International Workshop on Pattern Recognition in Information Systems - Volume 1: PRIS, (ICEIS 2008)},
year={2008},
pages={135-143},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001741701350143},
isbn={978-989-8111-42-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 8th International Workshop on Pattern Recognition in Information Systems - Volume 1: PRIS, (ICEIS 2008)
TI - Increasing Translation Speed in Phrase-based Models via Suboptimal Segmentation
SN - 978-989-8111-42-5
AU - Sanchis-Trilles G.
AU - Casacuberta F.
PY - 2008
SP - 135
EP - 143
DO - 10.5220/0001741701350143