loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Jorge Civera and Alfons Juan

Affiliation: ITI/DSIC, Universidad Politécnica de Valencia, Spain

Abstract: Finite mixture modelling is a standard pattern recognition technique. However, in statistical machine translation (SMT), the use of mixture modelling is currently being explored. Two main advantages of the mixture approach are first, its flexibility to find an appropriate tradeoff between model complexity and the amount of training data available and second, its capability to learn specific probability distributions that better fit subsets of the training dataset. This latter advantage is even more important in SMT, since it is widely accepted that most state-of-the-art translation models proposed have limited application to restricted semantic domains. In this work, we revisit the mixture extension of the well-known M21 translation model. The M2 mixture model is evaluated on a word alignment large-scale task obtaining encouraging results that prove the applicability of finite mixture modelling in SMT.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.141.12.30

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Civera, J. and Juan, A. (2008). Word Alignment Quality in the IBM 2 Mixture Model. In Proceedings of the 8th International Workshop on Pattern Recognition in Information Systems (ICEIS 2008) - PRIS; ISBN 978-989-8111-42-5, SciTePress, pages 93-102. DOI: 10.5220/0001739700930102

@conference{pris08,
author={Jorge Civera. and Alfons Juan.},
title={Word Alignment Quality in the IBM 2 Mixture Model},
booktitle={Proceedings of the 8th International Workshop on Pattern Recognition in Information Systems (ICEIS 2008) - PRIS},
year={2008},
pages={93-102},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001739700930102},
isbn={978-989-8111-42-5},
}

TY - CONF

JO - Proceedings of the 8th International Workshop on Pattern Recognition in Information Systems (ICEIS 2008) - PRIS
TI - Word Alignment Quality in the IBM 2 Mixture Model
SN - 978-989-8111-42-5
AU - Civera, J.
AU - Juan, A.
PY - 2008
SP - 93
EP - 102
DO - 10.5220/0001739700930102
PB - SciTePress