Authors: Benyamin Ahmadnia (1), Raul Aranovich (1), and Bonnie J. Dorr (2)

Affiliations: (1) Department of Linguistics, University of California, Davis, CA, U.S.A.; (2) Institute for Human and Machine Cognition (IHMC), Ocala, FL, U.S.A.

Keyword(s): Computational Linguistics, Natural Language Processing, Neural Machine Translation, Low-Resource Languages, Joint Learning.

Abstract: This paper describes a systematic study of an approach to Farsi-Spanish low-resource Neural Machine Translation (NMT) that leverages monolingual data for joint learning of forward and backward translation models. As is standard for NMT systems, the training process begins with two pre-trained translation models, which are iteratively updated by decreasing translation costs. In each iteration, each translation model translates monolingual text from one language into the other, generating a synthetic dataset for the other translation model. Two new translation models are then learned from the bilingual data together with the synthetic texts. The key feature distinguishing our approach from standard NMT is an iterative learning process that improves the performance of both translation models while producing a higher-quality synthetic training dataset at each iteration. Our empirical results demonstrate that this approach outperforms baselines.
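The iterative loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the word-for-word dictionary "model" stands in for a real NMT system, the cost-based updates mentioned in the abstract are not modeled, and the names `train`, `translate`, and `joint_learning` are hypothetical. The sketch only shows how each model's output on monolingual text becomes synthetic training data for the other model.

```python
from collections import Counter, defaultdict


def train(pairs):
    """Toy stand-in for NMT training (an assumption, not the paper's model):
    map each source word to the target word it is most often paired with
    at the same position."""
    counts = defaultdict(Counter)
    for src, tgt in pairs:
        for s, t in zip(src.split(), tgt.split()):
            counts[s][t] += 1
    return {s: c.most_common(1)[0][0] for s, c in counts.items()}


def translate(model, sentence):
    """Translate word by word; unknown words are copied through unchanged."""
    return " ".join(model.get(w, w) for w in sentence.split())


def joint_learning(bitext, mono_src, mono_tgt, iterations=3):
    """Jointly refine forward (src->tgt) and backward (tgt->src) models."""
    reversed_bitext = [(t, s) for s, t in bitext]
    # Start from two models pre-trained on the bilingual data only.
    fwd = train(bitext)
    bwd = train(reversed_bitext)
    for _ in range(iterations):
        # Forward model translates source monolingual text; the
        # (synthetic target, real source) pairs train the backward model.
        synth_for_bwd = [(translate(fwd, s), s) for s in mono_src]
        # Backward model translates target monolingual text; the
        # (synthetic source, real target) pairs train the forward model.
        synth_for_fwd = [(translate(bwd, t), t) for t in mono_tgt]
        # Learn two new models from bilingual plus synthetic data.
        fwd = train(bitext + synth_for_fwd)
        bwd = train(reversed_bitext + synth_for_bwd)
    return fwd, bwd


# Placeholder tokens, not real Farsi or Spanish data.
bitext = [("a b", "x y")]
fwd, bwd = joint_learning(bitext, mono_src=["a b"], mono_tgt=["y x"])
```

In a real system each `train` call would be a full NMT training or fine-tuning run, and the quality of the synthetic pairs would depend on how good the current models are, which is why the abstract emphasizes that both models and the synthetic data improve together.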

CC BY-NC-ND 4.0

Paper citation in several formats:
Ahmadnia, B., Aranovich, R., and Dorr, B. (2021). Strengthening Low-resource Neural Machine Translation through Joint Learning: The Case of Farsi-Spanish. In Proceedings of the 13th International Conference on Agents and Artificial Intelligence - Volume 1: NLPinAI; ISBN 978-989-758-484-8; ISSN 2184-433X, SciTePress, pages 475-481. DOI: 10.5220/0010362604750481

@conference{nlpinai21,
author={Benyamin Ahmadnia and Raul Aranovich and Bonnie J. Dorr},
title={Strengthening Low-resource Neural Machine Translation through Joint Learning: The Case of Farsi-Spanish},
booktitle={Proceedings of the 13th International Conference on Agents and Artificial Intelligence - Volume 1: NLPinAI},
year={2021},
pages={475-481},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010362604750481},
isbn={978-989-758-484-8},
issn={2184-433X},
}

TY - CONF

JO - Proceedings of the 13th International Conference on Agents and Artificial Intelligence - Volume 1: NLPinAI
TI - Strengthening Low-resource Neural Machine Translation through Joint Learning: The Case of Farsi-Spanish
SN - 978-989-758-484-8
IS - 2184-433X
AU - Ahmadnia, B.
AU - Aranovich, R.
AU - Dorr, B.
PY - 2021
SP - 475
EP - 481
DO - 10.5220/0010362604750481
PB - SciTePress