loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Authors: Tomohiro Hattori and Satoshi Tamura

Affiliation: Department of Computing, Gifu University,1-1 Yanagido, Gifu, Japan

Keyword(s): Speech Recognition, Hubert, Minority Language, Adaptation.

Abstract: In the field of speech recognition, models and datasets are becoming larger and larger. However, it is difficult to create large datasets for minority languages, which is an obstacle to improve the accuracy of speech recognition. In this study, we attempt to improve the recognition accuracy for minority languages, by utilizing models trained on large datasets of major language, followed by adapting its language model part to the target language. It is believed that deep-learning speech recognition models learn acoustic and language processing parts. Acoustic one may be common among any languages and has fewer differences than language one. Therefore, we investigate whether it is possible to build a recognizer by keeping acoustic processing learned in the other languages and adapting language processing to the minority language.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.141.46.208

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Hattori, T. and Tamura, S. (2023). Speech Recognition for Minority Languages Using HuBERT and Model Adaptation. In Proceedings of the 12th International Conference on Pattern Recognition Applications and Methods - ICPRAM; ISBN 978-989-758-626-2; ISSN 2184-4313, SciTePress, pages 350-355. DOI: 10.5220/0011682700003411

@conference{icpram23,
author={Tomohiro Hattori and Satoshi Tamura},
title={Speech Recognition for Minority Languages Using HuBERT and Model Adaptation},
booktitle={Proceedings of the 12th International Conference on Pattern Recognition Applications and Methods - ICPRAM},
year={2023},
pages={350-355},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0011682700003411},
isbn={978-989-758-626-2},
issn={2184-4313},
}

TY - CONF

JO - Proceedings of the 12th International Conference on Pattern Recognition Applications and Methods - ICPRAM
TI - Speech Recognition for Minority Languages Using HuBERT and Model Adaptation
SN - 978-989-758-626-2
IS - 2184-4313
AU - Hattori, T.
AU - Tamura, S.
PY - 2023
SP - 350
EP - 355
DO - 10.5220/0011682700003411
PB - SciTePress