Paper

Authors: Steinunn Friðriksdóttir and Anton Ingason

Affiliation: Faculty of Icelandic and Comparative Cultural Studies, University of Iceland, Sæmundargata 2, 102 Reykjavík, Iceland

ISBN: 978-989-758-395-7

ISSN: 2184-433X

Keyword(s): Confusion Sets, Homophones, Context Dependency, Rich Morphology, Disambiguation, Icelandic.

Abstract: The processing of strings which are semantically distinct but can be easily confused with each other, often on account of being pronounced identically, is a prime example of context dependency in Natural Language Processing. This problem arises when a system needs to distinguish whether a bank is a ‘river bank’ or a ‘financial institution’ and it also challenges systems for context-sensitive spelling and grammar correction because pairs like their/there and I/me are one common source of issues that such systems must address. In practice, this type of context-dependency can be especially prominent in languages with rich morphology where large paradigms of inflected word forms lead to a proliferation of such confusion sets. In this paper, we present our novel confusion set corpus for Icelandic as well as our findings from an experiment that uses well-known classification algorithms to disambiguate confusion sets that appear in our corpus.
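As an illustration of the kind of task the abstract describes, the following is a minimal sketch of confusion set disambiguation with a classifier over nearby context words. It is not the authors' method: the abstract does not name the algorithms used, so the choice of Naive Bayes with add-one smoothing, the helper `context_features`, the class `ConfusionSetClassifier`, the window size, and the toy English sentences (built on the their/there pair the abstract mentions) are all assumptions made for this example.

```python
import math
from collections import Counter, defaultdict


def context_features(tokens, index, window=2):
    """Words within `window` positions of tokens[index], tagged by offset."""
    feats = []
    for offset in range(-window, window + 1):
        if offset == 0:
            continue
        pos = index + offset
        if 0 <= pos < len(tokens):
            feats.append(f"{offset:+d}:{tokens[pos].lower()}")
    return feats


class ConfusionSetClassifier:
    """Hypothetical Naive Bayes disambiguator for a single confusion set."""

    def __init__(self, confusion_set, window=2):
        self.confusion_set = set(confusion_set)
        self.window = window
        self.label_counts = Counter()
        self.feature_counts = defaultdict(Counter)
        self.vocabulary = set()

    def train(self, sentences):
        # Every occurrence of a confusion set member is a labelled example.
        for sentence in sentences:
            tokens = sentence.split()
            for i, token in enumerate(tokens):
                label = token.lower()
                if label in self.confusion_set:
                    self.label_counts[label] += 1
                    for feat in context_features(tokens, i, self.window):
                        self.feature_counts[label][feat] += 1
                        self.vocabulary.add(feat)

    def predict(self, tokens, index):
        # Pick the member with the highest smoothed log-probability.
        total = sum(self.label_counts.values())
        vocab_size = len(self.vocabulary)
        best_label, best_score = None, -math.inf
        for label in sorted(self.label_counts):
            score = math.log(self.label_counts[label] / total)
            denom = sum(self.feature_counts[label].values()) + vocab_size
            for feat in context_features(tokens, index, self.window):
                score += math.log((self.feature_counts[label][feat] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Toy training data (invented for this sketch, not taken from the corpus).
sentences = [
    "they lost their keys again",
    "we saw their house yesterday",
    "the book is over there now",
    "put it there on the table",
]
clf = ConfusionSetClassifier({"their", "there"})
clf.train(sentences)

# The word at index 3 is treated as the ambiguous slot; only its context is used.
guess = clf.predict("i left it there on the shelf".split(), 3)
```

In a morphologically rich language, the same setup would apply to each confusion set separately, with one classifier trained per set from occurrences found in a corpus; richer features such as part-of-speech tags are a common extension, though whether the paper uses them is not stated in the abstract.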

CC BY-NC-ND 4.0

Paper citation in several formats:
Friðriksdóttir, S. and Ingason, A. (2020). Disambiguating Confusion Sets in a Language with Rich Morphology. In Proceedings of the 12th International Conference on Agents and Artificial Intelligence - Volume 1: NLPinAI, ISBN 978-989-758-395-7, ISSN 2184-433X, pages 446-451. DOI: 10.5220/0009371504460451

@conference{nlpinai20,
author={Steinunn Rut Friðriksdóttir and Anton Karl Ingason},
title={Disambiguating Confusion Sets in a Language with Rich Morphology},
booktitle={Proceedings of the 12th International Conference on Agents and Artificial Intelligence - Volume 1: NLPinAI},
year={2020},
pages={446-451},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0009371504460451},
isbn={978-989-758-395-7},
}

TY - CONF

JO - Proceedings of the 12th International Conference on Agents and Artificial Intelligence - Volume 1: NLPinAI
TI - Disambiguating Confusion Sets in a Language with Rich Morphology
SN - 978-989-758-395-7
AU - Friðriksdóttir, S.
AU - Ingason, A.
PY - 2020
SP - 446
EP - 451
DO - 10.5220/0009371504460451
