Efficient Term Frequency Inverse Document Frequency Method for Homonym Word Detection Using Concept-Based Similarity Measures

Sunil Kumar, Rajendra Gupta

2023

Abstract

Two or more words having the same spelling or sound but different meanings are called homonyms. The word homonyms and non-homographic homophones are complimentary subsets of homophones, which are words with the same pronunciation but different meanings. Despite their similarities, there has been substantial dispute about whether the two patterns in word recognition are similar. Identifying homonyms is one of the issues that make collecting and evaluating data from the scientific literature which is a tedious task. The terminology used to explain homonymy, heterography, and related phenomena is a bit muddled and often misunderstood, so some cleaning up is required for clarity. The paper presents a Term Frequency/Inverse Document Frequency Method for Homonym Words detection using Concept based Similarity Measures. The results show the homonym identification is achieved around 7-13 percentage better results for different datasets as compared to earlier proposed method.

Download


Paper Citation


in Harvard Style

Kumar S. and Gupta R. (2023). Efficient Term Frequency Inverse Document Frequency Method for Homonym Word Detection Using Concept-Based Similarity Measures. In Proceedings of the 1st International Conference on Artificial Intelligence for Internet of Things: Accelerating Innovation in Industry and Consumer Electronics - Volume 1: AI4IoT; ISBN 978-989-758-661-3, SciTePress, pages 537-540. DOI: 10.5220/0012604300003739


in Bibtex Style

@conference{ai4iot23,
author={Sunil Kumar and Rajendra Gupta},
title={Efficient Term Frequency Inverse Document Frequency Method for Homonym Word Detection Using Concept-Based Similarity Measures},
booktitle={Proceedings of the 1st International Conference on Artificial Intelligence for Internet of Things: Accelerating Innovation in Industry and Consumer Electronics - Volume 1: AI4IoT},
year={2023},
pages={537-540},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012604300003739},
isbn={978-989-758-661-3},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 1st International Conference on Artificial Intelligence for Internet of Things: Accelerating Innovation in Industry and Consumer Electronics - Volume 1: AI4IoT
TI - Efficient Term Frequency Inverse Document Frequency Method for Homonym Word Detection Using Concept-Based Similarity Measures
SN - 978-989-758-661-3
AU - Kumar S.
AU - Gupta R.
PY - 2023
SP - 537
EP - 540
DO - 10.5220/0012604300003739
PB - SciTePress