Normalizing Emotion-Driven Acronyms towards Decoding Spontaneous Short Text Messages

Bizhanova Aizhan, Atsushi Fujii

2019

Abstract

Reflecting the rapid growth in the use of Social Networking Services (SNSs), it has of late become popular for users to share their feelings, impression, and opinions with each other, about what they saw or experienced, rapidly by means of short text messages (SMS). This trend has let a large number of users consciously or unconsciously use emotion-bearing words and also acronyms to reduce the number of characters to type. We have noticed this new emerging category of language unit, namely “Emotion-Driven Acronyms (EDAs)”. Because by definition, each acronym consists of less characters than its original full form, the acronyms for different full forms often coincidently identical. Consequently, the misuse of EDAs substantially decreases the readability of messages. Our long-term research goal is to normalize text in a corrupt language into the canonical one. In this paper, as the first step towards the exploration of EDAs, we focus only on the normalization for EDAs and propose a method to disambiguate the occurrence of an EDA that corresponds to different full forms depending on the context, such as “smh (so much hate / shaking my head)”. We also demonstrate what kind of features are effective in our task experimentally and discuss the nature of EDAs from different perspectives.

Download


Paper Citation


in Harvard Style

Aizhan B. and Fujii A. (2019). Normalizing Emotion-Driven Acronyms towards Decoding Spontaneous Short Text Messages.In Proceedings of the 11th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART, ISBN 978-989-758-350-6, pages 731-738. DOI: 10.5220/0007407707310738


in Bibtex Style

@conference{icaart19,
author={Bizhanova Aizhan and Atsushi Fujii},
title={Normalizing Emotion-Driven Acronyms towards Decoding Spontaneous Short Text Messages},
booktitle={Proceedings of the 11th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART,},
year={2019},
pages={731-738},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0007407707310738},
isbn={978-989-758-350-6},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 11th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART,
TI - Normalizing Emotion-Driven Acronyms towards Decoding Spontaneous Short Text Messages
SN - 978-989-758-350-6
AU - Aizhan B.
AU - Fujii A.
PY - 2019
SP - 731
EP - 738
DO - 10.5220/0007407707310738