thankfulness) according to the context of the
message.
5 CONCLUSIONS
In this paper, we have proposed an approach for
disambiguation of emotion-driven acronyms (EDA)
in short text messages by using its emotion
descriptive words evoked from the message.
Our proposed method generates candidates list for
the given EDA by looking up Urban Dictionary and
query Twitter Search API with candidates full from
to automatically collect data for training and testing.
For data collection, we have proposed new approach
to collect the data without any manually annotation.
We proposed a method, which collects the data by
querying Twitter Search API with candidate of the
given EDA plus EDA indicator word, which is
extracted from the EDA definition on UD webpage.
Thus, we do not need to manually annotate Twitter
text messages.
For the identification of emotional state for the
given EDA, we also used description provided by
Urban Dictionary and using seven emotion categories
from the existing studies, we could automatically
identify emotion label for the given EDA.
In disambiguation task, we conducted two
experiments using emotion labelled dataset and
dataset without any emotion label for the performance
evaluation. We could achieve high F-score by
integrating dictionary lookup, automatically
collecting and labelling and probabilistic LM.
In future work, we plan to improve the
performance of our system by considering EDA with
many emotion labels and EDA with identical emotion
labels.
REFERENCES
Barua, J., Patel, D., 2016. Discovery, Enrichment and
Disambiguation of Acronyms. Springer International
Publishing, LNCS, vol. 9829, pp. 345-360.
Boguraev, B.K., Chu-Carroll, J., Ferrucci, D.A., Levas,
A.T., and Prager, J.M., 2015. Context-Based
Disambiguation of Acronyms and Abbreviations.
United States Patent. Patent No.: US 9,020,805 B2.
Bracewell, D.B., Russell, S., and Wu, A.S., 2005.
Identification, Expansion and Disambiguation of
Acronyms in Biomedical Texts. In ISPA Workshops,
LNCS, vol. 3759, pp. 186-195.
Charbonnier, J., Wartena, C., 2018. Using Word
Embeddings for Unsupervised Acronym
Disambiguation. In 27
th
International Conference on
Computational Linguistics, pp. 2610-2619. New
Mexico, USA
Duque, A., Martinez-Romo, J., Araujo, L., 2016. Can
multilinguality improve Biomedical Word Sense
Disambiguation. Journal of Biomedical Informatics,
vol. 64, pp. 320-332.
Sridhar, V.K.R., 2015. Unsupervised Text Normalization
Using Distributed Representations of Words and
Phrases. In NAACL-HLT, pp. 8-16. Denver, Colorado.
Li, C., Ji, L., and Yan, J., 2015. Acronym Disambiguation
Using Word Embedding. In 29
th
of AAAI Conference
on Artificial Intelligence, pp. 4178-4179. Austin, TX.
Mohammad, S.M., 2012. #Emotional Tweets. In 1
st
of Joint
Conference on Lexical and Computational Semantics,
pp. 246-255. Montreal, Canada.
Moon, S., McInnes, B., and Melton, G.B., 2015. Challenges
and Practical Approaches with Word Sense
Disambiguation of Acronyms and Abbreviations in the
Clinical Domain. Healthcare Informatics Research,
21(1):35-42.
Roger, M., 2009. Ordering the Suggestions of a
Spellchecker Without Using Context. Natural
Language Engineering, 15(2):173-192.
Shaver, P., Schwartz, J., Kirson, D., and O'Connor, C.,
2001. Emotional Knowledge: Further Exploration of a
Prototype Approach. In: G. Parrott (Eds.), Emotions in
Social Psychology: Essential Readings, 26-56.
Philadelphia, PA.
Sproat, R., Black, A., Chen, S., Kumar, S., Ostendorf, M.,
and Richards, C., 2001. Normalization of Non-Standard
Words. Computer Speech and Language, 15(3): 287-
333.
Wang, W., Chen, L., Thirunarayan, K., and Sheth, A.P.,
2012. Harnessing Twitter “Big Data” for Automatic
Emotion Identification. In: 4
th
IEEE Conference on
Social Computing, pp. 587-592. Washington, DC.
Xue, Z., Yin, D., and Davison, B.D., 2011. Normalizing
Microtext. In: AAAI-11: Workshop on Analyzing
Microtext, pp. 74-79. San Francisco, CA.