Common Topic Identification in Online Maltese News Portal Comments
Samuel Zammit, Fiona Sammut, David Suda
2021
Abstract
This paper aims to identify common topics in a dataset of online news portal comments made between April 2008 and January 2017 on the Times of Malta website. By making use of the FastText algorithm, Word2Vec is used to obtain word embeddings for each unique word in the dataset. Furthermore, document vectors are also obtained for each comment, where again similar comments are assigned similar representations. The resulting word and document embeddings are also clustered using k-means clustering to identify common topic clusters. The results obtained indicate that the majority of comments follow a political theme related either to party politics, foreign politics, corruption, issues of an ideological nature, or other issues. Comments related to themes such as sports, arts and culture were not common, except around years with major events. Additionally, a number of topics were identified as being more prevalent during some time periods rather than others. These include the Maltese divorce referendum in 2011, the Maltese citizenship scheme in 2013, Russia’s annexation of Crimea in 2014, Brexit in 2015 and corruption/Panama Papers in 2016.
DownloadPaper Citation
in Harvard Style
Zammit S., Sammut F. and Suda D. (2021). Common Topic Identification in Online Maltese News Portal Comments.In Proceedings of the 10th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM, ISBN 978-989-758-486-2, pages 548-555. DOI: 10.5220/0010250605480555
in Bibtex Style
@conference{icpram21,
author={Samuel Zammit and Fiona Sammut and David Suda},
title={Common Topic Identification in Online Maltese News Portal Comments},
booktitle={Proceedings of the 10th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,},
year={2021},
pages={548-555},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010250605480555},
isbn={978-989-758-486-2},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 10th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,
TI - Common Topic Identification in Online Maltese News Portal Comments
SN - 978-989-758-486-2
AU - Zammit S.
AU - Sammut F.
AU - Suda D.
PY - 2021
SP - 548
EP - 555
DO - 10.5220/0010250605480555