Discovery of Newsworthy Events in Twitter

Fernando Fradique Duarte, Óscar Mortágua Pereira, Rui L. Aguiar

Abstract

The new communication paradigm established by Social Media, along with its growing popularity in recent years contributed to attract an increasing interest by several research fields. One such research field is the field of event detection in Social Media. The purpose of this work is to implement a system to detect newsworthy events in Twitter. A similar system proposed in the literature is used as the base of this implementation. For this purpose, a segmentation algorithm implemented using a dynamic programming approach is proposed in order to split the tweets into segments. Wikipedia is then leveraged as an additional factor in order to rank these segments. The top k segments in this ranking are then grouped together according to their similarity using a variant of the Jarvis-Patrick clustering algorithm. The resulting candidate events are filtered using an SVM model trained on annotated data, in order to retain only those related to real-world newsworthy events. The implemented system was tested with three months of data, representing a total of 4,770,636 tweets created in Portugal and mostly written in the Portuguese language. The precision obtained by the system was 76.9 % with a recall of 41.6%.

Download


Paper Citation


in Harvard Style

Duarte F., Mortágua Pereira Ó. and Aguiar R. (2018). Discovery of Newsworthy Events in Twitter.In Proceedings of the 3rd International Conference on Internet of Things, Big Data and Security - Volume 1: IoTBDS, ISBN 978-989-758-296-7, pages 244-252. DOI: 10.5220/0006712702440252


in Bibtex Style

@conference{iotbds18,
author={Fernando Fradique Duarte and Óscar Mortágua Pereira and Rui L. Aguiar},
title={Discovery of Newsworthy Events in Twitter},
booktitle={Proceedings of the 3rd International Conference on Internet of Things, Big Data and Security - Volume 1: IoTBDS,},
year={2018},
pages={244-252},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006712702440252},
isbn={978-989-758-296-7},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 3rd International Conference on Internet of Things, Big Data and Security - Volume 1: IoTBDS,
TI - Discovery of Newsworthy Events in Twitter
SN - 978-989-758-296-7
AU - Duarte F.
AU - Mortágua Pereira Ó.
AU - Aguiar R.
PY - 2018
SP - 244
EP - 252
DO - 10.5220/0006712702440252