Examining the Intra-Location Differences Among Twitter Samples
Rositsa Ivanova, Ema Kušen, Stefan Sobernig
2023
Abstract
In this paper, we explore Twitter data samples collected from five different geographical locations. For each of these geographical locations, we compare variations occurring within samples collected simultaneously from two different machines running Twitter API clients. In addition, we split the collected data samples into “complete” and “incomplete” datasets. An incomplete dataset is a collection of Twitter messages where at least one machine received a smaller data sample due to some interruption. A complete dataset is one that includes all tweets that Twitter’s API delivers for a particular set of search parameters. Our findings indicate that 86% of the complete samples show some variations in the attribute values attached to extracted tweets. While the complete datasets show comparable attribute values and network characteristics, the incomplete data samples exhibit substantial differences. We arrive at recommendations for researchers on Online Social Networks on how to mine Twitter data while mitigating these risks.
DownloadPaper Citation
in Harvard Style
Ivanova R., Kušen E. and Sobernig S. (2023). Examining the Intra-Location Differences Among Twitter Samples. In Proceedings of the 8th International Conference on Complexity, Future Information Systems and Risk - Volume 1: COMPLEXIS, ISBN 978-989-758-644-6, SciTePress, pages 94-101. DOI: 10.5220/0011990600003485
in Bibtex Style
@conference{complexis23,
author={Rositsa Ivanova and Ema Kušen and Stefan Sobernig},
title={Examining the Intra-Location Differences Among Twitter Samples},
booktitle={Proceedings of the 8th International Conference on Complexity, Future Information Systems and Risk - Volume 1: COMPLEXIS,},
year={2023},
pages={94-101},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0011990600003485},
isbn={978-989-758-644-6},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 8th International Conference on Complexity, Future Information Systems and Risk - Volume 1: COMPLEXIS,
TI - Examining the Intra-Location Differences Among Twitter Samples
SN - 978-989-758-644-6
AU - Ivanova R.
AU - Kušen E.
AU - Sobernig S.
PY - 2023
SP - 94
EP - 101
DO - 10.5220/0011990600003485
PB - SciTePress