the remaining six collections, we found rather small,
yet noticeable variations in the number of vertices,
number of edges, number of connected components,
and the degree distribution. As expected, all incom-
plete collections showed differences in terms of net-
work characteristics. However, we also found smaller
network-level differences regarding certain network
characteristics for some complete collections.
Based on our study, we derive the following rec-
ommendations for researchers using Twitter’s free-
of-charge API. First, the status codes and error mes-
sages issued by Twitter’s API must be handled and
documented properly in order to avoid incomplete
data samples. Second, we suggest researchers to be
cautious when relying on attribute values which are
expected to change in time and in space, such as
count attributes (e.g. retweet count or like count). In
addition, our previous research on (dis-)similarities
between individual geolocations (see Ivanova et al.
2022) recommended the use of three or more geolo-
cations for accessing the Twitter API in parallel, and
the use of a three-day delay.
REFERENCES
Boutyline, A. and Willer, R. (2017). The social struc-
ture of political echo chambers: Variation in ideologi-
cal homophily in online networks. Political Psychology,
38(3):551–569.
Broniatowski, D. A., Jamison, A. M., Qi, S., AlKulaib,
L., Chen, T., Benton, A., Quinn, S. C., and Dredze, M.
(2018). Weaponized health communication: Twitter bots
and Russian trolls amplify the vaccine debate. American
Journal of Public Health, 108(10):1378–1384.
Campan, A., Atnafu, T., Truta, T. M., and Nolan, J. (2018).
Is data collection through Twitter streaming API useful
for academic research? In Proc. IEEE International
Conference on Big Data, pages 3638–3643. IEEE.
Grinberg, N., Joseph, K., Friedland, L., Swire-Thompson,
B., and Lazer, D. (2019). Fake news on Twitter during the
2016 US presidential election. Science, 363(6425):374–
378.
Gruzd, A. and Roy, J. (2014). Investigating political po-
larization on Twitter: A Canadian perspective. Policy &
Internet, 6(1):28–45.
Ivanova, R. V., Sobernig, S., and Strembeck, M. (2022).
Does geographical location have an impact on data sam-
ples extracted from Twitter? In Proc. 9th International
Conference on Social Networks Analysis, Management
and Security (SNAMS) (in press). IEEE.
Kim, Y., Nordgren, R., and Emery, S. (2020). The story
of Goldilocks and three Twitter’s APIs: A pilot study
on Twitter data sources and disclosure. International
Journal of Environmental Research and Public Health,
17(3):864.
Ku
ˇ
sen, E. and Strembeck, M. (2021). Building blocks of
communication networks in times of crises: Emotion-
exchange motifs. Computers in Human Behavior, 123.
Kwak, H., Lee, C., Park, H., and Moon, S. (2010). What
is Twitter, a social network or a news media? In Proc.
19th International Conference on World Wide Web, pages
591–600. ACM.
Morone, F. and Makse, H. A. (2015). Influence maximiza-
tion in complex networks through optimal percolation.
Nature, 524(7563):65–68.
Morstatter, F., Pfeffer, J., Liu, H., and Carley, K. M. (2013).
Is the sample good enough? comparing data from twit-
ter’s streaming API with twitter’s firehose. In Kiciman,
E., Ellison, N. B., Hogan, B., Resnick, P., and Soboroff,
I., editors, Proc. International AAAI Conference on Web
and Social Media, pages 400–408. The AAAI Press.
Pfeffer, J., Mayer, K., and Morstatter, F. (2018). Tampering
with Twitter’s sample API. EPJ Data Science, 7(1):50.
Pfeffer, J., Mooseder, A., Hammer, L., Stritzel, O., and Gar-
cia, D. (2022). This sample seems to be good enough!
assessing coverage and temporal reliability of Twitter’s
academic API. CoRR, abs/2204.02290.
Timoneda, J. C. (2018). Where in the world is my tweet:
Detecting irregular removal patterns on Twitter. PloS
one, 13(9).
Wang, Y., Callan, J., and Zheng, B. (2015). Should we
use the sample? analyzing datasets sampled from twit-
ter’s stream api. ACM Transactions on the Web (TWEB),
9(3):1–23.
Xiong, Y., Cho, M., and Boatwright, B. (2019). Hashtag
activism and message frames among social movement
organizations: Semantic network analysis and thematic
analysis of Twitter during the #MeToo movement. Pub-
lic Relations Review, 45(1):10–23.
Examining the Intra-Location Differences Among Twitter Samples
101