Beyond Twitter: Exploring Alternative API Sources for Social Media
Analytics
Alina Campan
a
and Noah Holtke
School of Computing and Analytics, Northern Kentucky University, Nunn Drive, Highland Heights, U.S.A.
Keywords: Social Media Analytics, Federated Social Media Platforms, API-Scraping.
Abstract: Social media is a valuable source of data for applications in a multitude of fields: agriculture, banking,
business intelligence, communication, disaster management, education, government, health, hospitality and
tourism, journalism, management, marketing, etc. There are two main ways to collect social media data: web
scraping (requires more complex custom programs, faces legal and ethical concerns) and API-scraping using
services provided by the social media platform itself (clear protocols, clean data, follows platform established
rules). However, API-based access to social media platforms has significantly changed in the last few years,
with the mainstream platforms placing more restrictions and pricing researchers out. At the same time, new,
federated social media platforms have emerged, many of which have a growing user base and could be
valuable data sources for research. In this paper, we describe an experimental framework to API-scrape data
from the federated Mastodon platform (specifically its flagship node, Mastodon.social), and the results of
volume, sentiment, emotion, and topic analysis on two datasets we collected – as a proof of concept for the
usefulness of sourcing data from the Mastodon platform.
1 INTRODUCTION
Social media and online social networks (OSNs) have
been a primary means to spread and consume
information for a while now, due to the low cost and
high pervasiveness. Despite their negative aspects,
such as the echo chamber effect, and their potential
for the spread of misinformation and disinformation,
the discourse on social media also has positive
dimensions, as is reflective of real-world events and
trends. This allows for the positive use of social
media data in a multitude of application fields:
agriculture, banking, business intelligence,
communication, disaster management, disruptive
technology, education, ethics, government, health,
hospitality and tourism, journalism, management,
marketing, understanding terrorism (Zachlod, 2022).
Different analysis methods are being used, including
sentiment analysis, topic discovery, word frequency
analysis and content analysis (Zachlod, 2022); also,
analysis methods are still being researched and
developed that are capable of effectively handling
massive amounts of social media data (Zachlod,
a
https://orcid.org/0000-0002-9296-3036
2022) with acceptable accuracy. Commercial tools
for social media analysis are also available.
Despite the variety of application fields and
analytic methods, the deployment of social media
analysis frameworks follows similar “steps necessary
to gain useful information or even knowledge out of
social media”; these steps are discovery, tracking (or
collection), preparation, and analysis (Stieglitz,
2018), (Zachlod, 2022). In the tracking step, data is
collected from one (or more) social media
platform(s), using the provided communication
method (API, RSS, HTML scraping.) In a recent
literature review, Zachlod reported that from 94
articles they reviewed, the social media platforms
investigated in these research works were: Twitter (55
studies), Facebook (25 studies), Instagram (13
studies), YouTube (8 studies), TripAdvisor (8
studies), LinkedIn (4 studies), other - Foursquare,
Google +, TikTok, WeChat, Sina Weibo (21 times)
(Zachlod, 2022). Of all social media sites, Twitter
used to be the most popular. Twitter was once the
dominant social media platform among all others.
This was due to its less stringent privacy controls
compared to platforms like Facebook (as Twitter is a
Campan, A. and Holtke, N.
Beyond Twitter: Exploring Alternative API Sources for Social Media Analytics.
DOI: 10.5220/0013035600003838
Paper published under CC license (CC BY-NC-ND 4.0)
In Proceedings of the 16th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2024) - Volume 1: KDIR, pages 441-447
ISBN: 978-989-758-716-0; ISSN: 2184-3228
Proceedings Copyright © 2024 by SCITEPRESS – Science and Technology Publications, Lda.
441