how they deal with the issue. Although there are
bullying support websites, many victims might not
seek help or talk about their experiences with friends,
and so bullying-related discussions on general social
media sites might provide anonymous indirect
support or hostility, depending on the attitude of the
influencer and their followers.
To start, YouTube was searched for UK-based
female lifestyle vloggers with at least 20,000
subscribers, which was surprisingly difficult. 34 were
identified and Mozdeh was used to download all 4.6
million comments on all their videos. Of course, few
of the videos or comments were about bullying but
about 8k were and the bullying set was defined as
being the comments matching the following Mozdeh
query of the comments (i.e., containing any of the
words): bullying bully bullied bullies cyberbully
cyberbullied cyberbullies cyberbullying. This was set
A for WATA. Set B was the remaining comments so
there was no ‘Remainder’ set for Figure 1.
For the WAD stage, only bullying-related words
were analysed because non-bullying topics were
irrelevant (i.e., set A but not set B words). This was
achieved in Mozdeh by entering the above bullying
query and clicking the ‘Mine Associations…’ button.
This produced over 1000 bullying-related words but
only the top 100 were analysed as a practical step
because the WAD stage was time consuming and
after 100 words, no new themes were emerging.
The WAC and TA stages created 12 bullying
related themes, five describing bullying and seven
expressing support for victims. The first set included
the location of the bullying (school or online), its
duration, how it happened and its long-term effects.
The second themes included thanking and expressing
support for victims, criticising bullies, praising the
victims, expressing empathy, and offering general
advice. One theme did not seem to have been
mentioned before in the bullying literature:
supporting victims by abstracting the situation to
emphasise that they and their individual traits were
not to blame for being bullied, despite this being the
apparent focus of the bullying. This last point
identifies a particular strength of WATA: its
automatic identification of words at the start can help
the researcher identify issues that they were not
previously aware of.
4.2 ADHD Updates on Twitter
Attention Deficit Hyperactivity Disorder (ADHD) is
a common behavioural disorder that can cause
problems in some aspects of people’s lives. A WATA
study investigated how people with ADHD tweeted
about their condition in the hope of identifying new
insights into the sufferer perspective compared to
previous research using interviews or focus groups
(Thelwall, et al., 2021). The rationale for the study
was that Twitter is sometimes used to provide life
updates and people with ADHD might tweet about
things that they did not consider important enough to
mention in interviews or surveys.
Although personal issues are not a natural topic
for gathering tweets, ADHD is a common enough
condition for it to be reasonably well represented on
Twitter. Gathering tweets by people with ADHD is
tricky, though, since many tweeters without ADHD
can mention it (e.g., researchers, teachers, friends,
family, organisations). This makes it difficult to
generate high precision queries that target tweets by
people with ADHD. The solution to this was to query
the phrase, ‘my ADHD’, since checks at Twitter.com
suggested that this phrase was almost exclusively
used in tweets by people that appeared to be reporting
about their own ADHD. This query had the additional
advantage that tweets containing ‘my ADHD’ would
not only probably be from people with ADHD but
also probably be about something related to their
ADHD. The latter point is important because of
course people with ADHD may also tweet about the
news, sport, computer games and anything else.
For the data collection stage, set A comprised
tweets containing the phrase ‘my ADHD’ and set B
included tweets containing the phrase ‘my X’, where
X was any one of 99 other health conditions. It was
important to compare against other self-declared
health conditions to get insights that were specific to
ADHD. Mozdeh collected 1m tweets for this, 59k of
which were ADHD related.
For the WAD stage, only ADHD-related words
were analysed because non-ADHD topics were
irrelevant (i.e., set A but not set B words). This was
achieved in Mozdeh by entering ‘my ADHD’ and
clicking the ‘Mine Associations…’ button. This
produced over 1000 ADHD-related words but only
the top 200 were analysed because no new themes
were emerging when this number was reached.
The thematic analysis stage identified 19 themes,
although 4 were trivial (e.g., usernames). These
included medication, focus/distraction, fidgeting,
other symptoms, accommodations, diagnosis,
psychiatrists, brain, ‘my ADHD brain’,
neurodivergence, self, blame/causation, and co-
morbidities. For example, the theme ‘my ADHD
brain’ was derived from WAD words including brain,
hellbrain and ass.
Ass for the YouTube example, most themes
reflected issues that were already known about in the