retweet occurs. The user-tweet-graph is used to iden-
tify authoritative users. The same model is used in
(Arxiden, 2013) to measure users influence in terms
of users activities.
Surprisingly, the research community seems to
have not yet taken into consideration many other kind
of graphs that can be easily built and analyzed just by
elaborating a flow of tweets. Some of these graphs
can be extremely interesting both for research and ap-
plicative purpose.
In this paper we introduce a new type of graph: the
mention graph. This graph, presented in Section 2,
can be used to improve the identification of authorita-
tive accounts, to discover active and dynamic commu-
nities, or to assign weights to the follow relationships.
In order to verify the overall structure of our
graph, we built an instance of the mention graph
and we performed a series of quantitative analysis
that are in general used in network analysis (Myers
et al., 2014): the degree distributions, connected com-
ponents, path length distributions; clustering coeffi-
cients.
For further information we also built and analyzed
an instance of the retweet graph in order to compare
our results with those presented in (Bild et al., 2015).
The analysis performed shows that the mention
graph is sound, that is the values obtained are in line
with the expected values and are similar to the same
values for the follow graph as shown in (Myers et al.,
2014).
The paper is organized as follows: in Section 2 we
briefly describe our graph model; in Section 3 we re-
port about our experimentation. Finally, in Section 4
we conclude the work.
2 GRAPH MODELS
We model interactions between Twitter users by
defining the mention graph.
In the mention graph each node represents a Twit-
ter account. A directed edge between two nodes a and
b exists if the account a mentions the account b in at
least one tweet. To record multiple citations, we la-
bel the edge with a list of timestamps corresponding
to each mention, allowing the filtering of edges on the
basis of a temporal parameters. As a design choice,
mentions contained into retweets and replies are ig-
nored.
It is worth to note, with respect to the follow
graph, the mention graph:
• captures the information spreading on the net-
work. The follow relationship tends to better rep-
resent the social ties between users, since users
most likely follow another user on a social ba-
sis. On the counter part, the mention relationship
better represents the information spreading on the
network.
• furthers a deep qualitative analysis of the follow
graph itself. For instance, comparing the follow
and the mention graphs we quantitativelyevaluate
the actual strength of the follow relationship.
• is easier to be built and updated. Since all infor-
mation needed is derivable just parsing tweets, the
graph building process is not affected by the rate
limitations of the Twitter Search API.
In order to compare our mention graph with the
retweet graph presented in (Bild et al., 2015), we also
built an instance of this last kind of graph as follows:
in the retweets graph two nodes, a and b, are con-
nected by a direct edge if a has retweeted at least one
tweet of b. Similarly to the mention graph, times-
tamps of the different retweets between the same user
accounts are stored in a list labeling the corresponding
directed edge, while retweets of retweets are ignored
by design.
3 EXPERIMENTATION
3.1 Building Graph Instances
We perform network analysis of the new mention and
retweet graphs using a Twitter collection that was
built by monitoring the activities of the Italian Pub-
lic Administrations on Twitter. More precisely, the
collection was obtained selecting a list of about 400
Italian seed keywords (e.g. with all Italian minis-
ters, agencies, etc.) and 5000 Twitter authoritative ac-
counts. These accounts were selected by starting from
a few dozens of seed accounts, such as the official ac-
counts of the Prime Minister (@Palazzo
Chigi) and
the Ministers (@Viminale, @MinisteroDifesa etc.),
and completing the list with the inclusion of their fol-
lowing accounts with the highest number of follow-
ers. We have also restricted the tweets to the Italian
language.
In the period from the 7th May to the 23rd May
2015 we collected about 5,604,779 tweets, contain-
ing 469,359 users, 991,589 mentions and 1,041,955
retweets.
3.2 Analysis
We present a preliminary analysis of some topological
features of mention and retweet graphs. In particu-
lar we use the degree distribution, connected compo-