recommendation (Seo et al., 2017) and location
prediction (McGee et al., 2013). In particular, it has
been shown that the community structure in social
networks is deeply correlated with ties strength and
communities extracted under the binary
correspondence of the network are often less
representative of the real community structure (Fan et
al., 2007; Newman, 2004). The study of tie strength
speculates, moreover, that “the stronger the tie
connecting two individuals, the more similar they are”
(Granovetter, 1973). Obviously, in social profiling,
not all relationships are the same to the profiled user.
Some of them being more frequent or intense than
others are, presumably, more revealing of his interests.
Thus, close friends in generic social media or frequent
collaborations in co-authorships networks should not
be treated the same as acquaintances or occasional
collaborations.
Accordingly, we propose a strength-sensitive
community-based profiling approach (named
SCoBSP) , built upon an existing community-based
process (Tchuente et al., 2013) which assumed the
network to be binary, i.e. all friends are equally related
to ego user (who is to be profiled) as well as to each
other. In the light of above findings, such assumption
has two key problems. On the one hand, interests are
inferred from less relevant people (those having weak
ties) and, in the other hand, the community structure
on which the profiling process is completely based is
not correctly depicted. To handle this, our approach
leverages strength of both ego-friend and friend-friend
relationships. The former allows to identify most
relevant people from whom to infer worthwhile
interests, while the latter enables to depict the most
realistic community structure of the ego network.
The remainder of the paper is organized as
follows: The next section presents works most related
to ours. Section 3 describes our approach to social
profiling on weighted ego networks. Section 4
presents evaluation results on real world co-authorship
networks and Section 5 concludes the paper with some
future directions.
2 RELATED WORK
The scientific literature outlines many studies that
exploit relationship information and social graph
characteristics in user profiling ((Piao and Breslin,
2018), Bilal et al., 2019). We review in this section
those closely related to ours, i.e. research based on
user’s ego network. Most of work within this line
were conducted on Twitter and considered only user-
friend’s connections (Piao and Breslin, 2018). For
instance, (Bhattacharya et al., 2014) mine user’s
interests from the topical expertise of the users whom
he follows in twitter. Other studies consider
connections among friends too. (Li et al., 2014)
proposed a new co-profiling approach to jointly infer
users’ attributes and relationship type (being the
reason behind link formation) in ego networks. They
assume connections are discriminatively correlated
with user attributes (e.g., employer) through
relationship type (e.g., colleague). Similarly, (Ma et
al., 2017) attempts to learn profile via a social-aware
semi-supervised topic model that relies on latent
reasons behind social connections and refined the
profiling results by a novel label propagation strategy.
Exploring another aspect of social graphs, (Tchuente
et al., 2013) described a community-based process to
infer user’s attributes via user-groups affinities and
achieved very satisfactory performance compared to
individual based models. This process is later
extended in several ways, (On-At et al., 2014)
addressed the sparse network problem by adding
distance-2 neighbors (friends of a friend) using
snowball sampling technique, while (On-At et al.,
2017a,b) integrated temporal criteria and considered
evolution of both relationships and shared
information in the network.
As for studies exploring tie strength, (McGee et
al., 2013) developed a network-based model to infer
user's locations by leveraging the strength between
users on twitter. To the best of the author’s
knowledge, this is the only study that directly
investigates tie strength in attribute profiling.
However, their model is designed to predict a single
attribute (i.e. specific to location prediction).
Conversely, the community-based process proposed
in (Tchuente et al., 2013) is intended to be generic but
assumed the network to be binary. This motivates us
to investigate tie strength contribution over such
model to infer more relevant social profile.
3 PROPOSITION
In this section, we first introduce the ego network and
user profile models and then present our strength-
sensitive profiling process that leverages relationship
strength and community structure.
3.1 Notation
For a given user u (who is to be profiled), let G =
(V,E’,E,U) be the undirected ego network graph with
positive edge strengths, where V is the set of u’s direct
relations (alters) , E’ the set of ego-alter connections