den features is gaining popularity (Nu
˜
nez-Gonzalez
and Gra
˜
na, 2015; Tiroshi et al., 2014; Zeng et al.,
2013). Graph-based representation allows to visu-
alise and deeply understand semantic relations among
items and users. Introduced in the beginning of the
20th century by sociologists, Social Network Analy-
sis (SNA)(Scott, 2012) became widespread in recent
years due to the advent of the social network websites
such as Facebook and Twitter.
For example, the effective identification of impor-
tant nodes and, in general, discovering of hidden net-
work organization structure is a challenging task for
current large-scale social networks (Sun et al., 2014).
Like the most powerful people in the community, im-
portant nodes influence not only the nodes directly
connected to them, but also the whole network. User’s
interests also have heterogeneous structure and they
can be represented by the network of connected nodes
(interests) which have varying degrees of importance.
Typical for the Collaborative filtering methods user-
items relationships can be easily represented as a bi-
partite graph and applying SNA to this network struc-
ture might discover some hidden features. Obtained
additional information is capable to help to better un-
derstand the data that can be useful to improve the
results of recommendation process.
The advantages of usage of SNA motivated us
to examine rating prediction process from the other
side. We have used the network structure to predict
the user’s given rating based only on user’s personal
preferences and his previous feedback. The proposed
method has no limitations concerning the type of rec-
ommended items, which is typical for content-based
systems, and also does not rely on the other users’
feedback, as collaborative filtering algorithm does.
This approach adapts to the user’s behaviour and al-
lows us to calculate recommendations independently
for each user.
Summary of Contributions
The main contributions of the paper are the follow-
ings:
• Novel graph-based rating prediction method that
is based on eigenvector centrality.
• Rating impact measuring process, which helps to
deal with the variation of rating.
• Experiments on three well-known MovieLens
datasets to show the comparison results in dif-
ferent data sparseness and measuring its per-
formance with respect to well-known MAE and
RMSE metrics.
factorfactor
itemitem
itemitem
factorfactor
factorfactor
itemitem
factorfactor
itemitem
factorfactor
itemitem
itemitem
Figure 1: The example of created undirected bipartite item-
factor graph.
2 METHODOLOGY
Our research was based on the hypothesis that the rat-
ing of the node in the network corresponds to the rat-
ing of the most important nodes which are connected
to it.
We have focused on the item-item relationships
inside users interests. This examination allowed us
to look deeper into the preferences of each individ-
ual user, by identifying the underlying factors that in-
fluenced his choice. The item-item relationships are
of little use to find hidden patterns, because they de-
scribe only the existence of a link (similarity) between
objects, but do not explain the reason for this connec-
tion. To compensate it, we needed to add the interme-
diate objects in the relationships between items - the
factors.
The factor is any object semantically defining
the existence of connection between items. In other
words, if there is a common factor between two or
more items, these items are semantically connected.
Genres (categories) of music, films or literature may
be considered as the representative examples of fac-
tors.
As a result, we have created undirected bipartite
graph G = (I, F, E) where I is a set of items, F is a
set of factors and E denoting the edges of the graph
(Figure 1).
Following our aforementioned hypothesis, we
require to determine the importance of each node in
the network, before we can predict the rating. For
this purpose, we have used the SNA eigenvector
centrality measure. Based on the idea that the node
importance in a network is increased by having con-
nections to other nodes that are themselves important,
eigenvector centrality computes the centrality of a
node based on the centrality of its neighbours. Hence,
the value of eigenvector centrality can be large not
Graph-based Rating Prediction using Eigenvector Centrality
229