In this paper, starting from the generation of synt-
hetic groups (with various criteria), different ranking
aggregation methods and two aggregation strategies
are used to generate group recommendations. We eva-
luate how good this integrated ranking is, with respect
to the individual ratings contained in the users’ pro-
file (without any ranking process). We performed an
analysis of the generated group recommendations via
ranking varying the size of the groups, the inner group
members similarity, and the rank aggregation mecha-
nism.
The aim of the paper is to evaluate whether or not
the ranking mechanisms may have an impact on the
goodness of GRSs and whether this can be evaluated
in off-line testing. The first results show that this kind
of evaluation is not very simple, and it seems not to
provide significant information. Indeed, a more deep
analysis shows some correlation between the charac-
teristics of the groups and the evaluation of the re-
commendations. This suggests extending the analysis
crossing the data and evaluating the impact of each
ranking technique with respect to the internal charac-
teristics of each group.
2 RELATED WORKS
Typically, GRSs are obtained by merging the single
users’ profiles in order to obtain a preferences pro-
file for the whole group, and then, by using a sin-
gle user recommendation system on this virtual pro-
file to obtain the recommendations for the group.
On the contrary, a second approach relies on firstly
using a single user recommendation system on each
user’s profile and merging these recommendations
using some group decision strategy (Masthoff, 2011).
In both cases, there is the problem to decide how to
combine preferences or recommendations.
Only few approaches considered that the decisions
taken within a group are influenced by many factors,
not only by the individual user preferences. PolyLens
(O’Connor et al., 2001) has been one of the first ap-
proaches to include social characteristics (such as the
nature of a group, the rights of group members, and
social value functions for groups) within the group
recommendation process. Also in (Ardissono et al.,
2003), intra-group roles, such as children and the di-
sabled were contemplated; each group is subdivided
into homogeneous subgroups of similar members that
fit a stereotype, and recommendations are predicted
for each subgroup and an overall preference is built
considering some subgroups more influential than ot-
hers.
The results on group recommendation, presented
in the literature, showed that there is no strategy that
can be defined as the “best”, but different approa-
ches are better suited in different scenarios, depen-
ding from the characteristics of the specific group
(Masthoff, 2011). Besides, traditional aggregation
techniques do not seem to capture the features of real-
world scenarios, as, for example, the possibility of
weighting/ranking the users in the group in order to
compute the recommendation. On the contrary, in
(Gartrell et al., 2010), the authors started to evalu-
ate the group members’ weights, in terms of their
influence in a group relying on the concept of “ex-
pertise” (how many items they rated on a set of 100
popular movies) and “group dissimilarity” (a pair-
wise dissimilarity on ratings), and selecting a diffe-
rent aggregation function starting from a “social va-
lue” (that models the intra-group relationships) deri-
ved from questionnaires. The proposed approach was
tested on real groups and not on a dataset. In (Amer-
Yahia et al., 2009), the authors propose to use the
disagreement among users’ ratings to implement an
efficient group recommendation algorithm. In (Ber-
kovsky and Freyne, 2010), an approach that provi-
des group recommendations with explicit relations-
hips within a family is proposed, investigating four
different models for weighting user data, related to
user’s function within a family or on the observed user
interactions. In (Rossi et al., 2015), the authors aimed
at identifying dominant users within a group by analy-
zing users’ interactions on social networks since their
opinions influence the group decision. The authors
developed a model weighed for group recommendati-
ons that calculates the leadership among users using
their popularity as a measure, and evaluated the sy-
stem with real users.
Finally, concerning the problem of group recom-
mendation evaluation, in the work of (Baltrunas et al.,
2010), the authors analyzes the effectiveness of group
recommendations obtained aggregating the individual
lists of recommendations produced by a collaborative
filtering system. It is observed that the effectiveness
of a group recommendation does not necessarily de-
crease when the group size grows. Moreover, when
individual recommendations are not effective a user
could obtain better suggestions looking at the group
recommendations. Finally, it is shown that the more
alike the users in the group are, the more effective the
group recommendations are.
An Off-line Evaluation of Usersâ
˘
A
´
Z Ranking Metrics in Group Recommendation
253