Table 2: Precision and NDCG values for all users.
P@5 P@10 NDCG @5 NDCG @10
Entity-centric Profile (Abbar et al., 2013) 0.512 0.551 0.806 0.786
Global Profile 0.586 0.593 0.855 0.797
is computed as follows:
NDCG(E, k) =
1
|E|
|E|
∑
j=1
Z
k j
k
∑
i=1
2
rel( j,i)
− 1
log
2
(1 +i)
where Z
k j
is a normalization factor calculated to make
NDCG at k equal to 1 in case of perfect ranking, and
rel( j, i) is the relevance score of a news article at rank
i.
In our setting, relevance scores rel( j, i) have two
different values: 1(relevant) if the news article was
commented by the user u, and 0(not relevant) if the
news article was not commented by the user u. The
precision and NDCG results for the three strategies
are shown in Table 1.
We can see in Table 1 that our approach of using
global profile outperforms the baseline approach with
a gain between 4 and 7 of % in term of precision
and 5% in term of ranking at NDCG@5. The reason
is that most of news articles do not address entities
without relating them to some key-concepts. More-
over, when viewpoints are expressed about entities,
they usually refer to certain key-concepts of those en-
tities. Thus, using only entities to build profiles gives
less room for diversification which penalizes the per-
formance. Consequently the combination of both en-
tities and key-concepts give the best results.
6 CONCLUSION AND OUTLOOK
In this paper, we have proposed a two-stage personal-
ized news recommendation approach that takes into
account users interests based on their comments in
news sites. We recommend a set of diverse news arti-
cles using dissimilarity measure based on (1) seman-
tic diversification and/or (3) sentiment diversification.
As future works, we plan to first test our model in a
bigger set of users and explore more diversification
techniques based on users’ comments.
REFERENCES
Abbar, S., Amer-Yahia, S., Indyk, P., and Mahabadi, S.
(2013). Real-time recommendation of diverse related
articles. In Proceedings of the 22Nd International
Conference on World Wide Web, WWW ’13, pages
1–12, Republic and Canton of Geneva, Switzerland.
International World Wide Web Conferences Steering
Committee.
Abel, F., Gao, Q., Houben, G.-J., and Tao, K. (2011). Ana-
lyzing user modeling on twitter for personalized news
recommendations. In Proceedings of the 19th Interna-
tional Conference on User Modeling, Adaption, and
Personalization, UMAP’11, pages 1–12, Berlin, Hei-
delberg. Springer-Verlag.
Agrawal, R., Gollapudi, S., Halverson, A., and Ieong, S.
(2009). Diversifying search results. In WSDM, pages
5–14.
Carterette, B. and Chandar, P. (2009). Probabilistic models
of ranking novel documents for faceted topic retrieval.
In CIKM, pages 1287–1296.
Chen, J., Nairn, R., Nelson, L., Bernstein, M., and Chi, E.
(2010). Short and tweet: Experiments on recommend-
ing content from information streams. In Proceed-
ings of the SIGCHI Conference on Human Factors in
Computing Systems, CHI ’10, pages 1185–1194, New
York, NY, USA. ACM.
Clarke, C. L. A., Kolla, M., Cormack, G. V., Vechto-
mova, O., Ashkan, A., B
¨
uttcher, S., and MacKinnon,
I. (2008). Novelty and diversity in information re-
trieval evaluation. In SIGIR, pages 659–666.
Gollapudi, S. and Sharma, A. (2009). An axiomatic ap-
proach for result diversification. In Proceedings of the
18th International Conference on World Wide Web,
WWW ’09, pages 381–390, New York, NY, USA.
ACM.
Hassin, R., Rubinstein, S., and Tamir, A. (1997). Approx-
imation algorithms for maximum dispersion. Opera-
tions Research Letters, 21:133–137.
Hong, L. and Davison, B. D. (2010). Empirical study
of topic modeling in twitter. In Proceedings of the
First Workshop on Social Media Analytics, SOMA
’10, pages 80–88, New York, NY, USA. ACM.
Kacimi, M. and Gamper, J. (2011). Diversifying search re-
sults of controversial queries. In Proceedings of the
20th ACM International Conference on Information
and Knowledge Management, CIKM ’11, pages 93–
98, New York, NY, USA. ACM.
Kacimi, M. and Gamper, J. (2012). Mouna: Mining opin-
ions to unveil neglected arguments. In Proceedings of
the 21st ACM International Conference on Informa-
tion and Knowledge Management, CIKM ’12, pages
2722–2724, New York, NY, USA. ACM.
Korte, B. and Hausmann, D. (1978). An analysis of the
greedy heuristic for independence systems. Annals of
Discrete Mathematics, 2:65–74.
Li, Q., Wang, J., Chen, Y. P., and Lin, Z. (2010). User
comments for news recommendation in forum-based
social media. Inf. Sci., 180(24):4929–4939.
Meguebli, B.-L. Y., Kacimi, M., Doan, B.-l., and Popineau,
F. (2014a). Building rich user profiles for personal-
KDIR2014-InternationalConferenceonKnowledgeDiscoveryandInformationRetrieval
478