Then the predictions of the classifiers were calcu-
lated for the test set and stored in the data base. In
the next step the users rated the status updates in their
test set, again on a binary scale. For each classifier we
determined the precision, recall and f-measure.
In the next section we compare the results of our
method using a feature space without the topic model
against the results using a feature space that includes
the topic model. It would have been worthwhile to
compare these results with Facebooks EdgeRank TM
algorithm, however to the best of our knowledge a
user’s news feed filtered according to the EdgeRank
TM algorithm is not available via the Graph API.
4.3 Results
The SVM trained with the feature space including the
topic model made better results in terms of our se-
lected measures. Here our approach achieved an av-
erage precision of 0.51 at an average recall of 0.47
and an average f-measure of 0.48. It should be noted
that for two participants the precision and recall val-
ues were 0. When talking to those users after the ex-
periment, it turned out that they did not have many rel-
evant status updates in their training set thus making
the learning of a model with a high predictive accu-
racy difficult. We assume that with more training data
the filtering for these participants could be improved.
The results for the SVM trained with the feature
space without topic model were slightly worse. In
that setting we achieved an average precision of 0.46
at an average recall of 0.38 and an average f-measure
of 0.39. Using the reduced feature space three partic-
ipants had precision and recall values of 0.
For both settings six of the ten participants could
achieve an f-measure of 0.5 or better. The results for
each participant and the two feature spaces applied
are depicted in Table 1.
5 CONCLUSIONS
In this paper we presented an approach to identify
the relevant status updates in a user’s Facebook news
feed. The method combines features based on the
interactions with status updates together with met-
rics from the field of Social Network Analysis, and
an LDA topic model as input for a Support Vector
Machine. A first evaluation conducted as laboratory
study suggests that the approach can lead to satisfying
results for a large number of users. However a larger
field study is needed to further prove the usefulness of
the filtering algorithm.
Table 1: Results of the evaluation experiment (PRC: Preci-
sion, RCL: Recall, F: F-Measure). Each row represents the
results of one participant. The last row shows the column
averages.
W/o Topic Model Topic Model
PRC RCL F PRC RCL F
0.667 0.4 0.5 0.545 0.6 0.571
1 0.571 0.727 0.75 0.857 0.8
0 0 0 0.833 0.5 0.625
0 0 0 0 0 0
0.714 0.714 0.714 0.722 0.929 0.812
0.529 0.818 0.643 0.5 0.545 0.522
0 0 0 0 0 0
0.455 0.556 0.500 0.429 0.333 0.375
0.5 0.083 0.143 0.5 0.25 0.333
0.733 0.688 0.71 0.786 0.688 0.733
0.46 0.383 0.394 0.507 0.47 0.477
ACKNOWLEDGEMENTS
This research has been funded by the Investitionsbank
Berlin in the project “Voice2Social”, and co-financed
by the European Regional Development Fund.
REFERENCES
Blei, D. M., Ng, A. Y., and Jordan, M. I. (2003). Latent
dirichlet allocation. Journal of Machine Learning Re-
search, 3:993–1022.
Boccara, N. (2008). Models of opinion formation: influence
of opinion leaders. Int. J. Mod. Phys. C, 19(1):93–109.
Cortes, C. and Vapnik, V. (1995). Support-vector networks.
Mach. Learn., 20(3):273–297.
Kleinberg, J. M. (1998). Authoritative Sources in a Hyper-
linked Environment. In Proceedings of the 9th An-
nual ACM-SIAM Symposium on Discrete Algorithms,
pages 668–677. AAAI Press.
Page, L., Brin, S., Motwani, R., and Winograd, T. (1998).
The pagerank citation ranking: Bringing order to the
web. Technical report, Stanford University.
Roch, C. H. (2005). The dual roots of opinion leadership.
Journal of Politics, 67(1):110–131.
Weng, J., Lim, E.-P., Jiang, J., and He, Q. (2010). Twit-
terrank: finding topic-sensitive influential twitterers.
In Proceedings of the third ACM international con-
ference on Web search and data mining, WSDM ’10,
pages 261–270, New York, NY, USA. ACM.
ICAART2013-InternationalConferenceonAgentsandArtificialIntelligence
356