Table 4: Performance results on the Facebook dataset. Best results for each row are marked in bold.
Measures Standard Greedy-WeBA Overlapping Greedy-WeBA Overlapping Greedy-WeBA with Node-Attributes
Recall (average) 27,85 ± 2,01 40,02 ± 1,92 55,60 ± 2,47
Precision (average) 32,10 ± 3,30 37,16 ± 3,05 48,75 ± 2,99
F1 Score (average) 29,80 ± 1,40 36,45 ± 1,73 51,99 ± 1,54
Jaccard Index (average) 17,48 ± 0,78 22,64 ± 1,66 35,19 ± 1,44
ping communities is fundamental for detecting the
correct groups of users in social networks, where of-
ten users can belong to several social circles (due to
various interests, hobbies or relationships). Moreover,
we showed that the inclusion of node attributes can
provide important additional information, leading to
results which better fit the real communities.
There are several possible directions for future
work. For instance, we would like to improve the cur-
rent algorithm by including a method for automatic
inferring the best kernel-size. Moreover, we would
like to study how the community kernels change di-
namically over time, and how this affects auxiliary
communities.
REFERENCES
Ahn, Y.-Y., Bagrow, J. P., and Lehmann, S. (2010). Link
communities reveal multiscale complexity in net-
works. Nature, 466(7307):761–764.
Chang, J. and Blei, D. M. (2009). Relational topic models
for document networks. In International Conference
on Artificial Intelligence and Statistics, pages 81–88.
Donetti, L. and Munoz, M. A. (2004). Detecting network
communities: a new systematic and efficient algo-
rithm. Journal of Statistical Mechanics: Theory and
Experiment, 2004(10):P10012.
Du, N., Wu, B., Pei, X., Wang, B., and Xu, L. (2007). Com-
munity detection in large-scale social networks. In
Proceedings of the 9th WebKDD and 1st SNA-KDD
2007 workshop on Web mining and social network
analysis, pages 16–25. ACM.
G
¨
unnemann, S., Boden, B., F
¨
arber, I., and Seidl, T. (2013).
Efficient mining of combined subspace and subgraph
clusters in graphs with feature vectors. In Advances in
Knowledge Discovery and Data Mining, pages 261–
275. Springer.
Gunnemann, S., Farber, I., Boden, B., and Seidl, T. (2010).
Subspace clustering meets dense subgraph mining: A
synthesis of two paradigms. In Data Mining (ICDM),
2010 IEEE 10th International Conference on, pages
845–850. IEEE.
Leskovec, J., Lang, K. J., Dasgupta, A., and Mahoney,
M. W. (2008). Statistical properties of community
structure in large social and information networks. In
Proceedings of the 17th international conference on
World Wide Web, pages 695–704. ACM.
Leskovec, J. and Mcauley, J. J. (2012). Learning to discover
social circles in ego networks. In Advances in neural
information processing systems, pages 539–547.
Liu, Y., Niculescu-Mizil, A., and Gryc, W. (2009). Topic-
link lda: joint models of topic and author community.
In proceedings of the 26th annual international con-
ference on machine learning, pages 665–672. ACM.
Mishra, N., Schreiber, R., Stanton, I., and Tarjan, R. E.
(2008). Finding strongly knit clusters in social net-
works. Internet Mathematics, 5(1-2):155–174.
Newman, M. E. (2004a). Detecting community struc-
ture in networks. The European Physical Journal B-
Condensed Matter and Complex Systems, 38(2):321–
330.
Newman, M. E. (2004b). Fast algorithm for detecting
community structure in networks. Physical review E,
69(6):066133.
Newman, M. E. (2006a). Finding community structure in
networks using the eigenvectors of matrices. Physical
review E, 74(3):036104.
Newman, M. E. (2006b). Modularity and community
structure in networks. Proceedings of the National
Academy of Sciences, 103(23):8577–8582.
Papadimitriou, S., Sun, J., Faloutsos, C., and Philip, S. Y.
(2008). Hierarchical, parameter-free community dis-
covery. In Machine Learning and Knowledge Discov-
ery in Databases, pages 170–187. Springer.
Papadopoulos, S., Kompatsiaris, Y., Vakali, A., and Spyri-
donos, P. (2012). Community detection in social
media. Data Mining and Knowledge Discovery,
24(3):515–554.
Rosvall, M. and Bergstrom, C. T. (2007). An information-
theoretic framework for resolving community struc-
ture in complex networks. Proceedings of the Na-
tional Academy of Sciences, 104(18):7327–7331.
Wang, L., Lou, T., Tang, J., and Hopcroft, J. E. (2011). De-
tecting community kernels in large social networks. In
Data Mining (ICDM), 2011 IEEE 11th International
Conference on, pages 784–793. IEEE.
Xie, J. and Szymanski, B. (2012). Towards linear time over-
lapping community detection in social networks. In
Tan, P.-N., Chawla, S., Ho, C., and Bailey, J., editors,
Advances in Knowledge Discovery and Data Mining,
volume 7302 of Lecture Notes in Computer Science,
pages 25–36. Springer Berlin Heidelberg.
Yang, J. and Leskovec, J. (2013). Overlapping community
detection at scale: A nonnegative matrix factorization
approach. In Proceedings of the Sixth ACM Interna-
tional Conference on Web Search and Data Mining,
WSDM ’13, pages 587–596. ACM.
Yang, J., McAuley, J., and Leskovec, J. (2013). Community
detection in networks with node attributes. In Data
Mining (ICDM), 2013 IEEE 13th International Con-
ference on, pages 1151–1156. IEEE.
KDIR 2015 - 7th International Conference on Knowledge Discovery and Information Retrieval
524