8 CONCLUSIONS
This paper proposes a scalable substructure discov-
ery algorithm for HoMLNs using the decoupling-
based strategy. A generic Map/Reduce algorithm
was introduced for the parallel processing of lay-
ers and then composing the results, again, using a
Map/Reduce framework. The basic components of
substructure discovery - substructure expansion, com-
bining substructures from each layer to generate sub-
structures spanning layers (composition function),
duplicate elimination, and counting isomorphic sub-
structures - were incorporated into the Map/Reduce
framework. Empirical correctness was established.
Extensive experimental analysis was performed on di-
verse synthetic and real-world graph datasets.
ACKNOWLEDGMENTS
This work was supported by NSF awards #1955798
and #1916084.
REFERENCES
Berenstein, A., Magarinos, M. P., Chernomoretz, A., and
Aguero, F. (2016). A multilayer network approach
for guiding drug repositioning in neglected diseases.
PLOS.
Boccaletti, S., Bianconi, G., Criado, R., del Genio, C.,
G
´
omez-Garde
˜
nes, J., Romance, M., Sendi
˜
na-Nadal,
I., Wang, Z., and Zanin, M. (2014). The structure and
dynamics of multilayer networks. Physics Reports.
Boden, B., G
¨
unnemann, S., Hoffmann, H., and Seidl, T.
(2012). Mining coherent subgraphs in multi-layer
graphs with edge labels. In Proceedings of the 18th
ACM SIGKDD International Conference on Knowl-
edge Discovery and Data Mining, KDD ’12, pages
1258–1266. ACM.
Bringmann, B. and Nijssen, S. (2008). What is frequent in
a single graph? In Pacific-Asia Conf. on Knowl. Disc.
and Data Mining (PAKDD), pages 858–863. Springer.
Cook, D. J. and Holder, L. B. (1993). Substructure dis-
covery using minimum description length and back-
ground knowledge. Journal of Artificial Intelligence
Research, 1:231–255.
Das, S. and Chakravarthy, S. (2015). Partition and con-
quer: Map/reduce way of substructure discovery. In
International Conference on Big Data Analytics and
Knowledge Discovery, pages 365–378. Springer.
Das, S. and Chakravarthy, S. (2018). Duplicate reduction in
graph mining: Approaches, analysis, and evaluation.
IEEE Trans. Knowl. Data Eng., 30(8):1454–1466.
De Domenico, M., Sol
´
e-Ribalta, A., G
´
omez, S., and Are-
nas, A. (2014). Navigability of interconnected net-
works under random failures. Proc. of Ntl. Acad. of
Sciences.
Domenico, M. D., Nicosia, V., Arenas, A., and Latora, V.
(2014). Layer aggregation and reducibility of multi-
layer interconnected networks. CoRR, abs/1405.0425.
Elseidy, M., Abdelhamid, E., Skiadopoulos, S., and Kalnis,
P. (2014). Grami: Frequent subgraph and pattern min-
ing in a single large graph. Proceedings of the VLDB
Endowment, 7(7):517–528.
Ketkar, N. S., Holder, L. B., and Cook, D. J. (2005). Sub-
due: Compression-based frequent pattern discovery in
graph data. In Proceedings of the 1st Int. Workshop on
open source data mining: frequent pattern mining im-
plementations, pages 71–76.
Kivel
¨
a, M., Arenas, A., Barthelemy, M., Gleeson, J. P.,
Moreno, Y., and Porter, M. A. (2013). Multilayer net-
works. CoRR, abs/1309.7233.
Leskovec, J. and Krevl, A. (2014a). LiveJournal - SNAP
Datasets: Stanford large network dataset collection.
http://snap.stanford.edu/data/com-LiveJournal.html.
Leskovec, J. and Krevl, A. (2014b). Orkut - SNAP Datasets:
Stanford large network dataset collection. http://snap.
stanford.edu/data/com-Orkut.html.
Liu, G. and Wong, L. (2008). Effective pruning techniques
for mining quasi-cliques. In Joint European confer-
ence on machine learning and knowledge discovery
in databases, pages 33–49. Springer.
Padmanabhan, S. and Chakravarthy, S. (2009). Hdb-
subdue: A scalable approach to graph mining. In
International Conference on Data Warehousing and
Knowledge Discovery, pages 325–338. Springer.
Pavel, H. R., Roy, A., Santra, A., and Chakravarthy, S.
(2023). Closeness centrality detection in homoge-
neous multilayer networks. In Proceedings of the 15th
International Joint Conference on Knowledge Discov-
ery, Knowledge Engineering and Knowledge Manage-
ment, IC3K 2023, KDIR.
Rao, R. B. and Lu, S. C. (1992). Learning engineering mod-
els with the minimum description length principle. In
AAAI, pages 717–722.
Santra, A., Bhowmick, S., and Chakravarthy, S. (2017a).
Efficient community re-creation in multilayer net-
works using boolean operations. In International Con-
ference on Computational Science, ICCS 2017, 12-14
June 2017, Zurich, Switzerland, pages 58–67.
Santra, A., Bhowmick, S., and Chakravarthy, S. (2017b).
Hubify: Efficient estimation of central entities across
multiplex layer compositions. In IEEE International
Conference on Data Mining Workshops.
Santra, A., Komar, K., Bhowmick, S., and Chakravarthy, S.
(2022). From base data to knowledge discovery – a
life cycle approach – using multilayer networks. Data
& Knowledge Engineering, page 102058.
Wang, W., Wang, C., Zhu, Y., Shi, B., Pei, J., Yan, X.,
and Han, J. (2005). Graphminer: a structural pattern-
mining system for large disk-based graph databases
and its applications. In Proceedings of ACM SIGMOD
2005, pages 879–881.
Yang, S., Yan, X., Zong, B., and Khan, A. (2012). Towards
effective partition management for large graphs. In
SIGMOD Conference, pages 517–528.
MLN-Subdue: Substructure Discovery In Homogeneous Multilayer Networks
47