4 CONCLUSIONS
In this paper, we have presented the NOESIS network
data mining framework. NOESIS is open source and
lightweight. It can be used as an stand-alone network
analysis tool, using the provided graphical user inter-
face, or as a reusable library in other software devel-
opment projects, since it is distributed under a permis-
sive BSD free software license. It is available at the
NOESIS project web page: http://noesis.ikor.org.
NOESIS algorithms are implemented using struc-
tured parallel programming patterns, which enable an
effective use of the available computing resources.
The framework is built on top of a hardware abstrac-
tion layer that provides parallelization mechanisms
and hides their underlying complexity. In the future,
it will let programmers execute their algorithms in a
fully distributed computing system, such as a server
farm or the cloud, in a fully-transparent way.
The NOESIS framework is evolving and new data
mining techniques are scheduled to be developed in
the future, from overlapping community detection
methods to quasi-local link scoring and prediction
techniques, as well as additional graph layout tech-
niques. Since the NOESIS graphical user interface is
based on a model-driven application generator, cre-
ating ports of the application generator to other plat-
forms, such as Android or the Web, will automatically
enable the use of the NOESIS GUI in those platforms.
NOESIS is in constant development and improve-
ment. Our goal is to provide the most complete open
source network data mining framework, while main-
taining its ease of use and hiding the complexity of the
underlying execution environment so that even non-
expert programmers can develop their own modules
and network analysis techniques.
ACKNOWLEDGEMENTS
This work is partially supported by the Spanish Min-
istry of Economy and the European Regional Devel-
opment Fund (FEDER), under grant TIN2012-36951,
and the Ministry of Education of Spain under the
program “Ayudas para contratos predoctorales para
la formaci
´
on de doctores 2013” (grant BES-2013-
064699). We are grateful to Aar
´
on Rosas, Francisco-
Javier Gij
´
on, and Julio-Omar Palacio for their contri-
butions to the implementation of community detec-
tion methods.
REFERENCES
Albert, R. and Barab
´
asi, A.-L. (2002). Statistical mechanics
of complex networks. Reviews of Modern Physics,
74(1):47.
Bastian, M., Heymann, S., Jacomy, M., et al. (2009). Gephi:
an open source software for exploring and manipulat-
ing networks. ICWSM, 8:361–362.
Batagelj, V. and Mrvar, A. (1998). Pajek-program for large
network analysis. Connections, 21(2):47–57.
Berzal, F., Blanco, I., Cubero, J.-C., and Marin, N. (2002).
Component-based data mining frameworks. Commu-
nications of the ACM, 45(12):97–100.
Fortunato, S. (2010). Community detection in graphs.
Physics Reports, 486(3):75–174.
Herman, I., Melanc¸on, G., and Marshall, M. S. (2000).
Graph visualization and navigation in information vi-
sualization: A survey. Visualization and Computer
Graphics, IEEE Transactions on, 6(1):24–43.
Jackson, M. O. et al. (2008). Social and economic networks.
Princeton University Press Princeton.
Lancichinetti, A. and Fortunato, S. (2009). Community de-
tection algorithms: a comparative analysis. Physical
Review E, 80(5):056117.
Liben-Nowell, D. and Kleinberg, J. (2007). The link-
prediction problem for social networks. Journal of the
American society for information science and technol-
ogy, 58(7):1019–1031.
L
¨
u, L. and Zhou, T. (2011). Link prediction in complex
networks: A survey. Physica A: Statistical Mechanics
and its Applications, 390(6):1150–1170.
Martin, R. C. (2003). Agile Software Development: Princi-
ples, Patterns, and Practices. Prentice Hall PTR.
Newman, M. (2010). Networks: An Introduction. Oxford
University Press.
Palla, G., Der
´
enyi, I., Farkas, I., and Vicsek, T. (2005).
Uncovering the overlapping community structure of
complex networks in nature and society. Nature,
435(7043):814–818.
Shannon, P., Markiel, A., Ozier, O., Baliga, N. S., Wang,
J. T., Ramage, D., Amin, N., Schwikowski, B., and
Ideker, T. (2003). Cytoscape: a software environment
for integrated models of biomolecular interaction net-
works. Genome research, 13(11):2498–2504.
Smith, M. A., Shneiderman, B., Milic-Frayling, N.,
Mendes Rodrigues, E., Barash, V., Dunne, C.,
Capone, T., Perer, A., and Gleave, E. (2009). Analyz-
ing (social media) networks with nodexl. In Proceed-
ings of the fourth international conference on Com-
munities and technologies, pages 255–264. ACM.
Tamassia, R. (2013). Handbook of graph drawing and vi-
sualization. CRC press.
Wasserman, S. and Faust, K. (1994). Social network analy-
sis: Methods and applications. Cambridge University
Press.
Yang, J. and Leskovec, J. (2013). Overlapping community
detection at scale: A nonnegative matrix factorization
approach. In Proceedings of the Sixth ACM Interna-
tional Conference on Web Search and Data Mining,
WSDM ’13, pages 587–596. ACM.
The NOESIS Open Source Framework for Network Data Mining
321