Big Graph-based Data Visualization Experiences - The WordNet Case Study

Enrico Caldarola, Antonio Picariello, Antonio M. Rinaldi

2015

Abstract

In the Big Data era, the visualization of large data sets is becoming an increasingly relevant task due to the great impact that data have from a human perspective. Since visualization is the closer phase to the users within the data life cycle's phases, there is no doubt that an effective, efficient and impressive representation of the analyzed data may result as important as the analytic process itself. This paper presents an experience for importing, querying and visualizing graph database and in particular, we describe as a case study the WordNet database using Neo4J and Cytoscape. We will describe each step in this study focusing on the used strategies for overcoming the different problems mainly due to the intricate nature of the case study. Finally, an attempt to define some criteria to simplify the large-scale visualization of WordNet will be made, providing some examples and considerations which have arisen.

References

  1. Beaudoin, L., Parent, M.-A., and Vroomen, L. C. (1996). Cheops: A compact explorer for complex hierarchies. In Visualization'96. Proceedings., pages 87-92. IEEE.
  2. Bederson, B. B. and Shneiderman, B. (2003). The craft of information visualization: readings and reflections. Morgan Kaufmann.
  3. Caldarola, E. G., Picariello, A., and Castelluccia, D. (2015). Modern enterprises in the bubble: Why big data matters. ACM SIGSOFT Software Engineering Notes, 40(1):1-4.
  4. Caldarola, E. G., Sacco, M., and Terkaj, W. (2014). Big data: The current wave front of the tsunami. ACS Applied Computer Science, 10(4):7-18.
  5. Carriere, J. and Kazman, R. (1995). Research report. interacting with huge hierarchies: beyond cone trees. In Information Visualization, 1995. Proceedings., pages 74-81. IEEE.
  6. Collins, C. (2006). Wordnet explorer: applying visualization principles to lexical semantics. Computational Linguistics Group, Department of Computer Science, University of Toronto, Toronto, Ontario, Canada.
  7. Collins, C. (2007). Docuburst: Radial space-filling visualization of document content. Knowledge Media Design Institute, University of Toronto, Technical Report KMDI-TR-2007-1.
  8. Eades, P. (1984). A heuristics for graph drawing. Congressus numerantium, 42:146-160.
  9. Fayyad, U. M., Wierse, A., and Grinstein, G. G. (2002). Information visualization in data mining and knowledge discovery. Morgan Kaufmann.
  10. Fellbaum, C. (1998). Wordnet. The Encyclopedia of Applied Linguistics.
  11. Finlayson, M. A. (2013). MIT Java Wordnet Interface (JWI) User's Guide, Version 2.2.x.
  12. Finlayson, M. A. (2014). Java libraries for accessing the princeton wordnet: Comparison and evaluation. Proceedings of the 7th Global Wordnet Conference, Tartu, Estonia.
  13. Friendly, M. and Denis, D. J. (2001). Milestones in the history of thematic cartography, statistical graphics, and data visualization. URL http://www.datavis.ca/milestones.
  14. Fruchterman, T. M. and Reingold, E. M. (1991). Graph drawing by force-directed placement. Softw., Pract. Exper., 21(11):1129-1164.
  15. Gansner, E. R. and North, S. C. (1998). Improved forcedirected layouts. In Graph Drawing, pages 364-373. Springer.
  16. Holzschuher, F. and Peinl, R. (2013). Performance of graph query languages: comparison of cypher, gremlin and native access in neo4j. In Proceedings of the Joint EDBT/ICDT 2013 Workshops, pages 195-204. ACM.
  17. Huang, W., Hong, S.-H., and Eades, P. (2007). Effects of sociogram drawing conventions and edge crossings in social network visualization. J. Graph Algorithms Appl., 11(2):397-429.
  18. Johnson, B. and Shneiderman, B. (1991). Tree-maps: A space-filling approach to the visualization of hierarchical information structures. In Visualization, 1991. Visualization'91, Proceedings., IEEE Conference on, pages 284-291. IEEE.
  19. Kamps, J. and Marx, M. (2002). Visualizing wordnet structure. Proc. of the 1st International Conference on Global WordNet, pages 182-186.
  20. Kobourov, S. G. (2012). Spring embedders and force directed graph drawing algorithms. arXiv preprint arXiv:1201.3011.
  21. Mathur, A. and Dalal, D. (2015). APIARY: A Case for Neo4J? Equal Experts Labs.
  22. Mazza, R. (2009). Introduction to information visualization. Springer Science & Business Media.
  23. Miller, G. A. (1995). WORDNET: a Lexical Database for English. Communications of the ACM, 38(11):39-41.
  24. Munzner, T. (1997). H3: Laying out large directed graphs in 3d hyperbolic space. In Information Visualization, 1997. Proceedings., IEEE Symposium on, pages 2-10. IEEE.
  25. Munzner, T. (2008). Process and pitfalls in writing information visualization research papers. In Information visualization, pages 134-153. Springer.
  26. Purchase, H. (1997). Which aesthetic has the greatest effect on human understanding? In Graph Drawing, pages 248-261. Springer.
  27. Purchase, H. C., Cohen, R. F., and James, M. (1996). Validating graph drawing aesthetics. In Graph Drawing, pages 435-446. Springer.
  28. Reingold, E. M. and Tilford, J. S. (1981). Tidier drawings of trees. Software Engineering, IEEE Transactions on, (2):223-228.
  29. Rinaldi, A. M. (2008). A content-based approach for document representation and retrieval. In Proceedings of the eighth ACM symposium on Document engineering, pages 106-109. ACM.
  30. Rinaldi, A. M. (2012). Improving tag clouds with ontologies and semantics. In Database and Expert Systems Applications (DEXA), 2012 23rd International Workshop on, pages 139-143. IEEE.
  31. Rinaldi, A. M. (2014). A multimedia ontology model based on linguistic properties and audio-visual features. Information Sciences, 277:234-246.
  32. Robinson, I., Webber, J., and Eifrem, E. (2013). Graph databases. ” O'Reilly Media, Inc.”.
  33. Spence, R. (2001). Information visualization, volume 1. Springer.
  34. Tutte, W. T. (1963). How to draw a graph. Proc. London Math. Soc, 13(3):743-768.
  35. Van Bruggen, R. (2014). Learning Neo4j. Packt Publishing Ltd.
  36. Ware, C. (2012). Information visualization: perception for design. Elsevier.
  37. Webber, J. (2012). A programmatic introduction to neo4j. In Proceedings of the 3rd annual conference on Systems, Programming, and Applications: Software for Humanity, pages 217-218. ACM.
  38. Wills, G. J. (1997). Nicheworks-interactive visualization of very large graphs. In Graph Drawing, pages 403-414. Springer.
Download


Paper Citation


in Harvard Style

Caldarola E., Picariello A. and Rinaldi A. (2015). Big Graph-based Data Visualization Experiences - The WordNet Case Study . In Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2015) ISBN 978-989-758-158-8, pages 104-115. DOI: 10.5220/0005632201040115


in Bibtex Style

@conference{kdir15,
author={Enrico Caldarola and Antonio Picariello and Antonio M. Rinaldi},
title={Big Graph-based Data Visualization Experiences - The WordNet Case Study},
booktitle={Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2015)},
year={2015},
pages={104-115},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005632201040115},
isbn={978-989-758-158-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2015)
TI - Big Graph-based Data Visualization Experiences - The WordNet Case Study
SN - 978-989-758-158-8
AU - Caldarola E.
AU - Picariello A.
AU - Rinaldi A.
PY - 2015
SP - 104
EP - 115
DO - 10.5220/0005632201040115