Web Tool Over the Dedicated API. We have imple-
mented a basic web tool for the experimental analysis
of the presented technique. The web tool is based on
the client-server architecture. The server side of the
system is the implementation of the pre-processing
technique as described above, where the input dataset
is transformed into one continuous graph, stored,
indexed and made available through the API. The
client side consists of simple user interface that di-
rectly utilize the API, and depicts the user navigation
actions, such as panning and scrolling, into spatial
queries with respect to the two-dimensional space be-
fore sending the request to the server.
4.1 API Efficiency
Metrics. We evaluate the efficiency of the technique
based on the time needed to retrieve the nodes from
the API and present them at the web tool after a spatial
query, measured in msecs. The time presented is the
figures is the time needed for the query execution, the
rendering time for the first elements to appear on the
screen and the total response time of the system when
all the elements are rendered.
Methodology. We render randomly selected parts
of the datasets using spatial queries with rectan-
gular bounding boxes ranging from 500x500 px to
4000x4000 px. As the size of the area increases the
spatial queries on the dataset match larger number of
graph elements, allowing us to examine the response
time over a variation of total rendered graph elements.
The experiments present the average results of a series
of one hundred repetitions of the graph rendering for
each rectangle size.
Results. In Figure 2 we present the results for the
synthetic datasets. In all cases the total time is closely
connected to the number of rendered elements. The
system renders more than 500 graph elements in less
than two seconds and up to 5200 graph elements with-
out causing lagging, performance issues or hindering
the user experience, as shown in 2 (e). The fact that
so many graph elements can be rendered smoothly,
is of high importance, as similar systems have lim-
its to the number of presented elements. In Figure 3
we examine the average time needed for the render-
ing of one graph element. In Figure 3 (a) we show
that the average rendering time is not dependent on
the density of the input dataset, while in Figure 3 (b)
we show that it is not dependent on the size of the
input dataset. These experiments prove that our tech-
nique scales efficiently for any size or density of the
input dataset and supports the exploration of the in-
formation for datasets with millions of nodes without
any performance issues.
5 CONCLUSIONS
In this paper, we present a novel technique for the pre-
processing very large datasets with hundreds of mil-
lions of elements and their representation as graphs in
the two-dimensional space. Our technique has been
designed in a way to meet all the identified challenges
regarding exploration needs and user experience. The
presented technique process large real datasets with
millions of elements as well as dense graphs with
high node degree. The technique does not impose
any restrictions on the dataset while the information
is offered through a dedicated API that supports many
functionalities, including keyword search, path explo-
ration and neighbor information.
REFERENCES
Abello, J., van Ham, F., and Krishnan, N. (2006). ASK-
GraphView: A Large Scale Graph Visualization Sys-
tem. TVCG, 12(5).
Archambault, D., Munzner, T., and Auber, D. (2007).
Grouse: Feature-Based, Steerable Graph Hierarchy
Exploration. In EuroVis.
Atemezing, G. A. and Troncy, R. (2014). Towards a linked-
data based visualization wizard. In COLD.
Auber, D. (2004). Tulip - A Huge Graph Visualization
Framework. In Graph Drawing Software.
Benedetti, F., Po, L., and Bergamaschi, S. (2014). A Visual
Summary for Linked Open Data sources. In ISWC.
Berners-Lee, T., Chen, Y., Chilton, L., Connolly, D., Dha-
naraj, R., Hollenbach, J., Lerer, A., and Sheets, D.
(2006). Tabulator: Exploring and analyzing linked
data on the semantic web. In Proceedings of the
3rd international semantic web user interaction work-
shop, volume 2006, page 159. Citeseer.
Brunetti, J. M., Gil, R., and Garc
´
ıa, R. (2012). Facets and
Pivoting for Flexible and Usable Linked Data Explo-
ration. In Interacting with Linked Data Workshop.
Camarda, D. V., Mazzini, S., and Antonuccio, A.
(2012). LodLive, exploring the web of data. In I-
SEMANTICS.
De Araujo, S. F. and Schwabe, D. (2009). Explorator: a
tool for exploring rdf data through direct manipula-
tion. In Linked data on the web WWW2009 workshop
(LDOW2009).
Dokulil, J. and Katreniakov
´
a, J. (2009). Using Clusters in
RDF Visualization. In Advances in Semantic Process-
ing.
Falconer, S., Callendar, C., and Storey, M.-A. (2010). A Vi-
sualization Service for the Semantic Web. In Knowl-
edge Engineering and Management by the Masses.
Hastrup, T., Cyganiak, R., and Bojars, U. (2008). Browsing
Linked Data with Fenfire. In WWW.
Heim, P., Ertl, T., and Ziegler, J. (2010a). Facet graphs:
Complex semantic querying made easy. In Extended
Semantic Web Conference, pages 288–302. Springer.
ICEIS 2020 - 22nd International Conference on Enterprise Information Systems
114