A 3D USER INTERFACE FOR THE SEMANTIC NAVIGATION
OF WWW INFORMATION
Manuela Angioni, Roberto Demontis, Massimo Deriu and Franco Tuveri
CRS4 - Center for Advanced Studies, Research and Development in Sardinia, Polaris, 09010 Pula (CA), Italy
Keywords: 3D User Interface, Semantic net, WWW Information Retrieval.
Abstract: The automatic creation of a conceptual knowledge map using documents coming from the Web is a very
relevant problem because of the difficulty to distinguish between valid and invalid contents documents. In
this paper we present an improved search engine GUI for displaying and organizing alternative views of
data, by the use of a 3D graphical interface, and a method for organizing search results using a semantic
approach during the storage and retrieval of information. The presented work deals with two main aspects.
The first one regards the semantic aspects of knowledge management, in order to support the user during the
query composition and to supply to him information strictly related to his interests. The second one argues
the advantages coming from the adoption of a 3D user interface, to provide alternative views of data.
1 INTRODUCTION
A web-based search engine responds to a user’s
query with a list of documents. This list can be
viewed as the engine’s model of the user’s idea of
relevance the engine ‘believes’ that the first
document is the most likely to be relevant, the
second is slightly less likely, and so on. In the
specific context of the DART project (Angioni et
al., 2007), that is focused in the development of a
distributed architecture for a semantic search engine,
we think that the answer to a query can be given
providing the user with several kind of results, not
always related in the standard way the search
engines we use today do. A semantic approach for
information storage and retrieval is a way to
represent knowledge in the form of human language,
similar as it is represented in the human mind, but
while hyperlinks have come into widespread use, the
closely related semantic link is not yet widely used.
Semantic networks are a powerfull manner to
represent knowledge. A semantic network is
fundamentally a system for capturing, storing and
transferring information that works much the same
as (and is, in fact, modelled after) the human brain.
It is robust, efficient and flexible. It can grow to
extraordinary complexity, necessitating a
sophisticated approach to knowledge visualization,
balancing the need for simplicity with the full
expressive power of the network. In this paper we
expose a tool for the navigation of semantic
concepts by the use of a 3D user interface and a
semantic approach for information storage and
retrieval, that together permit users to share
information more effectively, and provide reductions
in query formulation and execution. To be effective
this new approach must be used, and thus an
approach that face the user with relevant resources
in reply to a query related to a specific domain of
interest is desirable. This motivation bring us to
develop a prototype, able to support the user during
the query composition and to supply information
strictly related to his interests, considering concepts
and solution related to the semantic aspects of
information retrieval.
2 RELATED WORKS
The issue of visually representing abstract
information by 3D is not new. Relevant backgrounds
for our work can be found in the field of Information
Visualization as well as in best practices techniques
adopted by existing solutions for ontology
representation. The works described below have
been assumed as a starting point in the development
of our project.
256
Angioni M., Demontis R., Deriu M. and Tuveri F. (2008).
A 3D USER INTERFACE FOR THE SEMANTIC NAVIGATION OF WWW INFORMATION.
In Proceedings of the Tenth International Conference on Enterprise Information Systems - HCI, pages 256-261
DOI: 10.5220/0001707202560261
Copyright
c
SciTePress
OntoRama (OntoRama, 2007) is an ontology
browser for RDF models based on a hyperbolic
layout of nodes and arcs. As the nodes in the center
are distributed on more space than those near to the
circumference, they are visualized with a higher
level of detail, while maintaining a reasonable
overview of the peripheral nodes. In addition to this
pseudo-3D space, OntoRama also introduces the
idea of cloned nodes in order to reduce the number
of crossed arcs and enhance the readability. The
duplicate nodes are displayed using an ad-hoc color
in order to avoid confusion. Unfortunately, this
application does not support editing and can only
manage RDF data.
Another interesting work is the OntoViz
(OntoViz, 2007) plug-in displays an ontology as a
graph by exploiting an open source library optimized
for graph visualization (Gansner & North, 1999).
Intuitively, classes and instances are represented as
nodes, while relations are visualized as oriented arcs.
Both nodes and arcs are labelled and displaced in a
way that minimizes overlapping, but not the size of
the graph. Therefore, the navigation of the graph,
enhanced only by magnification and panning tools,
does not provide a good overall view of the
ontology, as the graphical elements easily become
indistinguishable. OntoViz supports visualization of
several disconnected graphs at once. The users can
select a set of classes or instances to visualize.
OntoViz generates graphs that are static and non-
interactive which makes it less suitable for the
visualization of large ontologies.
TGViz (TGVizTab, 2007), similarly to OntoViz,
visualizes Protege (Protege, 2007) ontologies as
graphs. In this case however, the displacement of
nodes and arcs is computed using the spring layout
algorithm implemented in the Java TouchGraph
library (TouchGraph, 2007).
3 PROVIDING SEMANTIC
RELATION TO WEB BASED
KNOWLEDGE
Providing traditional Web searching with semantic
features is an application of the Semantic Web,
based on an explicit representation of semantics
about web resources and real world object. It is
aimed to improve both the proportion of relevant
material actually retrieved and the proportion of
retrieved material that is actually relevant. Recently,
research on information systems has increasingly
focused on how to effectively manage and share data
in a such heterogeneous and distributed
environment. In particular, the investigation of the
Semantic Web as an extension of the actual World
Wide Web, is aimed to make the Web content
machine understandable, allowing agents and
applications to access a variety of heterogeneous
resources (Dolog & W.Nejdl 2007).
It has been proposed to deal with problems such
as information overload and info-smog that are
responsible for the “lost on the net” effect and make
the web content inaccessible Our approach is to
index and retrieve information both in a generic and
in a specific context whether documents can be
mapped or not on ontologies, vocabularies and
thesauri. To achieve this goal, we perform a
semantic analysis process on structured and
unstructured parts of documents. The unstructured
parts need a linguistic analysis and a semantic
interpretation performed by means of Natural
Language Processing (NLP) techniques, while the
structured parts need a specific parser.
3.1 Semantic Analysis Process
A semantic analysis process will be carried out on
the transition from the old style of serving the web-
data visualization to the new style of providing the
3D graphical user interface. Firstly, we increase the
semantic net of WordNet (WordNet, 2007), a lexical
dictionary for the English language that groups
nouns, verbs, adjectives and adverbs into synonyms
sets, called synsets, linked by relations, such as
meronymy, synonymy or hyperonymy/hyponymy,
identifying valid and well-founded conceptual
relations and links contained in documents in order
to build a data structure, composed by concepts and
correlation between concepts and information, to
overlay the result set returned from the search
engine. At the same time we realized the
importance of being able to access a
multidisciplinary structure of documents, evaluating
several solutions like language specific thesaurus or
on-line encyclopedia. To achieve this goal we
choose a multidisciplinary, multilingual, web-based,
free content document encyclopaedia, such as
Wikipedia (Wikipedia, 2007), that contains about
1.900.000 encyclopedic information. We used the
great amount of documents included in Wikipedia to
extract new knowledge and to define a new semantic
net enriching WordNet. We added with new terms,
new associative relations and their classification, as
emphasized in (Harabagiu et al., 1999) where
authors identify several other weaknesses in the
WordNet semantic net constitution. In fact it
A 3D USER INTERFACE FOR THE SEMANTIC NAVIGATION OF WWW INFORMATION
257
contains about 115.000 word “senses”, that means
few number of connections between words related
by topics, not enough respect to the web language
dictionary. A conceptual map built using Wikipedia
pages allows a user to associate a concept to other
ones enriched with some relations that an author
points out. The use of Wikipedia guarantees, with
reasonable certainty, that such conceptual
connection is valid because it is produced by
someone who, at least theoretically, has the skills or
the experience to justify it. Moreover, the rules and
the reviewers controls set up guarantee reliability
and objectivity.
4 INTEGRATING 3D
INFORMATION
VISUALIZATION INTO
SEMANTIC NET
TECHNOLOGIES
One of the critical issue in our research was to
improve the user experience during the phases of
searching and extracting of the concepts. We
investigate new paradigm and model of UI, specially
deepening the study of 3D visualization. Although
the exposition of semantic networks into a 2D map
presentation can provide reasonably good results in
some cases, it is missing some important features to
make a real difference and provide satisfactory
results. Usually a user is presented with a flat panel,
limited in size, where information may be presented
in a form of color patches, sometimes textured, with
different shapes which can be distinguished when
the number of objects presented is low. The user
interaction is limited to selecting a point in the
panel. What we needeed was to exceeds the
capabilities of traditional interaction devices and
two-dimensional displays, allowing the user to
change the point of view, improving the perception
and the understanding of contents (Biström et al.
2005). Our goal was to traverse the net via concept
list views, via their relations, or by retracing the
user's history. The tool we developed, allows to
browse Web resources by means of the map of
concept. Formulating the query through the search
engine, the user can move through the SemanticNet
and extract the concepts which really interest him,
limiting the search field and obtaining a more
specific result. In a 3D space, a user can easily
understand the meaning of an object, simply
rotating, shifting, and moving it (Cellary et al.,
2004). Each node represented in the 3D view is a
concept and arcs are used to showed in order to
represents the different kinds of relations between
the concepts. If the representation is suitable for the
search context, the objects are easy to explore, and
the related information are learned faster and better
(Wiza et al, 2004).
4.1 Building The Semantic Net
In order to build the SemanticNet, we need to
condider terms and their classification. The reason is
that varied mental association of places, events,
persons and things depend on the cultural
backgrounds of the users' personal history. In fact,
the ability to associate a concept to another is
different from person to person. The SemanticNet is
definitely not exhaustive but it is limited by the
dictionary of WordNet, by the contents included in
Wikipedia and by the accuracy of the information
given by the system. Starting from the information
contained in Wikipedia about a term of WordNet,
the system is capable of enriching the SemanticNet
by adding new nodes, links and attributes, such as
IS-A or PART-OF relations. Moreover, the system is
able to classify automatically the textual contents of
web resources, indexed through the Classifier, a
module that uses WordNet Domains (Magnini et al.,
2002) and (Magnini et al., 2004), and applies a
density function, based on the synonyms and
hypernyms frequency (Scott, 1998) and the
computation of the number of synsets related to
each term of the document. In this way, the system
is able to retrieve the most frequently used “senses”
by extracting the synonyms relations given by the
use of similar terms in the document sentences.
Through the categorization of the document
itself it can associate the term the most correct
meaning and can assign a weight to each category
related to the content. In fact, each term in WordNet
has more than one meaning each corresponding to a
Wikipedia page. We therefore need to extract the
specific meaning described in the Wikipedia page
content, in order to build a conceptual map where
the nodes are the “senses” (synset of WordNet or
term+category) and the links are given both by the
WordNet semantical-linguistic relations and by the
conceptual associations built by the Wikipedia
authors. To achieve this goal, the system performs a
syntactic and a semantic disambiguation, as better
described by Angioni, Demontis and Tuveri (2007),
of the textual content of the Wikipedia page and
extracts its meaning associating it to a node
corresponding to the specific “sense” of a term in
WordNet (synset) or, if it does not exists in
ICEIS 2008 - International Conference on Enterprise Information Systems
258
WordNet, adding to the SemanticNet a new node
(term + category).
5 THE DISPLAYABLE DATA
STRUCTURE
As described before, we have enriched the WordNet
semantic net with new terms extracted from
approximately 60.000 Wikipedia articles
corresponding to WordNet terms. Through the
classification of the content of Wikipedia pages ,the
system assigns them a synset ID of WordNet, if it
exists. By analyzing the content of the Wikipedia
pages, a new kind of relation named
“COMMONSENSE” is defined, that delineates the
SemanticNet with the semantic relations “IS-A” and
“PART-OF” given by WordNet. The
COMMONSENSE relation is a connection between
a term and the links defined by the author of the
related Wikipedia page. These relations are
important each time the system give back results to a
query. Results provide the concepts referred in
WordNet or in the Wikipedia derived conceptual
map. The concept map is constituted by 25010 nodes
each corresponding to a “sense” of WordNet and
related to a page of Wikipedia. Starting from these
nodes, 371281 relations were extracted, part of
which are also included in WordNet, but in the most
case they are new relations extracted from
Wikipedia. Terms not contained in WordNet and in
Wikipedia are new nodes of the augmented semantic
net. Terms which meanings are different in
Wikipedia in respect with the meaning existing in
WordNet will be added as new nodes in the
SemanticNet but they will not have a synset ID
given by WordNet.
Figure 1: A view of the 3D UI with COMMONSENSE
relation.
In Figure 1 a portion of the SemanticNet is
described, starting from the node tiger. The
Wikipedia text related to the term is analyzed and
classified under the category Animals. In this way
the system is able to exclude one of the two senses
included in WordNet, the one having the meaning
related with person, and to take into consideration
only the sense related with animal.
So, all the new relations extracted from the page
itself as well as the relation included in WordNet can
be associated to this specific node by the system.
6 ARCHITECTURE OF THE 3D
USER INTERFACE
A major motivation for keeping 3D GUIs for
displaying search results simple is to keep them
user-friendly. In general the 3D UIs for displaying
virtual worlds need more immersive and ad hoc
hardware interfaces, anyway in our case one of the
essential requirements was to guarantee a good
usability by concrete representations and simplicity
(Houston, 2002).
As a consequence we need to keep the user into
a Web context, and allowing her/him to manage
interactive 3D contents within the Web browser.
Recently Web sites that include 3D content, i.e.
Web sites where users navigate and interact (at least
partially) through a 3D graphical interface, are
increasingly employed in different domains, such as
tutoring and training, tourism, e-commerce and
scientific visualization However, while a substantial
body of literature and software tools is available
about making 2D Web sites adaptive, very little has
been published on the problem of personalizing 3D
Web content and interaction.
6.1 Adaptive Manipulation Of 3D Web
Content
To identify the best approach to achieve the task of
adaptive manipulation of 3D Web content, we have
explored several 3D Web technology. The study has
been limited only to W3C languages able to describe
virtual worlds, we have deliberately choose to do not
consider java applets, flash interactive movies, and
other technologies that can be integrated into web
pages to view 3D contents. In particular at the
begining we have evaluated and discarded VRML
(VRML), 3DML (3DML tutorial). The reason to set
aside these languages is that they require plug-in
A 3D USER INTERFACE FOR THE SEMANTIC NAVIGATION OF WWW INFORMATION
259
modules with a partial support to their
specifications.
We also considered SVG (SVG) as another way
to avoid 3D interactive contents, specially because
SVG engines are natively supported by the most
common Web browsers. Unfortunately this language
is not dedicated to 3D graphics and it needs ad-hoc
extensions to process 3D contents, from our point of
view it looks too restrictive because makes the all
process too trivial and do not guarranties a good
level of interactivity. At the end of the analisy we
finally choose the X3D (eXtensible 3D) language
(Web 3D Consortium, 2007) This choice has been
driven by its features, specially because it is is the
ISO open standard for 3D content delivery on the
web supported by a large community of users.
Conceptually, the semantics of X3D describe an
abstract functional behaviour of time-based,
interactive 3D, multimedia information and do not at
all specify a specific software or hardware setup.
The key difference with the other languages is
inherent in the X3D runtime environment, in
particular the scene graph, which is a directed,
acyclic graph containing the objects represented as
nodes and object relationships in the 3D world. The
basic structure of X3D documents is very similar to
any other XML document. All elements are nested
within the X3D tag including the scene graph tag.
6.2 Implementation
The user interface that we have developed is a
software module used according to the user
preferences and the current user context that
provides an interactive scene defined by a X3D
document that represent the selected portion of
SemanticNet. Depending on the search context it
provides a three-dimensional view that guarantees a
better usability in terms of information navigation,
and it could also provide different layouts for
different cases. When the user provides a query
expressed in Natural Language, the system analyse
it, and returns a set of results containing the main
categories and the more representative synset found
If the results don’t fit with the the user need, he
can decide to reformulate a new query using the
SemanticNet in order to enrich the query with new
terms related or semantically near to the original
query
Figure 2: Software Architecture.
The module parts are:
Web Application – it is the interface between the
SemanticNet and the front-end of the 3DUI. Its
tasks are to submit the query originated in the
browser to the SemanticNet, to forward results to
the ResultCollector, and finally to encapsulate
and to send the X3D document to the browser.
ResultCollector – it has the role to collect the
retrieved terms semantically related each other
by the means of a concept list. It analize their
relations, and provide a suitable description
defined by a GraphML file (GraphML Working
Group ) and (Bonnel, 2005).
3D Manager – it receives and parse the
GraphML file that describe the interested portion
of the SemanticNet within its structural
properties including directed, undirected, mixed
and hierarchical graphs, starting from the
singular node. It also generates on the fly the
X3D document that offers an interactive 3D
scene with the search results. Subsequently it
sends the document to the Web application that
makes it available to the user browser.
Figure 3: An example of the GraphML exchange file.
ICEIS 2008 - International Conference on Enterprise Information Systems
260
7 CONCLUSIONS
The goal of search engines is increased usage and
the common wisdom is that a simpler interface
broadens the base of potential users. We have
presented a 3D interface for a web search engine that
permits the user to decide which other “meanings”
add to the original query. We have explored the
opportunity to provide a such software infrastructure
for the management of semantic knowledge
structures in the ambit of the WWW, investigating
new interaction models and design principles in
order to create a better interactive systems. The tool
we developed allow to browse Web resources by
means of the map of concept called “SemanticNet”
built enriching the WordNet semantic net with new
nodes, links and attributes. The importance of
integrating 3D Information Visualization into
semantic network technologies is that 3D may
support semantic search process providing
functionalities to make the information more
accessible and improving their usability. The future
works are focused on improvements in the interface
selection method and development of new interfaces
for the SemanticNet. It is expected that the interface
selection algorithm may be improved by registration
of user interactions, such as preferred visualization
interfaces and options selected in particular
interfaces.
REFERENCES
M. Angioni, et al., 2007, DART: “The Distributed Agent-
Based Retrieval Toolkit”. In Proc. of CEA 07, pp. 425-
433 Gold Coast – Australia.
Ontorama, 2007, from http://www.kvocentral.org/
software/ontorama.html
Ontoviz tab, 2007: Visualizing protege ontologies.
from http://protege.cim3.net/cgi-bin/wiki.pl?OntoViz
Gasner and North, 1999, . “An open graph visualization
system and its applications to software engineering”In
Software Practice and Experience , pp 30, 11,
1203.1233.
TgVizTab,2007, from http://users.ecs.soton.ac.uk/
ha/TGVizTab/
Protege, 2007, from http://protege.stanford.edu/
Touchgraph, from. http://www.touchgraph.com/
P.Dolog & W.Nejdl. “Semantic Web Technologies for the
Adaptive Web”. pp. 697-719.-2007
WordNet, 2007, from http://wordnet.princeton.edu
Wikipedia, 2007, from http://www.wikipedia.org
Harabagiu, Miller, Moldovan, 1999, WordNet 2 “A
Morphologically and Semantically Enhanced
resource”. Workshop SIGLEX'99: Standardizing
Lexical Resources.
Biström et al (2005). “Post-WIMP User Interface Model
for 3D Web Applications”. In Research Seminar on
Digital Media. Telecommunications Software and
Multimedia Laboratory, Helsinki University of
Technology. Finland, 2005
W.Cellary, et al. "Visualizing Web Search Results in 3D,"
In Computer, vol.37, no.5, pp. 87-89, May, 2004
W.Wiza, et al. (2004). "Periscope: a system for adaptive
3D visualization of search results". In Web3D '04:
Proceedings of the ninth international conference on
3D Web technology, pp. 29-40, New York, NY, USA.
ACM Press.
B. Magnini, C. Strapparava, G. Pezzulo, A.Gliozzo, “The
Role of Domain Information in Word Sense
Disambiguation. Natural Language Engineering”,
special issue on Word Sense Disambiguation, 8(4), pp.
359-373, Cambridge University Press. 2002
B. Magnini, C. Strapparava, “User Modelling for News
Web Sites with Word Sense Based Techniques”. User
Modeling and User-Adapted Interaction 14(2), pp.
239-257 - 2004
S. Scott, S. Matwin, “Text Classification using WordNet
Hypernyms”. COLING/ACL Workshop on Usage of
WordNet in Natural Language Processing Systems,
Montreal, 1998
M. Angioni, R. Demontis, F. Tuveri, “Enriching WordNet
to Index and Retrieve Semantic Information”. In 2nd
International Conference on Metadata and Semantics
Research, 11-12 October, Ionian Academy, Corfu,
Greece. 2007
B. Houston, Z. Jacobson, “A Simple 3D Visual Text
Retrieval Interface”. In TRO-MP-050 - Multimedia
Visualization of Massive Military Datasets. Workshop
Proceedings. 2002
VRML Virtual Reality Modeling Language,
http://www.w3.org/MarkUp/VRML/
3DML tutorial, http://www.flatland.net/blog/?page_id=48
Scalable Vector Graphics (SVG), http://www.w3.org/
Graphics/SVG/
Web 3D Consortium – Overnet, 2007, from
http://www.web3d.org.
GraphML Working Group: The GraphML file format.
http://graphml.graphdrawing.org/
N. Bonnel, A. Cotarmanac’h, A.Morin, 2005, Meaning
Metaphor for Visualizing Search Results, in
International Conference on Information Visualisation,
IEEE Computer Society, pp. 467–472.
A 3D USER INTERFACE FOR THE SEMANTIC NAVIGATION OF WWW INFORMATION
261