communication is inadequate with respect to the
SOBE community dimension.
The objective of this paper is to present the
whole work, by also adding with respect to
(Barbagallo et al., 2010) an overview of the platform
architecture and a case study in the context of the
COIN European project. The rest of the paper is
organized as follows. In Section 2, we describe the
social ontology evolution process on which SOBE is
based. In Section 3, we present the platform
architecture and the modules that perform the
various steps of the process. Section 4 is dedicated
to the presentation of a case of study in which we
carried out an experimentation of the SOBE
platform in the ICT domain. Finally, in Section 5,
we present conclusions and future works.
2 THE SOBE PROCESS
The social ontology building and evolution (SOBE)
process (see Figure 1) exploits the UPON
methodology (De Nicola et al., 2009), and the
ontology learning methodology defined in (Velardi
et al., 2007), enriching them with social participation
aspects. In particular, UPON is characterized by an
incremental nature, reflected by the outcomes
produced in the different phases of the process: first
the relevant terms in the domain are identified and
gathered in a lexicon; then the latter is progressively
enriched with definitions, yielding a glossary;
adding to it the specialisation relationships allows a
taxonomy to be produced, until further enrichments
and a final formalization produces the sought
domain ontology. The SOBE process exploits this
step-wise approach and enriches it through an
automatic support for knowledge extraction from
existing digital resources, and social participation
aspects for consensus reaching among the
community of experts that participates to the
ontology building. The automatic knowledge
extraction support aims at reducing the workload of
the people involved in the ontology building, and at
reusing the amount of knowledge contained in any
type of existing documental resources (e.g.,
technical papers and reports, standards
specifications, etc.), and structured resources (e.g.,
dictionaries, thesauri, ontologies). In accordance
with the UPON methodology, SOBE firstly
addresses the terminological aspects, i.e. the lexicon.
The start up consists in processing a corpus of
documents, related to the addressed domain, for
automatically extracting terms that are considered
relevant in that domain. This extraction phase is
based on natural language processing techniques,
statistical analysis, and contrastive analysis against a
pre-defined corpora of documents related to
different domains. The extracted terms are referred
in Figure 1 as E-Lexicon. In the case of enrichment
of an existing ontology, the E-Lexicon is filtered out
of the terms that are already in the lexicon of the
current ontology (O-Lexicon). Then, the E-Lexicon
is validated by the community of experts to reach an
agreement on the new terms to be included in the
ontology (N-Lexicon). After the identification of the
N-Lexicon, terms have to be enriched with natural
language definitions in order to build the desired
glossary. As in the philosophy of SOBE, definitions
are firstly extracted from existing dictionaries or
ontologies (e.g., Google Define, WordNet), yielding
the E-Glossary which has to be humanly validated.
Glossary validation is performed by voting extracted
definitions. Any potential conflict due to lack of
agreement or terms with no definitions are managed
by opening discussion forums about glossary entries.
The result of the glossary validation step is gathered
in the N-Glossary. The following step is the
categorization of the N-Glossary entries by
associating to each of them a kind (i.e., Object,
Process, Actor) in accordance with the OPAL
framework (D’Antonio et al., 2007). Terms with
definitions and associated kinds represent the new
concepts to be inserted in the ontology. Starting
from the newly acquired concepts definitions,
natural language processing techniques allow an
automatic proposal of their hypernyms, producing a
set of micro-taxonomies: Eµ-Taxonomies. An Eµ-
Taxonomy is a specialization hierarchy between
concepts. In the case of ontology enrichment the Eµ-
Taxonomies are merged with the taxonomy of the
existing ontology (i.e., O-Taxonomy) producing the
N-Taxonomy. In the last step, the taxonomy is
enriched with other relationships (e.g., part of,
attributes) producing the final N-Ontology. The
human validation phase involves three types of
actors, who play specific roles in validating the
results of the automatic extraction tools: the
Ontology Master (OM), Participants (Ps) and
Moderators (Ms). The OM is responsible for the
whole ontology enrichment process and is in charge
of managing and supervising all its different phases.
People involved as
Ps play an active role in the
validation of the extraction tools results. They are in
charge of validating, adding, modifying terms and
definitions, and proposing new ones. This represents
the SOBE social participation aspect which is
realized through three different mechanisms: voting,
discussing and proposing. Voting enables the
THE SOCIAL ONTOLOGY BUILDING AND EVOLUTION (SOBE) PLATFORM
417