perceived as a form of “accessibility data quality”
instead of a category of its own. Otherwise, the
authors also argue for the intrinsic dimensions
defined by (Haug and Arlbjørn, 2011).
- Discussion
The mentioned classifications above, were
undertaken with different goals in mind and
according to different context. Most projects have
avoided the difficult issue of quality assessment or
have only touched it briefly. This leads us to clearly
deduce that the area of data quality is widespread
debate and may differ from an application domain to
another, hence the interest to enroll in a specific
context.
- Our Motivation
The quantity of data handled by information systems
(IS) is increasing worldwide. Particularly in the case
of Mediation systems, where each data source is
independent and change frequently which may affect
the quality of data and consequently to the
satisfaction of the end user.
We choose to study and assess the quality of data
retrieved from the virtual data integration or so-
called 'the mediation’ also because we noticed that
despite the importance of this domain and the
importance of the end user satisfaction. Until now,
no study has conducted a quality assessment of data
retrieved by this type of system. This motivation will
make a good starting point as a problematic for our
work around.
In the next section, we take first the concept of
mediation up in order to have an idea about our
context of our research.
4 MEDIATION SYSTEM
In what follow, we present in the first Sub Section
the concept of mediation in general and in the
second sub section, we illustrates the process of
mediation.
4.1 The Concept of Mediation
The mediation concept dating back to 70’s, Levy
defines mediation as a transparent access service to a
huge number of autonomous, heterogeneous,
dynamics and distributed information sources (Levy
A. Y et al , 1996). Also, (A.Zellou, 2008) defines
mediation systems in as an intermediate tool
between a user or application, and a set of
information sources; this tool provides a transparent
access to sources by a unique interface and query
language.
We define the mediator more precisely as a
system that offers a common query interface to a set
of heterogeneous data sources which can (1) Accepts
the participation of different data sources (2)
Contains information about the contents of the data
sources; (3) Integrates the different data sources by
means of a unifying, global or mediated schema; (4)
Receives queries from users that are expressed in the
language of the global schema; (5) Collects data
from sources upon request; at query time;(6) In
order to answer global queries, it sends appropriate
queries to the sources; (7) Combines the answers
received from the sources to build up the final
answer to the user.
Mediation information has several advantages. It
provides a uniform, unique and multi-source access
through a unique interface. It produces an integrated
answer by exploiting relations between sources. It
offers an independence between applications users
and information sources in order to permit evolution
of applications and take into account sources
autonomy (INRIA, 2001). For us, an ideal mediation
system would sbe useful anytime and anywhere.
Intuitively, a Mediator cannot directly assess
queries which is directed to it, because it doesn’t
contain any form of data (Hadi, Zellou and
Bounabat, 2013). These data are stored in a
distributed way and in independent sources.
Mediator has only an abstract view of data stored in
these sources (Zellou, 2008).
Different integration systems based on mediators
have been proposed in literature. We find mainly:
MOMIS (Beneventano and Bergamaschi, 2004),
TSIMMIS (Garcia-Molina et al.1994 ), HERMES
(S. & Brink A. & Emery R. & Lu J. J. & Rajput A.
& Rogers T. J. & Ross R. & Ward C. .,1995),
Information Manifold (Weld, 1997), Internet Softbot
(Etzioni and Weld, 1994), Infomaster (Genesereth
M. R. et al. , 1997), OBSERVER (Mena et al. ,
1996) , PICSEL (Rousset et al.,2002) and WASSIT
(Zellou, 2008).
These systems above mentioned, were
distinguishable by the manner of mapping between
the global schema and the schema of data sources.
Abstraction of all this variety, we choose to present
hereunder, the processing of mediation as a
conventional global architecture of mediation with
three layers.