5 CONCLUSION
In this paper we have interested on schema
matching, and focused on structural context
matching for enhanced XML schemas. We began by
an analysis of problems involved in the matching,
and we proposed a new solution taking into account
of heterogeneity of the schema sources. For the
structural similarity measure, we recovered a matrix
of terminological similarity coefficients between
schema nodes based on the similarity of their labels.
We outlined the limitations of current solutions
through the study of Cupid and Similarity Flooding
systems. Then we proposed a structural matching
technique that considers the context of schemas
nodes (defined by their roots, intermediates and
leafs contexts in schema graph). By the way, we
suggest a simple structural algorithm based on the
previous ideas and exploit the three types of
contexts. We refer to the result produced by the
algorithm as a mapping. The user validates this
mapping in order to produce a final mapping result
that serves to generate transformation scripts.
For future work, we would like to improve the
matching process, while taking into account the
optimisation of the process in order to determine a
set of semantic equivalences between schemas
(source and target). That will facilitate the
generation of operators based on the primitive of
transformations between entities of EXS schemas.
The second axis to land concerns the efficiency and
the time of human interaction. The key is then to
discover how to minimize ser interaction but
maximizing the impact of the feedback.
REFERENCES
Abiteboul, S., Cluet, S., Milo, T., 1997. Correspondence
and Translation for heterogeneous data. In Proceeding
of The international Conference on Database Theory
(ICDT). 351-363.
Boukottaya, A., Vanoirbeek, C., Paganelli, F., Abou-
Khaled, O., 2004. Automating XML documents
transformations: a conceptual modelling based
approach. In Proceedings of the first Asian-Pacific
conference on Conceptual modelling. ACM, 81-90.
Castano, S. and De Antonellis, V., 1999. A schema
analysis and Reconciliation Tool Environment For
Heterogeneous Databases. In Proceedings of
International Database Engineering and Applications
Symposium.
Doan, A., Madhavan, J., Domingos, P., Halevey, A., 2001.
Reconciling schemas of disparate data sources: A
machine Learning Approach. In Proceedings ACM
SIGMOD conference. 509-520.
Drew, P., King, R., McLeod, D., Rusinkiewicz, M.,
Silberschatz, A., 1993. Report of the Workshop on
Semantic Heterogeneity and Interoperation in
Multidatabase Systems. In Proceedings ACM
SIGMOD record, 47-56.
Fellbum, C., 1998. WordNet: An Electronic Lexical
Database. MIT press.
Lamolle, M. and Mellouli, N., 2003. Intégration de bases
de données hétérogènes via XML.EGC’2003.
Lamolle, M. and Zerdazi, A., 2005. Intégration de Bases
de données hétérogènes par une modélisation
conceptuelle XML, COSI’05. 216-227.
Li, W.S. and Clifton, C., 1994, Semantic Integration in
Heterogeneous Databases Using Neural Networks.
VLDB.
Li, W.S. and Clifton C., 2000, SemInt: A Tool for
Identifying Attribute Correspondences in
Heterogeneous Databases Using Neural Network. Data
and Knowledge Engineering. 49-84.
Madhavan, J., Bernstein, P., Rahm, E., 2001. Generic
schema matching with cupid. VLDB.
Melnik, S., Garcia-Molina, H., Rahm, E., 2002. Similarity
Flooding: A versatile Graph Matching and its
Application to Schema Matching. Data Engineering.
Miller, A.G., 1995. WordNet: A lexical Database for
English. ACM. 39-41.
Miller, A.G., Hass, L., Hernandez, M.A., 2000. Schema
mapping as query discovery. VLDB. 77-88.
Rahm, E. and Bernstein, P., 2001 A survey of approaches
to automatic schema matching. In VLDB Journal.
334-350.
XML Schema, W3C Recommendation, 2001. XML-
Schema Primer, W3 Consortium, 2001. Available at
http://www.w3.org/TR /xmlschema-0.
Zerdazi, A. and Lamolle, M., 2005. Modélisation des
schémas XML par adjonction de métaconnaissances
sémantiques. ASTI’05. 29-32.
Zerdazi, A. and Lamolle, M., 2006. Intégration de sources
hétérogènes par matching semi-automatique de
schémas XML étendus. INFORSID’2006. 991-1006.
MATCHING OF ENHANCED XML SCHEMAS WITH A MEASURE OF STRUCTURAL-CONTEXT SIMILARITY
133