A CONTEXT-BASED APPROACH FOR LINGUISTIC MATCHING

Youssef Bououlid Idrissi, Julie Vachon

Abstract

As currently implemented by most data mapping systems, linguistic matching often boils down to string comparison or to a synonym look-up in a dictionary. But these solutions have proved to be inefficient for dealing with highly heterogeneous data sources. To cope with data source heterogeneity more efficiently, we introduce INDIGO, a system which computes semantic matching by taking into account data sources’ context. The distinctive feature of INDIGO consists in enriching data sources with semantic information extracted from their individual development artifacts before mapping them. As explained in this article, experiments conducted on two case studies proved the relevance of this approach.

References

  1. A. Doan, P. D. and Halevy, A. (2003). Learning to match the schemas of data sources : a multistrategy approach. Journal of Machine Learning, 50(3):279-301.
  2. Bououlid, I. and Vachon, J. (2005). Context analysis for semantic mapping of data sources using a multistrategy machine learning approach. In Proc. of the International Conf. on Enterprise Information Systems (ICEIS05), pages 445-448, Miami.
  3. Bououlid, I. and Vachon, J. (2007). A context-based approach for for complex semantic matching. In Proc. of CAiSE-Forum, Trondheim, Norway.
  4. B.T. Le, R. D.-K. and Gandon, F. (2004). On ontology matching problems - for building a corporate semantic web in a multi-communities organization. In Proc. of the Int. Conf. on Enterprise Information Systems (ICEIS), volume 4, pages 236-243.
  5. Do, H. and Rahm, E. (2002). Coma: a system for flexible combination of schema matching approaches. In Proceedings of the 28th Conf. on Very Large Databases.
  6. Euzenat, J. and et. al (2004). State of the art on ontology alignment. part of a research project funded by the ist program of the commission of the european communities. Technical Report project number IST-2004- 507482, Knowledge Web Consortium.
  7. Hu, W., Cheng, G., Zheng, D., Zhong, X., and Qu, Y. (2006). The results of falcon-ao in the oaei 2006 campaign. In International Workshop on Ontology Matching.
  8. J. Madhavan, P. B. and Rahm, E. (2001). Generic schema matching using cupid. In In Proceedings of the 27th VLDB Conference, pages 48-58, Roma, Italy.
  9. K. Kotis, G. V. and Stergiou, K. (2004). Capturing semantics towards automatic coordination of domain ontologies. In Artificial Intelligence: Methodology, Systems, and Applications, volume 3192 of LNCS, pages 22- 32.
  10. McUmber, R. (2003). Developing pet store using rup and xde. Web Site.
  11. Melnik, S., Garcia-Molina, H., and Rahm, H. (2002). Similarity flooding: a versatile graph matching algorithm and its application to schema matching. In Proc. International Conference on Data Engineering (ICDE'02), pages 117-128.
  12. Microsystems, S. (2005). Java petstore. Web Site.
  13. Noy, N. and Musen, M. (2001). Anchor-prompt: Using non-local context for semantic matching. In In Proc. IJCAI 2001 workshop on ontology and information sharing, pages 63-70, Seattle (WA US).
  14. Y. Qu, W. H. and Cheng, G. (2006). Constructing virtual documents for ontology matching. In Proceedings of the 15th International World Wide Web Conference.
Download


Paper Citation


in Harvard Style

Bououlid Idrissi Y. and Vachon J. (2007). A CONTEXT-BASED APPROACH FOR LINGUISTIC MATCHING . In Proceedings of the Second International Conference on Software and Data Technologies - Volume 3: ICSOFT, ISBN 978-989-8111-07-4, pages 197-202. DOI: 10.5220/0001345601970202


in Bibtex Style

@conference{icsoft07,
author={Youssef Bououlid Idrissi and Julie Vachon},
title={A CONTEXT-BASED APPROACH FOR LINGUISTIC MATCHING},
booktitle={Proceedings of the Second International Conference on Software and Data Technologies - Volume 3: ICSOFT,},
year={2007},
pages={197-202},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001345601970202},
isbn={978-989-8111-07-4},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Second International Conference on Software and Data Technologies - Volume 3: ICSOFT,
TI - A CONTEXT-BASED APPROACH FOR LINGUISTIC MATCHING
SN - 978-989-8111-07-4
AU - Bououlid Idrissi Y.
AU - Vachon J.
PY - 2007
SP - 197
EP - 202
DO - 10.5220/0001345601970202