COMBINING SEMANTIC INFORMATION AND INFORMATION QUALITY ON THE ENRICHMENT OF WEB DATA INTEGRATION SYSTEMS

Damires Souza, Bernadette Farias Lóscio, Ana Carolina Salgado

2012

Abstract

The emergence of the Web and its permanent growth has caused a big impact on the database research community. Thereby, Database research areas have evolved in order to consider the new problems arising from the need of managing the huge volume of data available on the Web. One of such areas is Data Integration (DI), which is considered a pervasive challenge faced by applications that need to query across multiple autonomous and heterogeneous data sources. To help matters, we argue that semantic information like ontological and contextual information, combined with Information Quality (IQ) provided by IQ measures, may be employed together in order to enrich processes in DI (e.g., schema matching and query answering). In this paper, we present our ideas regarding what we mean by semantic information and IQ and why and how they may be combined in order to produce semantic knowledge to be used in Web Data Integration Systems. Furthermore, we propose a preliminary version of a metamodel, which presents a formal description of relationships between concepts associated with semantic information and IQ.

References

  1. Baader F., Calvanese D., McGuinness D., Nardi D., PatelSchneider P. editors., 2003. The Description Logic Handbook: Theory, Implementation and Applications. Cambridge University Press.
  2. Batista, M. C., Salgado, A.C., 2007. Data Integration Schema Analysis: An Approach with Information Quality. In Proceedings of the 12th International Conference on Information Quality (ICIQ), MIT, Massachusetts, USA, October 2007.
  3. Belian, R., Salgado, A. C., 2010. A Context-based Schema Integration Process Applied to Healthcare Data Sources. In Proceedings of the International Conference On the move to meaningful internet systems, Springer-Verlag.
  4. Bolchini, C., Curino, C., Orsi, G., Quintarelli, E., Rossato, R., Schreiber, F., Tanca, L., 2009. And what can context do for data? In: Communication of the ACM, Volume 52 (11), pp. 136-140.
  5. Dey, A., 2001. Understanding and Using Context. Personal and Ubiquitous Computing Journal, Volume 5, pp. 4-7. .
  6. Duchateau, F., Bellahsene Z., 2010. Measuring the Quality of an Integrated Schema. In Conceptual Modeling - ER 2010, Lecture Notes in Computer Science.
  7. Fuchs, F., Hochstatter, I., Krause, M., Berger, M., 2005. A Metamodel Approach to Context Information. In: PerCom Workshops 2005, 2005, pp. 8-14, Kauai Island, HI.
  8. Ge, M., Helfert, M., 2007. A Review of Information Quality Research - Develop a Research Agenda. In Proceedings of the 12th International Conference on Information Quality (ICIQ), MIT, Massachusetts, USA November 2007.
  9. Giunchiglia, F., Shvaiko, P., Yatskevich, M., 2004. Smatch: an algorithm and an implementation of semantic matching. In: European Semantic Web Symposium (ESWC). pp. 61-75.
  10. Gruber, T., 1995. Toward principles for the design of ontologies used for knowledge sharing. International Journal of Human-Computer Studies, 43:907-928.
  11. Halevy, A., Rajaraman, A., Ordille, J., 2006. Data Integration: the Teenage Years, In Proceedings. of the 25th International Conference on Very Large Data Bases (VLDB), pages 9-16, Seoul, Korea, September 2006.
  12. Hedeler, C., Belhajjame, K., Fernandes, A.A.A., Embury, S.M., Paton, N.W., 2009. Dimensions of Databases, In Proc. of 26th British National Conference on Databases, Birmingham, UK, pages 55-66.
  13. Helfert, M., Foley, O., 2009. A Context Aware Information Quality Framework. In Proceedings of the 4th International Conference on Cooperation and Promotion of Information Resources in Science and Technology (COINFO'09), November 21-23, Beijing, IEEE Computer Society Press, pp.: 187-193.
  14. Keeton, K., Mehra, P., Wilkes, J., 2009. Do You Know Your IQ? : A Research Agenda for Information Quality in Systems. ACM SIGMETRICS Performance Evaluation Review, Vol. 37, Issue 3, December 2009.
  15. Mandreoli, F., Martoglia, R., Villani, G., Penzo, W., 2009. Flexible query answering on graph-modeled data. In: 12th International Conference on Extending Database Technology (EDBT'09), Saint-Petersburg, Russia, pp. 216-227.
  16. Molina, H., Olsina, L., 2008. Assessing Web Applications Consistently: A Context Information Approach. In Proceedings of ICWE'2008. pp.224-230.
  17. Pires, C. E., Souza, D., Pachêco, T., Salgado, A. C., 2009. A Semantic-based Ontology Matching Process for PDMS. In: 2nd International Conference on Data Management in Grid and P2P Systems (Globe'09), Linz, Austria, pp. 124-135.
  18. Roth, A., Naumann, F., 2005. Benefit and Cost of Query Answering in PDMS. In Proceedings of the Int. Workshop on Databases, Information Systems and Peer-to-Peer Computing (DBISP2P), 2005.
  19. Souza D., Arruda T., Salgado A. C., Tedesco P., Kedad, Z., 2009. Using Semantics to Enhance Query Reformulation in Dynamic Environments. In: Proceedings of the 13th East European Conference on Advances in Databases and Information Systems (ADBIS'09), Riga, Latvia, pp. 78-92.
  20. Souza D., Pires, C. E., Kedad, Z., Tedesco, P., Salgado, A.C., 2011. A Semantic-based Approach for Data Management in a P2P System. In LNCS Transactions on Large-Scale Data- and Knowledge-Centered Systems, 2011.
  21. Souza, D., Belian, R., Salgado, A. C., Tedesco, P., 2008. Towards a Context Ontology to Enhance Data Integration Processes. In: 4th Workshop on Ontologies-based Techniques for DataBases in Information Systems and Knowledge Systems (ODBIS), Auckland, New Zealand, pp.49-56.
  22. Stuckenschmidt, H., Giunchiglia, F., van Harmelen, F., 2005. Query processing in ontology-based peer-topeer systems. In V. Tamma, S. Craneeld, T. Finin, and S. Willmott, editors, Ontologies for Agents: Theory and Experiences. Birkhuser.
  23. Sung, L., Ahmed, N., Blanco, R., Li, H, Soliman, M. A., Hadaller, D., 2005. A Survey of Data Management in Peer-to-Peer Systems. School of Computer Science, University of Waterloo, 2005.
  24. Vieira, V., Tedesco, P., Salgado, A.C., Brézillon P., 2007. Investigating the Specifics of Contextual Elements Management: The CEManTIKA Approach. The Sixth International and Interdisciplinary Conference on Modeling and Using Context. B. Kokinov et al. (Eds.): LNAI 4635, Springer-Verlag, pp. 493-506.
  25. Wang, J.A., 2010. Quality Framework for Data Integration. In Proceedings of the 27th British National Conference on Databases (BNCOD).
  26. Wang, R., Strong, D., 1996. Beyond Accuracy: What Data Quality Means to Data Consumers. Journal of Management Information Systems, Vol. 12, N. 4, pages 5-33, 1996.
  27. Wang, X., H., Gu, T., Zhang, D. Q., Pung, H. K., 2004. Ontology based context modelling and reasoning using OWL. In: Proceedings of the 1st Workshop on Context Modeling and Reasoning, 2004, Orlando, Florida.
  28. Xiao, H., 2006. Query processing for heterogeneous data integration using ontologies. PhD Thesis in Computer Science. University of Illinois at Chicago.
  29. Yasar, A., Paridel, K., Preuveneers, D.,Berbers, Y., 2011. When efficiency matters: Towards quality of contextaware peers for adaptive communication in VANETs. 2011 IEEE Intelligent Vehicles Symposium (IV) (June 2011), pg. 1006-1012.
Download


Paper Citation


in Harvard Style

Souza D., Farias Lóscio B. and Salgado A. (2012). COMBINING SEMANTIC INFORMATION AND INFORMATION QUALITY ON THE ENRICHMENT OF WEB DATA INTEGRATION SYSTEMS . In Proceedings of the 8th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST, ISBN 978-989-8565-08-2, pages 219-224. DOI: 10.5220/0003961602190224


in Bibtex Style

@conference{webist12,
author={Damires Souza and Bernadette Farias Lóscio and Ana Carolina Salgado},
title={COMBINING SEMANTIC INFORMATION AND INFORMATION QUALITY ON THE ENRICHMENT OF WEB DATA INTEGRATION SYSTEMS },
booktitle={Proceedings of the 8th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,},
year={2012},
pages={219-224},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003961602190224},
isbn={978-989-8565-08-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 8th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,
TI - COMBINING SEMANTIC INFORMATION AND INFORMATION QUALITY ON THE ENRICHMENT OF WEB DATA INTEGRATION SYSTEMS
SN - 978-989-8565-08-2
AU - Souza D.
AU - Farias Lóscio B.
AU - Salgado A.
PY - 2012
SP - 219
EP - 224
DO - 10.5220/0003961602190224