INSTANCES NAVIGATION FOR QUERYING INTEGRATED DATA FROM WEB-SITES

Domenico Beneventano, Sonia Bergamaschi, Stefania Bruschi, Francesco Guerra, Mirko Orsini, Maurizio Vincini

2006

Abstract

Research on data integration has provided a set of rich and well understood schema mediation languages and systems that provide a meta-data representation of the modeled real world, while, in general, they do not deal with data instances. Such meta-data are necessary for querying classes result of an integration process: the end user typically does not know the contents of such classes, he simply defines his queries on the basis of the names of classes and attributes. In this paper we introduce an approach enriching the description of selected attributes specifying as meta-data a list of the “relevant values” for such attributes. Furthermore relevant values may be hierarchically collected in a taxonomy. In this way, the user may exploit new meta-data in the interactive process of creating/refining a query. The same meta-data are also exploited by the system in the query rewriting/unfolding process in order to filter the results showed to the user. We conducted an evaluation of the strategy in an e-business context within the EU-IST SEWASIE project. The evaluation proved the practicability of the approach for large value instances.

References

  1. Beneventano, D., Bergamaschi, S., Guerra, F., and Vincini, M. (2003). Synthesizing an integrated ontology. IEEE Internet Computing Magazine, pages 42-51.
  2. Beneventano, D. and Lenzerini, M. (2005). Final release of the system prototype for query management. Sewasie, Deliverable D.3.5, Final Version, available at http://www.dbgroup.unimo.it/pubs.html.
  3. Bergamaschi, S., Castano, S., Beneventano, D., and Vincini, M. (2001). Semantic integration of heterogeneous information sources. Data & Knowledge Engineering, Special Issue on Intelligent Information Integration, 36(1):215-249.
  4. Broder, A. Z., Maarek, Y. S., Bharat, K., Dumais, S. T., Papa, S., Pedersen, J., and Raghavan, P. (2005). Current trends in the integration of searching and browsing. In WWW (Special interest tracks and posters), page 793.
  5. Buneman, P., Davidson, S., Fernandez, M., and Suciu, D. (1997). Adding structure to unstructured data. In Proc. of ICDT 1997, pages 336-350, Delphi, Greece.
  6. Chaudhuri, S., Ramakrishnan, R., and Weikum, G. (2005). Integrating db and ir technologies: What is the sound of one hand clapping? In Proceedings of the Second Biennial Conference on Innovative Data Systems Research, Asilomar, CA, USA, pages 1-12.
  7. Dong, X. and Halevy, A. Y. (2005). Malleable schemas: A preliminary report. In Proceedings of he Eight International Workshop on the Web & Databases (WebDB 2005), Baltimore, Maryland, USA, pages 139-144.
  8. Galindo-Legaria, C. A. (1994). Outerjoins as disjunctions. In Snodgrass, R. T. and Winslett, M., editors, SIGMOD Conference, pages 348-358. ACM Press.
  9. Gibson, D., Kleinberg, J., and Raghavan, P. (2000). Clustering categorical data: an approach based on dynamical systems. VLDB Journal, 8(3-4):222-236.
  10. Gottlob, G., Koch, C., Baumgartner, R., Herzog, M., and Flesca, S. (2004). The lixto data extraction project - back and forth between theory and practice. In Proceedings of the Twenty-third ACM SIGACT-SIGMODSIGART Symposium on Principles of Database Systems, pages 1-12, Paris, France.
  11. Halevy, A. (2003). Data integration: a status report. In Proceedings of the German Database Conference, BTW03, Leipzig.
  12. Halevy, A. Y. (2004). Structures, semantics and statistics. In Proceedings of the 30th International Conference on VLDB, Toronto, Canada, pages 4-6.
  13. N. Noy, M. Uschold, C. W. (2005). Representing classes as property values on the semantic web. Semantic Web Best Practices and Deployment Working Group, part of the W3C Semantic Web Activity.
  14. (http://www.w3.org/TR/swbp-classes-as-values).
  15. Nestorov, S., Abiteboul, S., and Motwani, R. (1997). Inferring structure in semistructured data. SIGMOD Record, 26(4):39-43.
Download


Paper Citation


in Harvard Style

Beneventano D., Bergamaschi S., Bruschi S., Guerra F., Orsini M. and Vincini M. (2006). INSTANCES NAVIGATION FOR QUERYING INTEGRATED DATA FROM WEB-SITES . In Proceedings of WEBIST 2006 - Second International Conference on Web Information Systems and Technologies - Volume 1: WEBIST, ISBN 978-972-8865-46-7, pages 46-53. DOI: 10.5220/0001247900460053


in Bibtex Style

@conference{webist06,
author={Domenico Beneventano and Sonia Bergamaschi and Stefania Bruschi and Francesco Guerra and Mirko Orsini and Maurizio Vincini},
title={INSTANCES NAVIGATION FOR QUERYING INTEGRATED DATA FROM WEB-SITES},
booktitle={Proceedings of WEBIST 2006 - Second International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,},
year={2006},
pages={46-53},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001247900460053},
isbn={978-972-8865-46-7},
}


in EndNote Style

TY - CONF
JO - Proceedings of WEBIST 2006 - Second International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,
TI - INSTANCES NAVIGATION FOR QUERYING INTEGRATED DATA FROM WEB-SITES
SN - 978-972-8865-46-7
AU - Beneventano D.
AU - Bergamaschi S.
AU - Bruschi S.
AU - Guerra F.
AU - Orsini M.
AU - Vincini M.
PY - 2006
SP - 46
EP - 53
DO - 10.5220/0001247900460053