FedDW: A TOOL FOR QUERYING FEDERATIONS OF DATA WAREHOUSES - Architecture, Use Case and Implementation

Stefan Berger, Michael Schrefl

2009

Abstract

Recently, Federated Data Warehouses – collections of autonomous and heterogeneous Data Marts – have become increasingly attractive as they enable the exchange of business information across organization boundaries. The advantage of federated architectures is that users may access the global, mediated schema with OLAP applications, while the Data Marts need not be changed and retain full autonomy. Although the underlying concepts are mature, tool support for Federated DWs has been poor so far. This paper presents the prototype of the “FedDW” Query Tool that supports distributed query processing in federations of ROLAP Data Marts. It acts as middleware component that reformulates user queries according to semantic correspondences between the autonomous Data Marts. We explain FedDW’s architecture, demonstrate a use-case and explain our implementation. We regard our proof-of-concept prototype as a first step towards the development of industrial strength query tools for DW federations.

References

  1. Abell ó, A., Samos, J., and Saltor, F. (2002). On Relationships Offering New Drill-across Possibilities. In Theodoratos, D., editor, DOLAP, pp. 7-13. ACM.
  2. Akinde, M. O., Böhlen, M. H., Johnson, T., Lakshmanan, L. V. S., and Srivastava, D. (2003). Efficient OLAP Query Processing in Distributed Data Warehouses. Inf. Syst., 28(1-2):111-135.
  3. Banek, M., Vrdoljak, B., Tjoa, A. M., and Skocir, Z. (2007). Automating the Schema Matching Process for Heterogeneous Data Warehouses. In Song, I. Y., Eder, J., and Nguyen, T. M., editors, DaWaK, volume 4654 of Lecture Notes in Computer Science, pp. 45-54. Springer.
  4. Berger, S. and Schrefl, M. (2006). Analysing Multi-Dimensional Data across Autonomous Data Warehouses. In Tjoa, A. M. and Tho, N., editors, DaWaK, pp. 120- 133. Springer.
  5. Berger, S. and Schrefl, M. (2008). From Federated Databases to a Federated Data Warehouse System. HICSS, 0:394.
  6. Bernardino, J., Furtado, P., and Madeira, H. (2002). DWSAQA: A Cost Effective Approach for Very Large Data Warehouses. In Nascimento, M. A., O zsu, M. T., and Zaïane, O. R., editors, IDEAS, pp. 233-242.
  7. Brunneder, W. (2008). Development of an SQL-MDi Query Parser (in German). Master's thesis, Univ. of Linz.
  8. Cabibbo, L. and Torlone, R. (2005). Integrating Heterogeneous Multidimensional Databases. In Frew, J., editor, SSDBM, pp. 205-214.
  9. Doan, A. and Halevy, A. Y. (2005). Semantic Integration Research in the Database Community: A Brief Survey. AI Magazine, 26(1):83-94.
  10. Gamma, E., Helm, R., Johnson, R., and Vlissides, J. (1995). Design patterns: elements of reusable object-oriented software. Addison-Wesley Longman Publishing Co., Inc., Boston, MA, USA.
  11. Gingras, F. and Lakshmanan, L. V. S. (1998). nD-SQL: A Multi-dimensional Language for Interoperability and OLAP. In Gupta, A., Shmueli, O., and Widom, J., editors, VLDB, pp. 134-145. Morgan Kaufmann.
  12. Graefe, G. and McKenna, W. J. (1993). The Volcano Optimizer Generator: Extensibility and Efficient Search. In Proceedings of the Ninth International Conference on Data Engineering, April 19-23, 1993, Vienna, Austria, pp. 209-218. IEEE Computer Society.
  13. Grant, J., Litwin, W., Roussopoulos, N., and Sellis, T. K. (1993). Query Languages for Relational Multidatabases. VLDB J., 2(2):153-171.
  14. Halevy, A. Y., Rajaraman, A., and Ordille, J. J. (2006). Data Integration: The Teenage Years. In Dayal, U., Whang, K.-Y., Lomet, D. B., Alonso, G., Lohman, G. M., Kersten, M. L., Cha, S. K., and Kim, Y.-K., editors, VLDB, pp. 9-16. ACM.
  15. Inmon, W. (2005). Building the Data Warehouse. John Wiley & Sons, New York, 4th edition.
  16. Kimball, R. (2002). The Data Warehouse Toolkit: Practical Techniques for Building Dimensional Datawarehouses. John Wiley & Sons, 2nd edition.
  17. Lakshmanan, L. V. S., Sadri, F., and Subramanian, S. N. (2001). SchemaSQL: An Extension to SQL for Multidatabase Interoperability. ACM Trans. Database Syst., 26(4):476-519.
  18. Mangisengi, O., Eßmayr, W., Huber, J., and Weippl, E. (2003). XML-based OLAP Query Processing in a Federated Data Warehouses. In ICEIS (1), pp. 71-78.
  19. Medina, E. and Trujillo, J. (2002). A Standard for Representing Multidimensional Properties: The Common Warehouse Metamodel (CWM). In Manolopoulos, Y. and Návrat, P., editors, ADBIS, volume 2435 of Lecture Notes in Computer Science, pp. 232-247. Springer.
  20. Pedersen, D., Riis, K., and Pedersen, T. B. (2002). A Powerful and SQL-compatible Data Model and Query Language for OLAP. In Zhou, X., editor, Australasian Database Conference, volume 5 of CRPIT. Australian Computer Society.
  21. Reilles, A. (2007). Canonical Abstract Syntax Trees. Electronic Notes in Theoretical Computer Science, 176(4):165 - 179. Proceedings of the 6th International Workshop on Rewriting Logic and its Applications (WRLA 2006).
  22. Rossgatterer, T. (2008). Query Processing in a Federated Data Warehouse System (in German). Master's thesis, University of Linz.
  23. Torlone, R. and Panella, I. (2005). Design and Development of a Tool for Integrating Heterogeneous Data Warehouses. In Tjoa, A. M. and Trujillo, J., editors, DaWaK, volume 3589 of Lecture Notes in Computer Science, pp. 105-114. Springer.
  24. Zhao, H. and Ram, S. (2007). Combining Schema and Instance Information for Integrating Heterogeneous Data Sources. Data Knowl. Eng., 61(2):281-303.
Download


Paper Citation


in Harvard Style

Berger S. and Schrefl M. (2009). FedDW: A TOOL FOR QUERYING FEDERATIONS OF DATA WAREHOUSES - Architecture, Use Case and Implementation . In Proceedings of the 11th International Conference on Enterprise Information Systems - Volume 1: ICEIS, ISBN 978-989-8111-84-5, pages 113-122. DOI: 10.5220/0001985601130122


in Bibtex Style

@conference{iceis09,
author={Stefan Berger and Michael Schrefl},
title={FedDW: A TOOL FOR QUERYING FEDERATIONS OF DATA WAREHOUSES - Architecture, Use Case and Implementation},
booktitle={Proceedings of the 11th International Conference on Enterprise Information Systems - Volume 1: ICEIS,},
year={2009},
pages={113-122},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001985601130122},
isbn={978-989-8111-84-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 11th International Conference on Enterprise Information Systems - Volume 1: ICEIS,
TI - FedDW: A TOOL FOR QUERYING FEDERATIONS OF DATA WAREHOUSES - Architecture, Use Case and Implementation
SN - 978-989-8111-84-5
AU - Berger S.
AU - Schrefl M.
PY - 2009
SP - 113
EP - 122
DO - 10.5220/0001985601130122