A UNIFIED MODEL DRIVEN METHODOLOGY FOR DATA WAREHOUSES AND ETL DESIGN

Faten Atigui, Franck Ravat, Ronan Tournier, Gilles Zurfluh

Abstract

During the last few years, several frameworks have dealt with Data Warehousing (DW) design issues. Most of these frameworks provide partial answers that focus either on multidimensional (MD) modelling or on Extraction-Transformation-Loading (ETL) modelling. However, little attention has been given either to unifying both modelling issues into a single structured framework or to automating the warehousing process. To overcome these limits, this paper provides a generic, unified and semi-automated method that integrates the design of the DW and of its ETL processes. The framework is handled within the Model Driven Architecture (MDA). It (i) first helps the designer in modelling the decision-makers' requirements, then (ii) generates the MD model, (iii) generates the logical and the physical models, and finally (iv) generates the source code. In this approach, the transformation rules are formalized using the Query/View/Transformation (QVT) language.
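As a rough illustration of the model-to-model chain described above (MD model, then logical model, then generated code), the following Python sketch maps a toy multidimensional model to a relational star schema and emits SQL DDL. It is not the authors' metamodel or their QVT rules: Python stands in for QVT here, every class and function name (`Fact`, `Dimension`, `Table`, `md_to_logical`, `logical_to_sql`) is hypothetical, the column types are arbitrary placeholders, and the ETL side of the method is not covered.

```python
from __future__ import annotations

from dataclasses import dataclass, field


# Hypothetical conceptual multidimensional (MD) model; not the paper's metamodel.
@dataclass
class Dimension:
    name: str
    attributes: list[str]


@dataclass
class Fact:
    name: str
    measures: list[str]
    dimensions: list[Dimension]


# Hypothetical logical (relational) model.
@dataclass
class Table:
    name: str
    columns: dict[str, str]  # column name -> SQL type (placeholder types)
    primary_key: list[str]
    foreign_keys: dict[str, str] = field(default_factory=dict)  # column -> referenced table


def md_to_logical(fact: Fact) -> list[Table]:
    """Assumed mapping rule: one table per dimension, plus a fact table referencing them."""
    tables: list[Table] = []
    fact_columns: dict[str, str] = {}
    foreign_keys: dict[str, str] = {}
    for dim in fact.dimensions:
        table_name = dim.name.lower()
        key = f"{table_name}_id"
        columns = {key: "INTEGER"}
        columns.update({attr: "VARCHAR(100)" for attr in dim.attributes})
        tables.append(Table(table_name, columns, [key]))
        fact_columns[key] = "INTEGER"
        foreign_keys[key] = table_name
    fact_columns.update({measure: "NUMERIC" for measure in fact.measures})
    tables.append(Table(fact.name.lower(), fact_columns, list(foreign_keys), foreign_keys))
    return tables


def logical_to_sql(tables: list[Table]) -> str:
    """Assumed code-generation step: emit generic CREATE TABLE statements."""
    statements = []
    for table in tables:
        parts = [f"{col} {sql_type}" for col, sql_type in table.columns.items()]
        parts.append(f"PRIMARY KEY ({', '.join(table.primary_key)})")
        for col, ref in table.foreign_keys.items():
            parts.append(f"FOREIGN KEY ({col}) REFERENCES {ref} ({col})")
        body = ",\n  ".join(parts)
        statements.append(f"CREATE TABLE {table.name} (\n  {body}\n);")
    return "\n\n".join(statements)


# Toy input: a Sales fact analysed by Customer and Product (illustrative data only).
sales = Fact(
    "Sales",
    ["amount", "quantity"],
    [Dimension("Customer", ["name", "city"]), Dimension("Product", ["label", "category"])],
)
logical = md_to_logical(sales)
ddl = logical_to_sql(logical)
print(ddl)
```

Keeping each step as a separate function from one model to the next mirrors the MDA idea of chained transformations: the mapping rule can change without touching the code generator, and the intermediate logical model can be inspected before any code is produced.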



Paper Citation


in Harvard Style

Atigui F., Ravat F., Tournier R. and Zurfluh G. (2011). A UNIFIED MODEL DRIVEN METHODOLOGY FOR DATA WAREHOUSES AND ETL DESIGN. In Proceedings of the 13th International Conference on Enterprise Information Systems - Volume 1: ICEIS, ISBN 978-989-8425-53-9, pages 247-252. DOI: 10.5220/0003502002470252


in Bibtex Style

@conference{iceis11,
author={Faten Atigui and Franck Ravat and Ronan Tournier and Gilles Zurfluh},
title={A UNIFIED MODEL DRIVEN METHODOLOGY FOR DATA WAREHOUSES AND ETL DESIGN},
booktitle={Proceedings of the 13th International Conference on Enterprise Information Systems - Volume 1: ICEIS},
year={2011},
pages={247-252},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003502002470252},
isbn={978-989-8425-53-9},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 13th International Conference on Enterprise Information Systems - Volume 1: ICEIS
TI - A UNIFIED MODEL DRIVEN METHODOLOGY FOR DATA WAREHOUSES AND ETL DESIGN
SN - 978-989-8425-53-9
AU - Atigui F.
AU - Ravat F.
AU - Tournier R.
AU - Zurfluh G.
PY - 2011
SP - 247
EP - 252
DO - 10.5220/0003502002470252