A Data Analysis Framework for High-variety Product Lines in the Industrial Manufacturing Domain

Christian Lettner, Michael Zwick

Abstract

Industrial manufacturing companies produce a variety of different products, which, despite their differences in function and application area, share common requirements regarding quality assurance and data analysis. The goal of the approach presented in this paper is to automatically generate Extract-Transform-Load (ETL) packages for semi-generic operational database schema. This process is guided by a descriptor table, which allows for identifying and filtering the required attributes and their values. Based on this description model, an ETL process is generated which first loads the data into an entity-attribute-value (EAV) model, then gets transformed into a pivoted model for analysis. The resulting analysis model can be used with standard business intelligence tools. The descriptor table used in the implementation can be substituted with any other non-relational description language, as long as it has the same descriptive capabilities.

References

  1. Acharya, S., Carlin, P., Galindo-Legaria, C., Kozielczyk, K., Terlecki, P., and Zabback, P. (2008). Relational support for flexible schema scenarios. Proc. VLDB Endow., 1(2):1289-1300.
  2. Atigui, F., Ravat, F., Teste, O., and Zurfluh, G. (2012). Using ocl for automatically producing multidimensional models and etl processes. In Data Warehousing and Knowledge Discovery, pages 42-53. Springer.
  3. Bernstein, P. A. and Melnik, S. (2007). Model management 2.0: manipulating richer mappings. In Proceedings of the 2007 ACM SIGMOD international conference on Management of data, SIGMOD 7807, pages 1-12, New York, NY, USA. ACM.
  4. Chang, F., Dean, J., Ghemawat, S., Hsieh, W. C., Wallach, D. A., Burrows, M., Chandra, T., Fikes, A., and Gruber, R. E. (2008). Bigtable: A distributed storage system for structured data. ACM Trans. Comput. Syst., 26(2):4:1-4:26.
  5. Chaudhuri, S., Dayal, U., and Narasayya, V. (2011). An overview of business intelligence technology. Commun. ACM, 54(8):88-98.
  6. Dinu, V., Nadkarni, P., and Brandt, C. (2006). Pivoting approaches for bulk extraction of entity-attribute-value data. Comput. Methods Prog. Biomed., 82(1):38-43.
  7. Dinuab, V. and Nadkarnia, P. (2007). Guidelines for the effective use of entityattributevalue modeling for biomedical databases. International Journal of Medical Informatics, 76(11-12):769-779.
  8. Jiao, J., Tseng, M. M., Ma, Q., and Zou, Y. (2000). Generic bill-of-materials-and-operations for high-variety production management. Concurrent Engineering, 8(4):297-321.
  9. Khedri, N. and Khosravi, R. (2013). Handling database schema variability in software product lines. In To appear: The 20th Asia-Pacific Software Engineering Conference, APSEC 2013.
  10. Mun˜oz, L., Maz ón, J.-N., and Trujillo, J. (2009). Automatic generation of etl processes from conceptual models. In Proceedings of the ACM twelfth international workshop on Data warehousing and OLAP, DOLAP 7809, pages 33-40, New York, NY, USA. ACM.
  11. Skoutas, D. and Simitsis, A. (2006). Designing etl processes using semantic web technologies. In Proceedings of the 9th ACM international workshop on Data warehousing and OLAP, DOLAP 7806, pages 67-74, New York, NY, USA. ACM.
  12. Stumptner, R., Freudenthaler, B., and Krenn, M. (2012). Bia accelerator - a template-based approach for rapid etl development. In ISMIS'2012, Foundations of Intelligent Systems. Springer.
Download


Paper Citation


in Harvard Style

Lettner C. and Zwick M. (2014). A Data Analysis Framework for High-variety Product Lines in the Industrial Manufacturing Domain . In Proceedings of the 16th International Conference on Enterprise Information Systems - Volume 1: ICEIS, ISBN 978-989-758-027-7, pages 209-216. DOI: 10.5220/0004887802090216


in Bibtex Style

@conference{iceis14,
author={Christian Lettner and Michael Zwick},
title={A Data Analysis Framework for High-variety Product Lines in the Industrial Manufacturing Domain},
booktitle={Proceedings of the 16th International Conference on Enterprise Information Systems - Volume 1: ICEIS,},
year={2014},
pages={209-216},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004887802090216},
isbn={978-989-758-027-7},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 16th International Conference on Enterprise Information Systems - Volume 1: ICEIS,
TI - A Data Analysis Framework for High-variety Product Lines in the Industrial Manufacturing Domain
SN - 978-989-758-027-7
AU - Lettner C.
AU - Zwick M.
PY - 2014
SP - 209
EP - 216
DO - 10.5220/0004887802090216