ETL for a relational implementation is very time
confusing and a slow process.
Therefore, we propose to use a NoSQL database.
After all the information needs of the DW to-be have
been identified by the requirements engineering
phase, we propose mapping rules to take us to the
logical model of NoSQL databases. For this, the
information model is examined. Our preliminary
work is for Cassandra, a column oriented database.
Once the mapping is complete, OLAP operations
can be performed.
Future work includes:
a) Defining the way in which OLAP operations will
be implemented in Cassandra
b) Applying our mapping rules to a real-world
example and evaluating our rules
c) Developing mapping rules for a document store.
We have selected Mongodb as the database.
d) Developing mapping rules for XML databases.
REFERENCES
Boehnlein, M. and Ulbrich-vom Ende, A., 1999,
November. Deriving initial data warehouse structures
from the conceptual data models of the underlying
operational information systems. In Proceedings of the
2nd ACM international workshop on Data
warehousing and OLAP (pp. 15-21). ACM.
Böhnlein, M. and Ulbrich-vom Ende, A., 2000. Business
process oriented development of data warehouse
structures. In Data Warehousing 2000 (pp. 3-21).
Physica, Heidelberg.
Bonifati, A., Cattaneo, F., Ceri, S., Fuggetta, A. and
Paraboschi, S., 2001. Designing data marts for data
warehouses. ACM transactions on software
engineering and methodology, 10(4), pp.452-483.
Chevalier, M., El Malki, M., Kopliku, A., Teste, O. and
Tournier, R., 2015, April. How can we implement a
Multidimensional Data Warehouse using NoSQL?. In
International Conference on Enterprise Information
Systems (pp. 108-130). Springer, Cham.
Corr, L. and Stagnitto, J., 2011. Agile data warehouse
design: Collaborative dimensional modeling, from
whiteboard to star schema. DecisionOne Consulting.
Dehdouh, K., Bentayeb, F., Boussaid, O. and Kabachi, N.,
2015, January. Using the column oriented NoSQL
model for implementing big data warehouses.
In Proceedings of the International Conference on
Parallel and Distributed Processing Techniques and
Applications (PDPTA) (p. 469). The Steering
Committee of The World Congress in Computer
Science, Computer Engineering and Applied
Computing (WorldComp).
Duda, J., 2012. Business intelligence and NoSQL
databases. Information Systems in Management, 1(1),
pp.25-37.
Giorgini, P., Rizzi, S. and Garzetti, M., A Goal-Oriented
Approach to Requirement Analysis in Data
Warehouses. Decision Support Systems (DSS) jounal,
Elsevier, pp.4-21.
Inmon, W.H., 1995. What is a data warehouse?. Prism
Tech Topic, 1(1).
Inmon, W.H., Strauss, D. and Neushloss, G., 2010. DW
2.0: The architecture for the next generation of data
warehousing. Elsevier.
Kimball, R., 1996. The data warehouse toolkit: practical
techniques for building dimensional data warehouses
(Vol. 1). New York: John Wiley & Sons.
Prakash, D. and Prakash, N., 2019. A multifactor approach
for elicitation of Information requirements of data
warehouses. Requirements Engineering, 24(1),
pp.103-117.
Prakash, N. and Gosain, A., 2003, June. Requirements
Driven Data Warehouse Development. In CAiSE Short
Paper Proceedings (Vol. 252).
Prakash, D., 2016. Eliciting Information Requirements for
DW Systems. In CAiSE (Doctoral Consortium).
Prakash, D., 2018, September. Direct Conversion of Early
Information to Multi-dimensional Model.
In International Conference on Database and Expert
Systems Applications (pp. 119-126). Springer, Cham.
Prakash, N., Singh, Y. and Gosain, A., 2004, November.
Informational scenarios for data warehouse
requirements elicitation. In International Conference
on Conceptual Modeling (pp. 205-216). Springer,
Berlin, Heidelberg.
Santos, M.Y., Martinho, B. and Costa, C., 2017.
Modelling and implementing big data warehouses for
decision support. Journal of Management Analytics,
4(2), pp.111-129.
Stonebraker, M., 2012. New opportunities for new sql.
Commun. ACM, 55(11), pp.10-11.