data. To benefit from this new technology, we
proposed, in this paper, an approach to build a
NoSQL Document Warehouse. More accurately, we
transform the multidimensional galaxy model of the
DocW into the column-oriented NoSQL model. From
NoSQL models, we elected the column-oriented
model because its performance has been proven in the
literature works.
To build a NoSQL DocW, we distinguish two
transformation types: simple and hierarchical. The
first transformation converts the concepts of the
galaxy model into a NoSQL model without detailing
the hierarchy concept. For this transformation, we
define a set of four rules. While, the hierarchical
transformation explains the hierarchy concept when
transforming the multidimensional model into a
column-oriented NoSQL model. It retains three rules
from the simple transformation and defines one
specific rule for this transformation.
To substantiate these rules, we use the NoSQL
database management system Cassandra and
Cassandra Query Language (CQL) to apply the
simple and hierarchical transformation rules. We
obtain respectively DocW-S
and DocW-H. Moreover,
the evaluation of the obtained NoSQL DocW in terms
of the two metrics Write Request Latency and Read
Request Latency on a medical collection shows that
the DocW-H is better than the DocW-S.
As a future work, we will propose rules to
transform the multidimensional model of the
document warehouse into the document-oriented
NoSQL model and compare the performance of the
two NoSQL DocWs: column-oriented DocW and
document-oriented DocW. In addition, we expect
define a set of analytical operations dedicated to the
galaxy model of the NoSQL DocW.
REFERENCES
Agrawal, D., Das, S., El Abbadi, A., 2011. Big data and
cloud computing: current state and future opportunities.
In EDBT/ICDT’11, 14
th
International Conference on
Extending Database Technology, pp. 530–533.
Ben Mefteh S., Khrouf K., Feki J., Ben Kraiem M., Soule-
Dupuy C., 2016. A Semantic Approach for XML
Document Warehousing and OLAP Analysis. In
IJIDS’16, International Journal of Information &
Decision Sciences, vol.8, n°.3, pp. 254-283,
DOI: 10.1504/IJIDS.2016.078587.
Ben Messaoud, I., Feki, J., Zurfluh, G., 2015. A Semi-
automatic Approach to Build XML Document
Warehouse, in CCIS’15, Communications in Computer
and Information Science, Springer International
Publishing Switzerland 2015, A. Fred et al. (Eds.), pp.
347–363.
Chandawni, G., 2016. NOSQL DATA-WAREHOUSE. In
IJIRCCE’16, International Journal of Innovative
Research in Computer and Communication
Engineering, Vol. 4, Special Issue 4, pp. 96-104.
Chevalier, M., Elmalki, M., Kupliku, A., Teste, O.,
Tournier, R., 2015a. Entrepôts de données
multidimensionnelles NoSQL. In EDA’15, 11
ème
Journées francophones sur les Entrepôts de Données et
l’Analyse en ligne, p.161-176
Chevalier, M., El Malki, M., Kopliku, A., Teste, O.,
Tournier, R., 2015b. Implementing multidimensional
data warehouses into NoSQL. In ICEIS’15, 17
th
International Conference on Enterprise Information
Systems, pp.108-130.
Chevalier, M., El Malki, M., Kopliku, A., Teste, O.,
Tournier, R., 2015c. Implementation of
Multidimensional Databases in Column-Oriented
NoSQL Systems. In ADMIS’15, 19th East-European
Conference on Advances in Databases and Information
Systems, pp. 79-91.
Chevalier, M., El Malki, Teste, O., Tournier, R., 2015d.
Implementation of Multidimensional Databases with
Document-Oriented NoSQL. In DAWAK’15, 17th
International Conference on Big Data Analytics and
Knowledge Discovery, pp. 379-390.
Dehdouh, K., Bentayeb, F., Boussaid, O., 2014. Columnar
NoSQL Star Schema Benchmark. In MEDI’14, 4
th
Model and Data Engineering, pp. 281-288.
Dehdouh, K., Bentayeb, F., Boussaid, O., Kabachi, N.,
2015. Using the column oriented NoSQL model for
implementing big data warehouses. In PDPTA'15, 21
st
International Conference on Parallel and Distributed
Processing Techniques and Applications, pp.469-475.
Favre, C., Bentayeb, F., Boussaid, O., Darmont, J., Gavin,
G., Harbi, N., Kabachi, N., Loudcher, S., 2013. Les
entrepôts de données pour les nuls ou pas !. In EGC’13,
13
th
conference francophone sur l’extraction et la
gestion de connaissance, pp. 1-18.
Feki, J., Ben Messaoud, I., Zurfluh, G., 2013. Building an
XML Document Warehouse. In JDS, Journal of
Decision Systems, Vol. 22 No. 2. pp. 122-148, DOI:
10.1080/12460125.2013.780322.
Jacobs, A., 2009. The pathologies of big data. In
Communications of the ACM 52(8), pp. 36–44.
Lemberger, P., Batty, M., Morel, M., Rafaelli, JL, 2015.
Big Data et Machine Learning, Dunod. 1
st
edition,
ISBN: 978-2-10-072074-3.
Li, C., 2010. Transforming relational database into HBase:
A case study. In ICSESS’10, International Conference
on Software Engineering and Service Sciences, pp.
683–687.
McCabe, C., Lee, J., Chowdhury, A., Grossman, D.,
Frieder, O., 2000. On the design and evaluation of a
multi-dimensional approach to information retrieval. In
SIGIR’00, 23rd International Conference on Research
and Development in Information Retrieval, pp. 363-
365.