Operational QVT plugin provided within EMF to
implement the QVT rules. Finally, we tested the
transformation by running the QVT script. The
execution of this script transforms the UML class
diagram (figure 6) into the NoSQL table
corresponding to the logical model (figure 7).
5.3 Logical PIM to PSMs
Transformation
Choosing column-oriented model at logical level
does not imply a specific target platform.
Consequently, several implementation platforms
could be used. In this paper, we choose to consider
two PSMs that correspond to HBase and Cassandra
platforms.
Cassandra PSM
Cassandra is a columns-oriented database. It consists
of one data container named Keyspace. The
Keyspace is associated to a set of columns families;
each column-family is identified by a PrimaryKey
and contains a set of columns that must be declared
up front at schema definition time. We note that the
concepts of “Table” and “Row-Key” used at the
logical level will be replaced respectively by
“Keyspace” and ‘PrimaryKey”.
HBase PSM
HBase is a column-oriented database built on top of
Hadoop (Grover, 2015). HBase database consists of
one table named HTable. The HTable is associated
with a set of columns families that must be declared
up front at schema definition time, whereas columns
do not need to be defined at schema time but can be
conjured on the fly. Each row in HTable is identified
by a RowKey.
Based on Cassandra PSM and HBase PSM, we
have created manually Cassandra and HBase
databases.
6 CONCLUSION AND
PERSPECTIVES
In this paper we have presented a MDA-based
approach to implement UML conceptual model
describing Big Data in column-oriented NoSQL
systems. Our approach consists of a chain of
transformations that generate a column-oriented
logical model independent of a particular NoSQL
platform; this independence makes it easier to
implement conceptual models into several column-
oriented databases such as Cassandra and HBase
regardless of their specific technical details.
As future work, we plan to complete our
transformation process and propose a mapping for
OCL expressions defined in the conceptual model;
queries languages provided by NoSQL databases
(such as CQL, HiveQL …) could be used for this.
Another ongoing work concerns the validation of the
proposed transformation process on scientific
medical applications.
REFERENCES
Angadi, A. B., Angadi, A. B., Gull, K. C., 2013. Growth
of New Databases & Analysis of NOSQL Datastores.
International Journal of Advanced Research in
Computer Science and Software Engineering.
Cattell, R., 2011. Scalable SQL and NoSQL data stores.
Acm Sigmod Record.
Abelló, A., 2015. Big data design. In Proceedings of the
ACM DOLAP.
Li, C., 2010. Transforming relational database into HBase:
A case study. In IEEE ICSESS.
Chen, C. P., Zhang, C. Y., 2014. Data-intensive
applications, challenges, techniques and technologies:
A survey on Big Data. Information Sciences.
Bézivin, J., Gerbé, O., 2001. Towards a precise definition
of the OMG/MDA framework. In ASE.
Chevalier, M., El Malki, M., Kopliku, A., Teste, O.,
Tournier, R., 2015. Implementing multidimensional
data warehouses into NoSQL. In ICEIS.
Abadi, D. J., Madden, S. R., Hachem, N., 2008. Column-
stores vs. row-stores: How different are they really?.
In Proceedings of the ACM SIGMOD.
Li, Y., Gu, P., Zhang, C., 2014. Transforming UML class
diagrams into HBase based on meta-model. In ISEEE.
Grover, M., Malaska, T., Seidman, J., Shapira, G., 2015.
Hadoop application architectures. O'Reilly Media.
Abadi, D., Boncz, P., Harizopoulos, S., Idreos, S.,
Madden, S., 2013. The design and implementation of
modern column-oriented database systems. Now.
Fredriksson, C., 2015. knowledge management with big
data creating new possibilities for organizations. In
NORKOM.
Daniel, G., Sunyé, G., Cabot, J., 2016. UMLtoGraphDB:
Mapping Conceptual Schemas to Graph Databases. In
ER.
Vajk, T., Fehér, P., Fekete, K., Charaf, H., 2013.
Denormalizing data into schema-free databases. In
CogInfoCom, IEEE
OMG, 2011. MOF 2.0 QVT Specification.