Design and Implementation of the CloudMdsQL Multistore System
Boyan Kolev, Carlyna Bondiombouy, Oleksandra Levchenko, Patrick Valduriez, Ricardo Jimenez-Peris, Raquel Pau, José Pereira
2016
Abstract
The blooming of different cloud data management infrastructures has turned multistore systems to a major topic in the nowadays cloud landscape. In this paper, we give an overview of the design of a Cloud Multidatastore Query Language (CloudMdsQL), and the implementation of its query engine. CloudMdsQL is a functional SQL-like language, capable of querying multiple heterogeneous data stores (relational, NoSQL, HDFS) within a single query that can contain embedded invocations to each data store’s native query interface. The major innovation is that a CloudMdsQL query can exploit the full power of local data stores, by simply allowing some local data store native queries (e.g. a breadth-first search query against a graph database) to be called as functions, and at the same time be optimized.
References
- Armbrust, M., Xin, R., Lian, C., Huai, Y., Liu, D., Bradley, J., Meng, X., Kaftan, T., Franklin, M., Ghodsi, A., Zaharia, M. 2015. Spark SQL: Relational Data Processing in Spark. In ACM SIGMOD (2015), 1383-1394.
- Bondiombouy, C., Kolev, B., Levchenko, O., Valduriez, P. 2015. Integrating Big Data and Relational Data with a Functional SQL-like Query Language. Int. Conf. on Databases and Expert Systems Applications (DEXA) (2015), 170-185.
- CoherentPaaS, http://coherentpaas.eu (2013).
- DeWitt, D., Halverson, A., Nehme, R., Shankar, S., Aguilar-Saborit J., Avanes, A., Flasza, M., Gramling, J. 2013. Split Query Processing in Polybase. In ACM SIGMOD (2013), 1255-1266.
- Duggan, J., Elmore, A. J., Stonebraker, M., Balazinska, M., Howe, B., Kepner, J., Madden, S., Maier, D., Mattson, T., Zdonik, S. 2015. The BigDAWG Polystore System. SIGMOD Rec. 44, 2 (August 2015), 11-16.
- Kolev, B., Valduriez, P., Bondiombouy, C., Jiménez-Peris, R., Pau, R., Pereira, J. 2015. CloudMdsQL: Querying Heterogeneous Cloud Data Stores with a Common Language. Distributed and Parallel Databases, pp 1- 41, http://hal-lirmm.ccsd.cnrs.fr/lirmm-01184016.
- LeFevre, J., Sankaranarayanan, J., Hacigümüs, H., Tatemura, J., Polyzotis, N., Carey, M. 2014. MISO: Souping Up Big Data Query Processing with a Multistore System. In ACM SIGMOD (2014), 1591- 1602.
- Özsu, T., Valduriez, P. 2011. Principles of Distributed Database Systems - Third Edition. Springer, 850 pages.
Paper Citation
in Harvard Style
Kolev B., Bondiombouy C., Levchenko O., Valduriez P., Jimenez-Peris R., Pau R. and Pereira J. (2016). Design and Implementation of the CloudMdsQL Multistore System . In Proceedings of the 6th International Conference on Cloud Computing and Services Science - Volume 1: DataDiversityConvergence, (CLOSER 2016) ISBN 978-989-758-182-3, pages 352-359. DOI: 10.5220/0005923803520359
in Bibtex Style
@conference{datadiversityconvergence16,
author={Boyan Kolev and Carlyna Bondiombouy and Oleksandra Levchenko and Patrick Valduriez and Ricardo Jimenez-Peris and Raquel Pau and José Pereira},
title={Design and Implementation of the CloudMdsQL Multistore System},
booktitle={Proceedings of the 6th International Conference on Cloud Computing and Services Science - Volume 1: DataDiversityConvergence, (CLOSER 2016)},
year={2016},
pages={352-359},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005923803520359},
isbn={978-989-758-182-3},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 6th International Conference on Cloud Computing and Services Science - Volume 1: DataDiversityConvergence, (CLOSER 2016)
TI - Design and Implementation of the CloudMdsQL Multistore System
SN - 978-989-758-182-3
AU - Kolev B.
AU - Bondiombouy C.
AU - Levchenko O.
AU - Valduriez P.
AU - Jimenez-Peris R.
AU - Pau R.
AU - Pereira J.
PY - 2016
SP - 352
EP - 359
DO - 10.5220/0005923803520359