• Clustrix
18
ClustrixDB is a distributed database. Latest ver-
sion (6.0) brings many new capabilities and per-
formance improvements, with specific optimiza-
tions for Magento-based and custom e-commerce
implementations. It is able to specify that a
copy of the table resides on every node, max-
imizing performance in certain workloads. It
combines automatic data distribution to maximize
parallelism, and the ability to override that for
highly accessed, highly-joined tables (for exam-
ple, metadata, etc.). Optimized Scheduler priori-
tizes critical OLTP transactions when heavy long-
running analytics queries are also running.
• VoltDB
19
VoltDBs in-memory architecture is de-
signed for performance. It eliminates the sig-
nificant overhead of multi-threading and lock-
ing responsible for the poor performance of tra-
ditional RDBMSs that rely on disks. VoltDB
was designed for High Availability from the
ground up. VoltDB’s supports virtualized and
cloud infrastructures and combines the rich-
ness and flexibility of SQL for data interaction
with a modern, distributed, fault-tolerant, cloud-
deployable clustered architecture while maintain-
ing the ACID guarantees of a traditional database
system. VoltDB supports the JSON data type
and several client access methods including stored
procedures, JDBC and ad hoc queries. Further-
more, VoltDB supports a wide range of integra-
tions including JDBC (Java Database Connectiv-
ity) and ODBC (Open Database Connectivity) for
data exchange. Operating System: OS Indepen-
dent.
4 CONCLUSIONS
This work has provided a first evaluation of the most
spread solutions existing in the Big Data landscape.
As shown in the previous sections, a great number
of solutions are open-source projects demonstrating
the great interest that the community of developers
has in such topics. At the same time, the work has
highlighted the flexibility of the most part of tools
that are generally multi-platformor programming lan-
guage agnostic as they are provided with HTTP Rest-
full APIs which allow clients to easily access them.
In other cases, the great availability of APIs writ-
ten in the most popular programming languages (in
most cases developed by third parties as depending
18
http://www.clustrix.com/
19
http://voltdb.com/
or separate projects) contribute yet to ease the inter-
operability between the client tools and the back-end
store database. Future works can be directed to dif-
ferent objectives. On the one hand, it can be im-
proved the evaluation framework by adding other cri-
teria not yet considered in this work, such as those re-
lated to security, scalability, and quantitative analysis
performed by authoritative groups like YCSB lab. On
the other hand, new but complementary study can be
approached by surveying the technological solutions
existing to deal with the other challenges of Big Data,
such as: analytics, heterogeneity, timeliness, aggrega-
tion and transfer and finally visualization.
REFERENCES
Albanese, M., Capasso, P., Picariello, A., and Rinaldi,
A. M. (2005). Information retrieval from the web:
an interactive paradigm. In Advances in Multimedia
Information Systems, pages 17–32. Springer.
Alnafoosi, A. B. and Steinbach, T. (2013). An integrated
framework for evaluating big-data storage solutions-
ida case study. In Science and Information Conference
(SAI), 2013, pages 947–956. IEEE.
Amato, F., De Santo, A., Gargiulo, F., Moscato, V., Persia,
F., Picariello, A., and Poccia, S. (2015a). Semindex:
an index for supporting semantic retrieval of docu-
ments. In Proceedings of the IEEE DESWeb ICDE
2015.
Amato, F., De Santo, A., Moscato, V., Persia, F., Picariello,
A., and Poccia, S. (2015b). Partitioning of ontolo-
gies driven by a structure-based approach. In Semantic
Computing (ICSC), 2015 IEEE International Confer-
ence on, pages 320–323.
Bhanu, S. (2013). Companies adopting big data analytics to
deal with challenges. The Economic Times.
Caldarola, E. G., Picariello, A., and Castelluccia, D. (2015).
Modern enterprises in the bubble: Why big data mat-
ters. ACM SIGSOFT Software Engineering Notes,
40(1):1–4.
Caldarola, E. G., Sacco, M., and Terkaj, W. (2014). Big
data: The current wave front of the tsunami. ACS Ap-
plied Computer Science, 10(4):7–18.
Cattell, R. (2011). Scalable sql and nosql data stores. ACM
SIGMOD Record, 39(4):12–27.
Chang, F., Dean, J., Ghemawat, S., Hsieh, W. C., Wallach,
D. A., Burrows, M., Chandra, T., Fikes, A., and Gru-
ber, R. E. (2008). Bigtable: A distributed storage sys-
tem for structured data. ACM Transactions on Com-
puter Systems (TOCS), 26(2):4.
Chen, M., Mao, S., and Liu, Y. (2014). Big data: A survey.
Mobile Networks and Applications, 19(2):171–209.
Cooper, B. F., Silberstein, A., Tam, E., Ramakrishnan, R.,
and Sears, R. (2010). Benchmarking cloud serving
systems with ycsb. In Proceedings of the 1st ACM
Symposium on Cloud Computing, SoCC ’10, pages
143–154, New York, NY, USA. ACM.
BigData:ASurvey-TheNewParadigms,MethodologiesandTools
369