The longest-running queries access the TPC-H tables
with the largest number of rows (Lineitem, Orders,
Customer, Supplier). It is understandable that
execution time is longer when a query has to look up
information in the tables with the largest number of
rows. We also observe that queries with aggregations
have the slowest execution times.
The experimental results show that MemSQL stands
out, with a 92% improvement over VoltDB. As
mentioned in Section 3.1, when MemSQL receives a
new query, it checks whether compiled code already
exists for the hash of that query. If so, it reuses the
previously compiled code and simply passes the new
parameters to it, which significantly reduces the
processing time. Due to this distinctive feature, the
query execution times of MemSQL are lower than
those of VoltDB.
In summary, both NewSQL engines return query
results quickly, but MemSQL outperformed VoltDB in
all the queries, as shown in Figure 5. VoltDB, being
an in-memory system, makes considerable use of the
computer's memory, which consequently led to
longer execution times.
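The plan-reuse behaviour described above can be
illustrated with a minimal sketch. It is not MemSQL's
implementation: the names `normalize` and `PlanCache`,
the regular expression used to strip literals, and the
stand-in "compilation" step are all assumptions made
for illustration. The sketch only shows the idea of
hashing a parameterized query, compiling it once, and
passing new parameters to the cached code.

```python
import hashlib
import re

# Matches quoted string literals first, then standalone numbers.
_LITERAL = re.compile(r"'[^']*'|\b\d+(?:\.\d+)?\b")


def normalize(sql):
    """Replace literals with '?' and return (template, parameters)."""
    params = []

    def capture(match):
        params.append(match.group(0))
        return "?"

    return _LITERAL.sub(capture, sql), params


class PlanCache:
    """Hypothetical cache of compiled plans keyed by the query hash."""

    def __init__(self):
        self._plans = {}
        self.compilations = 0  # counts how often the costly step runs

    def _compile(self, template):
        # Stand-in for real code generation: returns a callable "plan".
        self.compilations += 1
        return lambda params: f"plan[{template}] executed with {params}"

    def execute(self, sql):
        template, params = normalize(sql)
        key = hashlib.sha256(template.encode("utf-8")).hexdigest()
        plan = self._plans.get(key)
        if plan is None:  # first time this query shape is seen
            plan = self._compile(template)
            self._plans[key] = plan
        return plan(params)  # reuse: only the parameters change


cache = PlanCache()
first = cache.execute("SELECT * FROM orders WHERE o_orderkey = 1")
second = cache.execute("SELECT * FROM orders WHERE o_orderkey = 42")
```

Both calls share the same template, so the stand-in
compilation step runs only once and the second call
pays only for parameter passing.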
6 CONCLUSIONS AND FUTURE WORK
Today, information is vital for organizations, which
have multiple data sources and systems to store them.
However, the massive quantity of data inserted into
databases causes poor query performance and hampers
data analysis. These problems highlight the advantages
of NewSQL databases, which provide increased
throughput and improved performance while also
addressing storage problems. NewSQL databases are
designed to be scalable, to support large amounts of
data, and to remain efficient, which also fixes some
flaws that exist in other database systems.
During the experiments, only Q19 could not be
executed. This suggests that NewSQL systems make
considerable use of primary memory to execute
queries. To obtain good performance, it is necessary
to have a computer with good processing and storage
capacity, which contributes to better results for
NewSQL databases.
We can conclude that the MemSQL engine performed
better than VoltDB for 1GB of data. Moreover,
MemSQL uses standard SQL, so queries do not need to
be rewritten, while in VoltDB it is necessary to
rewrite the queries. For example, VoltDB does not
support the date type, but only the timestamp type.
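As an illustration of the kind of rewriting mentioned
above, the following minimal sketch replaces the DATE
column type with TIMESTAMP in a table definition. The
helper name `date_columns_to_timestamp` and the sample
DDL are hypothetical, and the sketch assumes the
keyword DATE appears only as a column type. Any further
changes that the queries themselves may need in VoltDB
are not covered here.

```python
import re

# Whole-word match, so column names such as o_orderdate are untouched.
_DATE_TYPE = re.compile(r"\bDATE\b", re.IGNORECASE)


def date_columns_to_timestamp(ddl):
    """Return the DDL with every DATE type keyword replaced by TIMESTAMP."""
    return _DATE_TYPE.sub("TIMESTAMP", ddl)


# Hypothetical fragment of a TPC-H table definition.
original_ddl = (
    "CREATE TABLE orders ("
    "o_orderkey INTEGER NOT NULL, "
    "o_orderdate DATE NOT NULL);"
)
rewritten_ddl = date_columns_to_timestamp(original_ddl)
```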
As future work, we intend to evaluate other
NewSQL database engines and to compare their
performance with traditional relational databases,
such as MySQL or PostgreSQL. We also intend to
increase the scale factor of TPC-H, using database
sizes of 10GB and more.
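To give a rough idea of what a larger scale factor
implies, the sketch below estimates the number of rows
per table. The base cardinalities are those defined by
the TPC-H specification for scale factor 1 (about 1GB),
with Lineitem rounded to an approximate value. The
function name `estimated_rows` is hypothetical, and the
figures are estimates rather than measurements from
this work.

```python
# Rows at scale factor 1, as defined by the TPC-H specification.
BASE_ROWS_AT_SF1 = {
    "lineitem": 6_000_000,  # approximate; the exact count varies slightly
    "orders": 1_500_000,
    "partsupp": 800_000,
    "part": 200_000,
    "customer": 150_000,
    "supplier": 10_000,
}

# These two tables have a fixed size regardless of the scale factor.
FIXED_ROWS = {"nation": 25, "region": 5}


def estimated_rows(scale_factor):
    """Estimate the row count of each TPC-H table for a scale factor."""
    rows = {
        table: int(base * scale_factor)
        for table, base in BASE_ROWS_AT_SF1.items()
    }
    rows.update(FIXED_ROWS)
    return rows


rows_at_10gb = estimated_rows(10)
```

At a scale factor of 10, for example, Orders grows to
about 15 million rows and Lineitem to about 60 million,
while Nation and Region keep their fixed sizes.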